Command line
python /home/saxelrod/Repo/projects/chemprop/chemprop/train.py --config_path /home/saxelrod/chemprop_cov_1/models/cp_cov_bce/config.json --data_path /home/saxelrod/chemprop_cov_1/train_full.csv --dataset_type classification
Args
{'activation': 'ReLU',
 'aggregation': 'mean',
 'aggregation_norm': 100,
 'atom_descriptors': None,
 'atom_descriptors_path': None,
 'atom_descriptors_size': 0,
 'atom_features_size': 0,
 'atom_messages': False,
 'batch_size': 50,
 'bias': False,
 'cache_cutoff': 10000,
 'checkpoint_dir': None,
 'checkpoint_path': None,
 'checkpoint_paths': None,
 'class_balance': True,
 'config_path': '/home/saxelrod/chemprop_cov_1/models/cp_cov_bce/config.json',
 'crossval_index_dir': None,
 'crossval_index_file': None,
 'crossval_index_sets': None,
 'cuda': True,
 'data_path': '/home/saxelrod/chemprop_cov_1/train_full.csv',
 'dataset_type': 'classification',
 'depth': 6,
 'device': device(type='cuda', index=0),
 'dropout': 0.2,
 'ensemble_size': 1,
 'epochs': 10000,
 'extra_metrics': [],
 'features_generator': None,
 'features_only': False,
 'features_path': None,
 'features_scaling': False,
 'features_size': None,
 'ffn_hidden_size': 2300,
 'ffn_num_layers': 3,
 'final_lr': 0.0001,
 'folds_file': None,
 'gpu': 0,
 'grad_clip': None,
 'hidden_size': 2300,
 'ignore_columns': None,
 'init_lr': 0.0001,
 'log_frequency': 10,
 'max_data_size': None,
 'max_lr': 0.001,
 'metric': 'binary_cross_entropy',
 'metrics': ['binary_cross_entropy'],
 'minimize_score': True,
 'mpn_shared': False,
 'multiclass_num_classes': 3,
 'no_cache_mol': False,
 'no_cuda': False,
 'no_features_scaling': True,
 'num_folds': 1,
 'num_lrs': 1,
 'num_tasks': 1,
 'num_workers': 8,
 'number_of_molecules': 1,
 'pytorch_seed': 0,
 'quiet': True,
 'save_dir': '/home/saxelrod/chemprop_cov_1/models/cp_cov_bce',
 'save_preds': False,
 'save_smiles_splits': False,
 'seed': 0,
 'separate_test_features_path': None,
 'separate_test_path': '/home/saxelrod/chemprop_cov_1/test_full.csv',
 'separate_val_features_path': None,
 'separate_val_path': '/home/saxelrod/chemprop_cov_1/val_full.csv',
 'show_individual_scores': False,
 'smiles_columns': [None],
 'split_sizes': (0.8, 0.1, 0.1),
 'split_type': 'random',
 'target_columns': None,
 'task_names': ['sars_cov_one_cl_protease_active'],
 'test': False,
 'test_fold_index': None,
 'train_data_size': None,
 'undirected': False,
 'use_input_features': False,
 'val_fold_index': None,
 'warmup_epochs': 2.0}
Loading data
Number of tasks = 1
Fold 0
Splitting data with seed 0
Class sizes
sars_cov_one_cl_protease_active 0: 99.84%, 1: 0.16%
Total size = 167,255 | train size = 167,255 | val size = 55,750 | test size = 55,756
With class_balance, effective train size = 550
Building model 0
MoleculeModel(
  (sigmoid): Sigmoid()
  (encoder): MPN(
    (encoder): ModuleList(
      (0): MPNEncoder(
        (dropout_layer): Dropout(p=0.2, inplace=False)
        (act_func): ReLU()
        (W_i): Linear(in_features=147, out_features=2300, bias=False)
        (W_h): Linear(in_features=2300, out_features=2300, bias=False)
        (W_o): Linear(in_features=2433, out_features=2300, bias=True)
      )
    )
  )
  (ffn): Sequential(
    (0): Dropout(p=0.2, inplace=False)
    (1): Linear(in_features=2300, out_features=2300, bias=True)
    (2): ReLU()
    (3): Dropout(p=0.2, inplace=False)
    (4): Linear(in_features=2300, out_features=2300, bias=True)
    (5): ReLU()
    (6): Dropout(p=0.2, inplace=False)
    (7): Linear(in_features=2300, out_features=1, bias=True)
  )
)
Number of parameters = 21,813,201
Moving model to cuda
Epoch 0
Loss = 6.9865e-01, PNorm = 97.7173, GNorm = 1.3198, lr_0 = 1.0148e-04
Validation binary_cross_entropy = 0.697022
Epoch 1
Loss = 6.7343e-01, PNorm = 97.7294, GNorm = 1.4953, lr_0 = 1.0283e-04
Validation binary_cross_entropy = 0.846888
Epoch 2
Loss = 6.7120e-01, PNorm = 97.7397, GNorm = 1.7226, lr_0 = 1.0417e-04
Validation binary_cross_entropy = 0.833405
Epoch 3
Loss = 6.5658e-01, PNorm = 97.7482, GNorm = 3.1685, lr_0 = 1.0552e-04
Validation binary_cross_entropy = 0.854527
Epoch 4
Loss = 6.5856e-01, PNorm = 97.7562, GNorm = 1.1620, lr_0 = 1.0686e-04
Validation binary_cross_entropy = 0.542713
Epoch 5
Loss = 6.6444e-01, PNorm = 97.7639, GNorm = 1.0403, lr_0 = 1.0821e-04
Validation binary_cross_entropy = 0.476516
Epoch 6
Loss = 6.2234e-01, PNorm = 97.7718, GNorm = 1.0426, lr_0 = 1.0955e-04
Validation binary_cross_entropy = 0.644991
Epoch 7
Loss = 6.4125e-01, PNorm = 97.7797, GNorm = 2.7917, lr_0 = 1.1090e-04
Validation binary_cross_entropy = 0.516473
Epoch 8
Loss = 6.4542e-01, PNorm = 97.7866, GNorm = 0.6450, lr_0 = 1.1224e-04
Validation binary_cross_entropy = 0.658575
Epoch 9
Loss = 4.9185e-01, PNorm = 97.7961, GNorm = 0.8170, lr_0 = 1.1359e-04
Loss = 6.3766e-01, PNorm = 97.8045, GNorm = 0.7448, lr_0 = 1.1493e-04
Validation binary_cross_entropy = 0.729551
Epoch 10
Loss = 6.1780e-01, PNorm = 97.8120, GNorm = 1.1499, lr_0 = 1.1628e-04
Validation binary_cross_entropy = 0.433498
Epoch 11
Loss = 5.9830e-01, PNorm = 97.8210, GNorm = 1.9870, lr_0 = 1.1762e-04
Validation binary_cross_entropy = 0.822883
Epoch 12
Loss = 6.2045e-01, PNorm = 97.8297, GNorm = 1.3574, lr_0 = 1.1897e-04
Validation binary_cross_entropy = 0.630560
Epoch 13
Loss = 6.0522e-01, PNorm = 97.8390, GNorm = 0.5869, lr_0 = 1.2031e-04
Validation binary_cross_entropy = 0.470142
Epoch 14
Loss = 5.9082e-01, PNorm = 97.8496, GNorm = 1.4696, lr_0 = 1.2166e-04
Validation binary_cross_entropy = 0.596153
Epoch 15
Loss = 5.9875e-01, PNorm = 97.8611, GNorm = 2.0216, lr_0 = 1.2300e-04
Validation binary_cross_entropy = 0.358614
Epoch 16
Loss = 5.6808e-01, PNorm = 97.8745, GNorm = 3.1456, lr_0 = 1.2435e-04
Validation binary_cross_entropy = 0.424700
Epoch 17
Loss = 5.5084e-01, PNorm = 97.8876, GNorm = 1.8831, lr_0 = 1.2570e-04
Validation binary_cross_entropy = 0.711069
Epoch 18
Loss = 5.5468e-01, PNorm = 97.9007, GNorm = 1.9539, lr_0 = 1.2704e-04
Validation binary_cross_entropy = 0.375985
Epoch 19
Loss = 5.9953e-01, PNorm = 97.9163, GNorm = 1.2267, lr_0 = 1.2839e-04
Loss = 5.7175e-01, PNorm = 97.9314, GNorm = 1.6913, lr_0 = 1.2973e-04
Validation binary_cross_entropy = 0.621234
Epoch 20
Loss = 5.6402e-01, PNorm = 97.9495, GNorm = 2.3855, lr_0 = 1.3108e-04
Validation binary_cross_entropy = 0.603089
Epoch 21
Loss = 5.8419e-01, PNorm = 97.9656, GNorm = 1.8643, lr_0 = 1.3242e-04
Validation binary_cross_entropy = 0.446110
Epoch 22
Loss = 5.3762e-01, PNorm = 97.9832, GNorm = 1.8370, lr_0 = 1.3377e-04
Validation binary_cross_entropy = 0.477009
Epoch 23
Loss = 5.5351e-01, PNorm = 98.0027, GNorm = 1.5105, lr_0 = 1.3511e-04
Validation binary_cross_entropy = 0.374966
Epoch 24
Loss = 4.8738e-01, PNorm = 98.0214, GNorm = 1.0433, lr_0 = 1.3646e-04
Validation binary_cross_entropy = 0.402772
Epoch 25
Loss = 4.9567e-01, PNorm = 98.0424, GNorm = 1.1660, lr_0 = 1.3780e-04
Validation binary_cross_entropy = 0.310702
Epoch 26
Loss = 5.3479e-01, PNorm = 98.0631, GNorm = 1.8649, lr_0 = 1.3915e-04
Validation binary_cross_entropy = 0.528143
Epoch 27
Loss = 4.5300e-01, PNorm = 98.0844, GNorm = 1.2190, lr_0 = 1.4049e-04
Validation binary_cross_entropy = 0.357552
Epoch 28
Loss = 5.3009e-01, PNorm = 98.1062, GNorm = 2.8732, lr_0 = 1.4184e-04
Validation binary_cross_entropy = 0.248059
Epoch 29
Loss = 5.4295e-01, PNorm = 98.1235, GNorm = 3.8114, lr_0 = 1.4318e-04
Loss = 4.9265e-01, PNorm = 98.1429, GNorm = 1.8700, lr_0 = 1.4453e-04
Validation binary_cross_entropy = 0.365345
Epoch 30
Loss = 5.2504e-01, PNorm = 98.1631, GNorm = 2.6930, lr_0 = 1.4587e-04
Validation binary_cross_entropy = 0.400885
Epoch 31
Loss = 4.7716e-01, PNorm = 98.1865, GNorm = 2.3743, lr_0 = 1.4722e-04
Validation binary_cross_entropy = 0.463419
Epoch 32
Loss = 4.9030e-01, PNorm = 98.2068, GNorm = 2.0814, lr_0 = 1.4857e-04
Validation binary_cross_entropy = 0.578164
Epoch 33
Loss = 4.7857e-01, PNorm = 98.2294, GNorm = 2.2478, lr_0 = 1.4991e-04
Validation binary_cross_entropy = 0.320471
Epoch 34
Loss = 4.9205e-01, PNorm = 98.2511, GNorm = 1.0006, lr_0 = 1.5126e-04
Validation binary_cross_entropy = 0.438500
Epoch 35
Loss = 4.7203e-01, PNorm = 98.2731, GNorm = 3.1434, lr_0 = 1.5260e-04
Validation binary_cross_entropy = 0.336711
Epoch 36
Loss = 4.3087e-01, PNorm = 98.2977, GNorm = 1.2831, lr_0 = 1.5395e-04
Validation binary_cross_entropy = 0.246307
Epoch 37
Loss = 5.0863e-01, PNorm = 98.3212, GNorm = 2.3873, lr_0 = 1.5529e-04
Validation binary_cross_entropy = 0.482185
Epoch 38
Loss = 4.8722e-01, PNorm = 98.3459, GNorm = 2.1884, lr_0 = 1.5664e-04
Validation binary_cross_entropy = 0.662358
Epoch 39
Loss = 3.8249e-01, PNorm = 98.3731, GNorm = 3.7170, lr_0 = 1.5798e-04
Loss = 4.6514e-01, PNorm = 98.3980, GNorm = 1.3490, lr_0 = 1.5933e-04
Validation binary_cross_entropy = 0.564653
Epoch 40
Loss = 4.2811e-01, PNorm = 98.4236, GNorm = 2.8611, lr_0 = 1.6067e-04
Validation binary_cross_entropy = 0.417946
Epoch 41
Loss = 4.3436e-01, PNorm = 98.4489, GNorm = 1.2183, lr_0 = 1.6202e-04
Validation binary_cross_entropy = 0.451536
Epoch 42
Loss = 4.1902e-01, PNorm = 98.4759, GNorm = 4.2424, lr_0 = 1.6336e-04
Validation binary_cross_entropy = 0.297896
Epoch 43
Loss = 3.5868e-01, PNorm = 98.5069, GNorm = 4.2644, lr_0 = 1.6471e-04
Validation binary_cross_entropy = 0.480627
Epoch 44
Loss = 4.6608e-01, PNorm = 98.5380, GNorm = 7.3829, lr_0 = 1.6605e-04
Validation binary_cross_entropy = 0.390936
Epoch 45
Loss = 3.9820e-01, PNorm = 98.5687, GNorm = 3.0343, lr_0 = 1.6740e-04
Validation binary_cross_entropy = 0.471826
Epoch 46
Loss = 3.6331e-01, PNorm = 98.6039, GNorm = 3.4285, lr_0 = 1.6874e-04
Validation binary_cross_entropy = 0.363856
Epoch 47
Loss = 3.8092e-01, PNorm = 98.6348, GNorm = 3.4876, lr_0 = 1.7009e-04
Validation binary_cross_entropy = 0.331835
Epoch 48
Loss = 3.6929e-01, PNorm = 98.6678, GNorm = 1.2512, lr_0 = 1.7143e-04
Validation binary_cross_entropy = 0.454751
Epoch 49
Loss = 4.2923e-01, PNorm = 98.7039, GNorm = 3.1346, lr_0 = 1.7278e-04
Loss = 3.1974e-01, PNorm = 98.7384, GNorm = 6.0585, lr_0 = 1.7413e-04
Validation binary_cross_entropy = 0.162209
Epoch 50
Loss = 4.3234e-01, PNorm = 98.7708, GNorm = 5.0253, lr_0 = 1.7547e-04
Validation binary_cross_entropy = 0.244032
Epoch 51
Loss = 3.8103e-01, PNorm = 98.8098, GNorm = 1.3058, lr_0 = 1.7682e-04
Validation binary_cross_entropy = 0.418333
Epoch 52
Loss = 3.4787e-01, PNorm = 98.8459, GNorm = 1.1868, lr_0 = 1.7816e-04
Validation binary_cross_entropy = 0.648391
Epoch 53
Loss = 3.2373e-01, PNorm = 98.8776, GNorm = 3.7668, lr_0 = 1.7951e-04
Validation binary_cross_entropy = 0.303430
Epoch 54
Loss = 3.4453e-01, PNorm = 98.9126, GNorm = 1.5832, lr_0 = 1.8085e-04
Validation binary_cross_entropy = 0.226562
Epoch 55
Loss = 3.1545e-01, PNorm = 98.9487, GNorm = 4.1123, lr_0 = 1.8220e-04
Validation binary_cross_entropy = 0.408323
Epoch 56
Loss = 2.6775e-01, PNorm = 98.9816, GNorm = 1.8604, lr_0 = 1.8354e-04
Validation binary_cross_entropy = 0.182031
Epoch 57
Loss = 4.0794e-01, PNorm = 99.0181, GNorm = 2.2756, lr_0 = 1.8489e-04
Validation binary_cross_entropy = 0.301760
Epoch 58
Loss = 2.3337e-01, PNorm = 99.0567, GNorm = 1.5070, lr_0 = 1.8623e-04
Validation binary_cross_entropy = 0.247766
Epoch 59
Loss = 4.0193e-01, PNorm = 99.0939, GNorm = 4.7511, lr_0 = 1.8758e-04
Loss = 3.4594e-01, PNorm = 99.1278, GNorm = 3.4144, lr_0 = 1.8892e-04
Validation binary_cross_entropy = 0.263412
Epoch 60
Loss = 2.8230e-01, PNorm = 99.1619, GNorm = 3.1634, lr_0 = 1.9027e-04
Validation binary_cross_entropy = 0.561964
Epoch 61
Loss = 2.6703e-01, PNorm = 99.1964, GNorm = 1.6784, lr_0 = 1.9161e-04
Validation binary_cross_entropy = 0.489072
Epoch 62
Loss = 2.5457e-01, PNorm = 99.2297, GNorm = 2.2522, lr_0 = 1.9296e-04
Validation binary_cross_entropy = 0.265564
Epoch 63
Loss = 2.0571e-01, PNorm = 99.2661, GNorm = 6.6750, lr_0 = 1.9430e-04
Validation binary_cross_entropy = 0.467453
Epoch 64
Loss = 2.9836e-01, PNorm = 99.2986, GNorm = 2.4728, lr_0 = 1.9565e-04
Validation binary_cross_entropy = 0.261119
Epoch 65
Loss = 3.0609e-01, PNorm = 99.3327, GNorm = 2.2877, lr_0 = 1.9700e-04
Validation binary_cross_entropy = 0.360464
Epoch 66
Loss = 3.1812e-01, PNorm = 99.3745, GNorm = 2.0860, lr_0 = 1.9834e-04
Validation binary_cross_entropy = 0.187050
Epoch 67
Loss = 2.5244e-01, PNorm = 99.4133, GNorm = 2.3795, lr_0 = 1.9969e-04
Validation binary_cross_entropy = 0.133457
Epoch 68
Loss = 2.3641e-01, PNorm = 99.4472, GNorm = 4.5209, lr_0 = 2.0103e-04
Validation binary_cross_entropy = 0.221390
Epoch 69
Loss = 1.5289e-01, PNorm = 99.4849, GNorm = 1.4708, lr_0 = 2.0238e-04
Loss = 2.4325e-01, PNorm = 99.5232, GNorm = 2.6303, lr_0 = 2.0372e-04
Validation binary_cross_entropy = 0.165264
Epoch 70
Loss = 2.2117e-01, PNorm = 99.5563, GNorm = 2.2801, lr_0 = 2.0507e-04
Validation binary_cross_entropy = 0.286266
Epoch 71
Loss = 2.4387e-01, PNorm = 99.5910, GNorm = 10.7117, lr_0 = 2.0641e-04
Validation binary_cross_entropy = 0.826105
Epoch 72
Loss = 4.0510e-01, PNorm = 99.6274, GNorm = 2.9876, lr_0 = 2.0776e-04
Validation binary_cross_entropy = 0.226252
Epoch 73
Loss = 2.6895e-01, PNorm = 99.6702, GNorm = 3.1434, lr_0 = 2.0910e-04
Validation binary_cross_entropy = 0.153337
Epoch 74
Loss = 1.9730e-01, PNorm = 99.7091, GNorm = 1.6392, lr_0 = 2.1045e-04
Validation binary_cross_entropy = 0.585313
Epoch 75
Loss = 1.8852e-01, PNorm = 99.7435, GNorm = 5.3135, lr_0 = 2.1179e-04
Validation binary_cross_entropy = 0.097292
Epoch 76
Loss = 2.0178e-01, PNorm = 99.7804, GNorm = 2.5828, lr_0 = 2.1314e-04
Validation binary_cross_entropy = 0.364694
Epoch 77
Loss = 2.0911e-01, PNorm = 99.8204, GNorm = 2.1520, lr_0 = 2.1448e-04
Validation binary_cross_entropy = 0.261158
Epoch 78
Loss = 1.3799e-01, PNorm = 99.8605, GNorm = 2.2860, lr_0 = 2.1583e-04
Validation binary_cross_entropy = 0.189326
Epoch 79
Loss = 2.1988e-01, PNorm = 99.9040, GNorm = 3.8147, lr_0 = 2.1717e-04
Loss = 2.5281e-01, PNorm = 99.9437, GNorm = 11.4828, lr_0 = 2.1852e-04
Validation binary_cross_entropy = 0.257661
Epoch 80
Loss = 2.4613e-01, PNorm = 99.9881, GNorm = 2.6144, lr_0 = 2.1987e-04
Validation binary_cross_entropy = 0.149832
Epoch 81
Loss = 2.0630e-01, PNorm = 100.0317, GNorm = 2.1480, lr_0 = 2.2121e-04
Validation binary_cross_entropy = 0.336846
Epoch 82
Loss = 2.3701e-01, PNorm = 100.0719, GNorm = 3.7138, lr_0 = 2.2256e-04
Validation binary_cross_entropy = 0.130459
Epoch 83
Loss = 2.2028e-01, PNorm = 100.1131, GNorm = 4.1926, lr_0 = 2.2390e-04
Validation binary_cross_entropy = 0.175914
Epoch 84
Loss = 1.6125e-01, PNorm = 100.1476, GNorm = 2.3729, lr_0 = 2.2525e-04
Validation binary_cross_entropy = 0.368848
Epoch 85
Loss = 1.5388e-01, PNorm = 100.1885, GNorm = 4.0069, lr_0 = 2.2659e-04
Validation binary_cross_entropy = 0.177586
Epoch 86
Loss = 2.0122e-01, PNorm = 100.2281, GNorm = 4.3615, lr_0 = 2.2794e-04
Validation binary_cross_entropy = 0.257760
Epoch 87
Loss = 2.0820e-01, PNorm = 100.2609, GNorm = 4.8316, lr_0 = 2.2928e-04
Validation binary_cross_entropy = 0.170057
Epoch 88
Loss = 1.8263e-01, PNorm = 100.2964, GNorm = 3.8554, lr_0 = 2.3063e-04
Validation binary_cross_entropy = 0.122159
Epoch 89
Loss = 1.2309e-01, PNorm = 100.3419, GNorm = 1.8523, lr_0 = 2.3197e-04
Loss = 1.4141e-01, PNorm = 100.3844, GNorm = 1.9861, lr_0 = 2.3332e-04
Validation binary_cross_entropy = 0.200339
Epoch 90
Loss = 1.0178e-01, PNorm = 100.4218, GNorm = 2.3043, lr_0 = 2.3466e-04
Validation binary_cross_entropy = 0.232366
Epoch 91
Loss = 1.5104e-01, PNorm = 100.4533, GNorm = 2.0368, lr_0 = 2.3601e-04
Validation binary_cross_entropy = 0.212870
Epoch 92
Loss = 1.5020e-01, PNorm = 100.4851, GNorm = 3.2411, lr_0 = 2.3735e-04
Validation binary_cross_entropy = 0.133002
Epoch 93
Loss = 1.6947e-01, PNorm = 100.5162, GNorm = 3.3788, lr_0 = 2.3870e-04
Validation binary_cross_entropy = 0.280738
Epoch 94
Loss = 2.1166e-01, PNorm = 100.5515, GNorm = 8.2393, lr_0 = 2.4004e-04
Validation binary_cross_entropy = 0.074346
Epoch 95
Loss = 1.5735e-01, PNorm = 100.5904, GNorm = 3.8572, lr_0 = 2.4139e-04
Validation binary_cross_entropy = 0.210323
Epoch 96
Loss = 2.1022e-01, PNorm = 100.6281, GNorm = 2.0756, lr_0 = 2.4274e-04
Validation binary_cross_entropy = 0.201828
Epoch 97
Loss = 1.1754e-01, PNorm = 100.6618, GNorm = 2.9435, lr_0 = 2.4408e-04
Validation binary_cross_entropy = 0.178494
Epoch 98
Loss = 1.9594e-01, PNorm = 100.6943, GNorm = 2.4952, lr_0 = 2.4543e-04
Validation binary_cross_entropy = 0.220221
Epoch 99
Loss = 1.9396e-01, PNorm = 100.7327, GNorm = 3.8993, lr_0 = 2.4677e-04
Loss = 2.0683e-01, PNorm = 100.7712, GNorm = 3.9930, lr_0 = 2.4812e-04
Validation binary_cross_entropy = 0.121709
Epoch 100
Loss = 1.5886e-01, PNorm = 100.8175, GNorm = 3.3735, lr_0 = 2.4946e-04
Validation binary_cross_entropy = 0.330186
Epoch 101
Loss = 1.6201e-01, PNorm = 100.8615, GNorm = 1.9812, lr_0 = 2.5081e-04
Validation binary_cross_entropy = 0.235338
Epoch 102
Loss = 1.2971e-01, PNorm = 100.9024, GNorm = 4.0042, lr_0 = 2.5215e-04
Validation binary_cross_entropy = 0.185076
Epoch 103
Loss = 1.3954e-01, PNorm = 100.9371, GNorm = 1.6174, lr_0 = 2.5350e-04
Validation binary_cross_entropy = 0.247456
Epoch 104
Loss = 1.8103e-01, PNorm = 100.9735, GNorm = 7.9963, lr_0 = 2.5484e-04
Validation binary_cross_entropy = 0.153465
Epoch 105
Loss = 1.3645e-01, PNorm = 101.0155, GNorm = 2.6625, lr_0 = 2.5619e-04
Validation binary_cross_entropy = 0.090008
Epoch 106
Loss = 1.2786e-01, PNorm = 101.0542, GNorm = 2.5148, lr_0 = 2.5753e-04
Validation binary_cross_entropy = 0.321730
Epoch 107
Loss = 1.1681e-01, PNorm = 101.0938, GNorm = 1.8198, lr_0 = 2.5888e-04
Validation binary_cross_entropy = 0.531131
Epoch 108
Loss = 1.6350e-01, PNorm = 101.1312, GNorm = 4.2238, lr_0 = 2.6022e-04
Validation binary_cross_entropy = 0.227367
Epoch 109
Loss = 1.6818e-01, PNorm = 101.1758, GNorm = 3.4929, lr_0 = 2.6157e-04
Loss = 1.7029e-01, PNorm = 101.2125, GNorm = 2.6820, lr_0 = 2.6291e-04
Validation binary_cross_entropy = 0.109822
Epoch 110
Loss = 1.1177e-01, PNorm = 101.2512, GNorm = 3.3950, lr_0 = 2.6426e-04
Validation binary_cross_entropy = 0.152774
Epoch 111
Loss = 8.7971e-02, PNorm = 101.2854, GNorm = 2.2462, lr_0 = 2.6561e-04
Validation binary_cross_entropy = 0.126052
Epoch 112
Loss = 9.7411e-02, PNorm = 101.3174, GNorm = 6.5543, lr_0 = 2.6695e-04
Validation binary_cross_entropy = 0.367837
Epoch 113
Loss = 2.6638e-01, PNorm = 101.3486, GNorm = 5.2149, lr_0 = 2.6830e-04
Validation binary_cross_entropy = 0.107620
Epoch 114
Loss = 2.0032e-01, PNorm = 101.3880, GNorm = 4.6503, lr_0 = 2.6964e-04
Validation binary_cross_entropy = 0.078169
Epoch 115
Loss = 1.9408e-01, PNorm = 101.4336, GNorm = 2.9759, lr_0 = 2.7099e-04
Validation binary_cross_entropy = 0.114639
Epoch 116
Loss = 8.3088e-02, PNorm = 101.4745, GNorm = 0.8373, lr_0 = 2.7233e-04
Validation binary_cross_entropy = 0.113972
Epoch 117
Loss = 5.9920e-02, PNorm = 101.5165, GNorm = 1.4389, lr_0 = 2.7368e-04
Validation binary_cross_entropy = 0.111418
Epoch 118
Loss = 1.4870e-01, PNorm = 101.5443, GNorm = 3.7651, lr_0 = 2.7502e-04
Validation binary_cross_entropy = 0.159488
Epoch 119
Loss = 3.6154e-02, PNorm = 101.5698, GNorm = 1.2330, lr_0 = 2.7637e-04
Loss = 1.5676e-01, PNorm = 101.6053, GNorm = 5.3475, lr_0 = 2.7771e-04
Validation binary_cross_entropy = 0.097813
Epoch 120
Loss = 1.5135e-01, PNorm = 101.6373, GNorm = 2.9022, lr_0 = 2.7906e-04
Validation binary_cross_entropy = 0.245575
Epoch 121
Loss = 1.7443e-01, PNorm = 101.6746, GNorm = 5.0365, lr_0 = 2.8040e-04
Validation binary_cross_entropy = 0.343614
Epoch 122
Loss = 1.3934e-01, PNorm = 101.7195, GNorm = 1.2613, lr_0 = 2.8175e-04
Validation binary_cross_entropy = 0.190595
Epoch 123
Loss = 9.8515e-02, PNorm = 101.7628, GNorm = 1.9959, lr_0 = 2.8309e-04
Validation binary_cross_entropy = 0.338219
Epoch 124
Loss = 1.3585e-01, PNorm = 101.8057, GNorm = 3.8294, lr_0 = 2.8444e-04
Validation binary_cross_entropy = 0.280448
Epoch 125
Loss = 1.6269e-01, PNorm = 101.8401, GNorm = 2.1978, lr_0 = 2.8578e-04
Validation binary_cross_entropy = 0.206318
Epoch 126
Loss = 1.1460e-01, PNorm = 101.8767, GNorm = 2.6089, lr_0 = 2.8713e-04
Validation binary_cross_entropy = 0.142824
Epoch 127
Loss = 6.2491e-02, PNorm = 101.9197, GNorm = 1.6506, lr_0 = 2.8848e-04
Validation binary_cross_entropy = 0.143025
Epoch 128
Loss = 2.1218e-01, PNorm = 101.9587, GNorm = 6.2649, lr_0 = 2.8982e-04
Validation binary_cross_entropy = 0.138275
Epoch 129
Loss = 2.2895e-02, PNorm = 101.9929, GNorm = 0.6810, lr_0 = 2.9117e-04
Loss = 8.4924e-02, PNorm = 102.0305, GNorm = 4.8079, lr_0 = 2.9251e-04
Validation binary_cross_entropy = 0.161421
Epoch 130
Loss = 1.2774e-01, PNorm = 102.0616, GNorm = 3.3243, lr_0 = 2.9386e-04
Validation binary_cross_entropy = 0.278560
Epoch 131
Loss = 1.2394e-01, PNorm = 102.1017, GNorm = 1.5809, lr_0 = 2.9520e-04
Validation binary_cross_entropy = 0.168359
Epoch 132
Loss = 1.0750e-01, PNorm = 102.1519, GNorm = 2.5829, lr_0 = 2.9655e-04
Validation binary_cross_entropy = 0.132988
Epoch 133
Loss = 1.5539e-01, PNorm = 102.2001, GNorm = 2.1516, lr_0 = 2.9789e-04
Validation binary_cross_entropy = 0.064269
Epoch 134
Loss = 1.2933e-01, PNorm = 102.2472, GNorm = 2.2337, lr_0 = 2.9924e-04
Validation binary_cross_entropy = 0.102014
Epoch 135
Loss = 1.1713e-01, PNorm = 102.2916, GNorm = 2.2612, lr_0 = 3.0058e-04
Validation binary_cross_entropy = 0.101045
Epoch 136
Loss = 1.5865e-01, PNorm = 102.3352, GNorm = 2.0121, lr_0 = 3.0193e-04
Validation binary_cross_entropy = 0.239953
Epoch 137
Loss = 2.0534e-01, PNorm = 102.3802, GNorm = 2.1865, lr_0 = 3.0327e-04
Validation binary_cross_entropy = 0.210789
Epoch 138
Loss = 2.2337e-01, PNorm = 102.4282, GNorm = 3.6846, lr_0 = 3.0462e-04
Validation binary_cross_entropy = 0.147302
Epoch 139
Loss = 1.0797e-01, PNorm = 102.4773, GNorm = 1.4322, lr_0 = 3.0596e-04
Loss = 5.7601e-02, PNorm = 102.5221, GNorm = 1.9960, lr_0 = 3.0731e-04
Validation binary_cross_entropy = 0.109407
Epoch 140
Loss = 7.2742e-02, PNorm = 102.5643, GNorm = 1.6473, lr_0 = 3.0865e-04
Validation binary_cross_entropy = 0.173253
Epoch 141
Loss = 1.5743e-01, PNorm = 102.6075, GNorm = 1.4205, lr_0 = 3.1000e-04
Validation binary_cross_entropy = 0.273192
Epoch 142
Loss = 2.0512e-01, PNorm = 102.6529, GNorm = 2.7666, lr_0 = 3.1135e-04
Validation binary_cross_entropy = 0.074441
Epoch 143
Loss = 1.5155e-01, PNorm = 102.7093, GNorm = 2.4669, lr_0 = 3.1269e-04
Validation binary_cross_entropy = 0.096434
Epoch 144
Loss = 8.4533e-02, PNorm = 102.7606, GNorm = 2.3061, lr_0 = 3.1404e-04
Validation binary_cross_entropy = 0.074865
Epoch 145
Loss = 6.9438e-02, PNorm = 102.8076, GNorm = 1.5898, lr_0 = 3.1538e-04
Validation binary_cross_entropy = 0.119107
Epoch 146
Loss = 7.5415e-02, PNorm = 102.8462, GNorm = 0.4777, lr_0 = 3.1673e-04
Validation binary_cross_entropy = 0.136922
Epoch 147
Loss = 1.3251e-01, PNorm = 102.8764, GNorm = 3.1071, lr_0 = 3.1807e-04
Validation binary_cross_entropy = 0.342766
Epoch 148
Loss = 7.5969e-02, PNorm = 102.9051, GNorm = 2.5459, lr_0 = 3.1942e-04
Validation binary_cross_entropy = 0.131118
Epoch 149
Loss = 6.4203e-02, PNorm = 102.9368, GNorm = 1.3718, lr_0 = 3.2076e-04
Loss = 9.2370e-02, PNorm = 102.9824, GNorm = 4.2313, lr_0 = 3.2211e-04
Validation binary_cross_entropy = 0.130312
Epoch 150
Loss = 1.0128e-01, PNorm = 103.0263, GNorm = 2.8922, lr_0 = 3.2345e-04
Validation binary_cross_entropy = 0.140689
Epoch 151
Loss = 1.1272e-01, PNorm = 103.0608, GNorm = 1.3533, lr_0 = 3.2480e-04
Validation binary_cross_entropy = 0.222133
Epoch 152
Loss = 1.0966e-01, PNorm = 103.1012, GNorm = 1.6639, lr_0 = 3.2614e-04
Validation binary_cross_entropy = 0.130573
Epoch 153
Loss = 9.5095e-02, PNorm = 103.1388, GNorm = 0.8343, lr_0 = 3.2749e-04
Validation binary_cross_entropy = 0.061541
Epoch 154
Loss = 5.9322e-02, PNorm = 103.1781, GNorm = 2.1024, lr_0 = 3.2883e-04
Validation binary_cross_entropy = 0.070299
Epoch 155
Loss = 1.0610e-01, PNorm = 103.2177, GNorm = 2.5409, lr_0 = 3.3018e-04
Validation binary_cross_entropy = 0.104823
Epoch 156
Loss = 1.3028e-01, PNorm = 103.2537, GNorm = 3.8109, lr_0 = 3.3152e-04
Validation binary_cross_entropy = 0.363509
Epoch 157
Loss = 2.5506e-01, PNorm = 103.2969, GNorm = 1.6078, lr_0 = 3.3287e-04
Validation binary_cross_entropy = 0.174620
Epoch 158
Loss = 1.3575e-01, PNorm = 103.3458, GNorm = 1.3113, lr_0 = 3.3422e-04
Validation binary_cross_entropy = 0.144628
Epoch 159
Loss = 5.6530e-02, PNorm = 103.3954, GNorm = 0.7114, lr_0 = 3.3556e-04
Loss = 6.7969e-02, PNorm = 103.4432, GNorm = 6.8678, lr_0 = 3.3691e-04
Validation binary_cross_entropy = 0.064149
Epoch 160
Loss = 8.4487e-02, PNorm = 103.4816, GNorm = 3.4887, lr_0 = 3.3825e-04
Validation binary_cross_entropy = 0.137439
Epoch 161
Loss = 9.5819e-02, PNorm = 103.5270, GNorm = 3.8971, lr_0 = 3.3960e-04
Validation binary_cross_entropy = 0.093007
Epoch 162
Loss = 1.1069e-01, PNorm = 103.5778, GNorm = 2.6252, lr_0 = 3.4094e-04
Validation binary_cross_entropy = 0.059849
Epoch 163
Loss = 6.6844e-02, PNorm = 103.6275, GNorm = 1.5003, lr_0 = 3.4229e-04
Validation binary_cross_entropy = 0.093409
Epoch 164
Loss = 6.5873e-02, PNorm = 103.6776, GNorm = 1.8216, lr_0 = 3.4363e-04
Validation binary_cross_entropy = 0.161161
Epoch 165
Loss = 7.2357e-02, PNorm = 103.7248, GNorm = 2.0596, lr_0 = 3.4498e-04
Validation binary_cross_entropy = 0.086988
Epoch 166
Loss = 8.4904e-02, PNorm = 103.7707, GNorm = 1.5067, lr_0 = 3.4632e-04
Validation binary_cross_entropy = 0.079833
Epoch 167
Loss = 7.8310e-02, PNorm = 103.8124, GNorm = 3.9642, lr_0 = 3.4767e-04
Validation binary_cross_entropy = 0.136980
Epoch 168
Loss = 7.2141e-02, PNorm = 103.8617, GNorm = 0.9895, lr_0 = 3.4901e-04
Validation binary_cross_entropy = 0.097601
Epoch 169
Loss = 6.4713e-02, PNorm = 103.8992, GNorm = 1.2164, lr_0 = 3.5036e-04
Loss = 1.0080e-01, PNorm = 103.9454, GNorm = 2.4143, lr_0 = 3.5170e-04
Validation binary_cross_entropy = 0.090879
Epoch 170
Loss = 6.8125e-02, PNorm = 103.9973, GNorm = 4.0363, lr_0 = 3.5305e-04
Validation binary_cross_entropy = 0.148278
Epoch 171
Loss = 1.4691e-01, PNorm = 104.0441, GNorm = 3.4726, lr_0 = 3.5439e-04
Validation binary_cross_entropy = 0.261022
Epoch 172
Loss = 1.4540e-01, PNorm = 104.1030, GNorm = 2.2754, lr_0 = 3.5574e-04
Validation binary_cross_entropy = 0.260224
Epoch 173
Loss = 1.5535e-01, PNorm = 104.1641, GNorm = 5.1744, lr_0 = 3.5709e-04
Validation binary_cross_entropy = 0.210309
Epoch 174
Loss = 7.4244e-02, PNorm = 104.2256, GNorm = 1.0076, lr_0 = 3.5843e-04
Validation binary_cross_entropy = 0.236238
Epoch 175
Loss = 6.8067e-02, PNorm = 104.2827, GNorm = 1.5981, lr_0 = 3.5978e-04
Validation binary_cross_entropy = 0.151305
Epoch 176
Loss = 7.8441e-02, PNorm = 104.3311, GNorm = 1.1004, lr_0 = 3.6112e-04
Validation binary_cross_entropy = 0.186088
Epoch 177
Loss = 8.0544e-02, PNorm = 104.3848, GNorm = 1.2958, lr_0 = 3.6247e-04
Validation binary_cross_entropy = 0.130156
Epoch 178
Loss = 6.0129e-02, PNorm = 104.4301, GNorm = 0.9328, lr_0 = 3.6381e-04
Validation binary_cross_entropy = 0.138195
Epoch 179
Loss = 7.8274e-02, PNorm = 104.4696, GNorm = 1.1535, lr_0 = 3.6516e-04
Loss = 7.6168e-02, PNorm = 104.5038, GNorm = 0.5283, lr_0 = 3.6650e-04
Validation binary_cross_entropy = 0.071410
Epoch 180
Loss = 4.6759e-02, PNorm = 104.5404, GNorm = 1.7760, lr_0 = 3.6785e-04
Validation binary_cross_entropy = 0.140418
Epoch 181
Loss = 6.4353e-02, PNorm = 104.5790, GNorm = 0.2636, lr_0 = 3.6919e-04
Validation binary_cross_entropy = 0.369993
Epoch 182
Loss = 1.0637e-01, PNorm = 104.6380, GNorm = 2.1706, lr_0 = 3.7054e-04
Validation binary_cross_entropy = 0.234830
Epoch 183
Loss = 6.5552e-02, PNorm = 104.7002, GNorm = 0.6930, lr_0 = 3.7188e-04
Validation binary_cross_entropy = 0.133269
Epoch 184
Loss = 4.5976e-02, PNorm = 104.7461, GNorm = 1.2062, lr_0 = 3.7323e-04
Validation binary_cross_entropy = 0.156220
Epoch 185
Loss = 4.8529e-02, PNorm = 104.7969, GNorm = 1.2311, lr_0 = 3.7457e-04
Validation binary_cross_entropy = 0.172452
Epoch 186
Loss = 8.6694e-02, PNorm = 104.8406, GNorm = 2.0094, lr_0 = 3.7592e-04
Validation binary_cross_entropy = 0.196955
Epoch 187
Loss = 9.1574e-02, PNorm = 104.8893, GNorm = 2.4823, lr_0 = 3.7726e-04
Validation binary_cross_entropy = 0.282400
Epoch 188
Loss = 8.6715e-02, PNorm = 104.9379, GNorm = 2.0042, lr_0 = 3.7861e-04
Validation binary_cross_entropy = 0.138340
Epoch 189
Loss = 4.0279e-02, PNorm = 104.9983, GNorm = 1.3915, lr_0 = 3.7996e-04
Loss = 3.8853e-02, PNorm = 105.0504, GNorm = 1.7248, lr_0 = 3.8130e-04
Validation binary_cross_entropy = 0.100475
Epoch 190
Loss = 5.9726e-02, PNorm = 105.0934, GNorm = 1.0877, lr_0 = 3.8265e-04
Validation binary_cross_entropy = 0.087657
Epoch 191
Loss = 7.9422e-02, PNorm = 105.1370, GNorm = 1.6457, lr_0 = 3.8399e-04
Validation binary_cross_entropy = 0.108315
Epoch 192
Loss = 6.0889e-02, PNorm = 105.1804, GNorm = 0.7202, lr_0 = 3.8534e-04
Validation binary_cross_entropy = 0.081125
Epoch 193
Loss = 8.6564e-02, PNorm = 105.2275, GNorm = 1.0928, lr_0 = 3.8668e-04
Validation binary_cross_entropy = 0.111617
Epoch 194
Loss = 9.4768e-02, PNorm = 105.2783, GNorm = 2.3320, lr_0 = 3.8803e-04
Validation binary_cross_entropy = 0.083391
Epoch 195
Loss = 4.4771e-02, PNorm = 105.3202, GNorm = 1.3101, lr_0 = 3.8937e-04
Validation binary_cross_entropy = 0.084293
Epoch 196
Loss = 4.4123e-02, PNorm = 105.3675, GNorm = 3.6378, lr_0 = 3.9072e-04
Validation binary_cross_entropy = 0.127102
Epoch 197
Loss = 2.5232e-02, PNorm = 105.4180, GNorm = 2.2945, lr_0 = 3.9206e-04
Validation binary_cross_entropy = 0.186043
Epoch 198
Loss = 7.9408e-02, PNorm = 105.4799, GNorm = 2.7387, lr_0 = 3.9341e-04
Validation binary_cross_entropy = 0.230183
Epoch 199
Loss = 2.8833e-01, PNorm = 105.5464, GNorm = 2.6118, lr_0 = 3.9475e-04
Loss = 1.3765e-01, PNorm = 105.6326, GNorm = 1.4534, lr_0 = 3.9610e-04
Validation binary_cross_entropy = 0.160844
Epoch 200
Loss = 9.3361e-02, PNorm = 105.7231, GNorm = 0.8610, lr_0 = 3.9744e-04
Validation binary_cross_entropy = 0.153818
Epoch 201
Loss = 1.0119e-01, PNorm = 105.7964, GNorm = 2.2694, lr_0 = 3.9879e-04
Validation binary_cross_entropy = 0.102093
Epoch 202
Loss = 1.0107e-01, PNorm = 105.8537, GNorm = 2.8234, lr_0 = 4.0013e-04
Validation binary_cross_entropy = 0.158938
Epoch 203
Loss = 1.0076e-01, PNorm = 105.9099, GNorm = 2.7691, lr_0 = 4.0148e-04
Validation binary_cross_entropy = 0.170378
Epoch 204
Loss = 7.9961e-02, PNorm = 105.9617, GNorm = 2.0079, lr_0 = 4.0283e-04
Validation binary_cross_entropy = 0.086322
Epoch 205
Loss = 7.4896e-02, PNorm = 106.0129, GNorm = 0.7523, lr_0 = 4.0417e-04
Validation binary_cross_entropy = 0.080706
Epoch 206
Loss = 5.4864e-02, PNorm = 106.0532, GNorm = 1.5595, lr_0 = 4.0552e-04
Validation binary_cross_entropy = 0.156769
Epoch 207
Loss = 4.0033e-02, PNorm = 106.1008, GNorm = 1.2874, lr_0 = 4.0686e-04
Validation binary_cross_entropy = 0.150061
Epoch 208
Loss = 3.0197e-01, PNorm = 106.1335, GNorm = 3.7159, lr_0 = 4.0821e-04
Validation binary_cross_entropy = 0.345173
Epoch 209
Loss = 2.5797e-01, PNorm = 106.1753, GNorm = 2.2203, lr_0 = 4.0955e-04
Loss = 1.2914e-01, PNorm = 106.2531, GNorm = 0.7705, lr_0 = 4.1090e-04
Validation binary_cross_entropy = 0.116048
Epoch 210
Loss = 5.4781e-02, PNorm = 106.3330, GNorm = 0.8455, lr_0 = 4.1224e-04
Validation binary_cross_entropy = 0.052837
Epoch 211
Loss = 3.5854e-02, PNorm = 106.4014, GNorm = 0.0542, lr_0 = 4.1359e-04
Validation binary_cross_entropy = 0.103586
Epoch 212
Loss = 8.0950e-02, PNorm = 106.4417, GNorm = 2.0517, lr_0 = 4.1493e-04
Validation binary_cross_entropy = 0.095266
Epoch 213
Loss = 5.3196e-02, PNorm = 106.4882, GNorm = 1.3741, lr_0 = 4.1628e-04
Validation binary_cross_entropy = 0.089978
Epoch 214
Loss = 9.7939e-03, PNorm = 106.5399, GNorm = 1.2974, lr_0 = 4.1762e-04
Validation binary_cross_entropy = 0.179770
Epoch 215
Loss = 1.4271e-01, PNorm = 106.5698, GNorm = 0.6957, lr_0 = 4.1897e-04
Validation binary_cross_entropy = 0.081142
Epoch 216
Loss = 1.3846e-01, PNorm = 106.6556, GNorm = 0.8427, lr_0 = 4.2031e-04
Validation binary_cross_entropy = 0.072036
Epoch 217
Loss = 1.0868e-01, PNorm = 106.7520, GNorm = 1.1804, lr_0 = 4.2166e-04
Validation binary_cross_entropy = 0.120109
Epoch 218
Loss = 4.2939e-02, PNorm = 106.8433, GNorm = 0.9534, lr_0 = 4.2300e-04
Validation binary_cross_entropy = 0.134163
Epoch 219
Loss = 3.5324e-02, PNorm = 106.9116, GNorm = 1.5059, lr_0 = 4.2435e-04
Loss = 6.9449e-02, PNorm = 106.9700, GNorm = 3.5725, lr_0 = 4.2570e-04
Validation binary_cross_entropy = 0.137280
Epoch 220
Loss = 1.4625e-01, PNorm = 107.0402, GNorm = 2.7586, lr_0 = 4.2704e-04
Validation binary_cross_entropy = 0.096133
Epoch 221
Loss = 8.6239e-02, PNorm = 107.1244, GNorm = 0.2600, lr_0 = 4.2839e-04
Validation binary_cross_entropy = 0.099635
Epoch 222
Loss = 6.5472e-02, PNorm = 107.1981, GNorm = 0.6516, lr_0 = 4.2973e-04
Validation binary_cross_entropy = 0.069541
Epoch 223
Loss = 7.9362e-02, PNorm = 107.2662, GNorm = 3.0621, lr_0 = 4.3108e-04
Validation binary_cross_entropy = 0.078406
Epoch 224
Loss = 3.7328e-02, PNorm = 107.3022, GNorm = 0.6789, lr_0 = 4.3242e-04
Validation binary_cross_entropy = 0.097917
Epoch 225
Loss = 8.8009e-02, PNorm = 107.3468, GNorm = 2.1110, lr_0 = 4.3377e-04
Validation binary_cross_entropy = 0.153855
Epoch 226
Loss = 5.1808e-02, PNorm = 107.3887, GNorm = 1.1581, lr_0 = 4.3511e-04
Validation binary_cross_entropy = 0.060181
Epoch 227
Loss = 2.0256e-02, PNorm = 107.4386, GNorm = 0.8905, lr_0 = 4.3646e-04
Validation binary_cross_entropy = 0.048349
Epoch 228
Loss = 5.4673e-02, PNorm = 107.4803, GNorm = 0.4286, lr_0 = 4.3780e-04
Validation binary_cross_entropy = 0.065759
Epoch 229
Loss = 1.7085e-02, PNorm = 107.5033, GNorm = 0.7252, lr_0 = 4.3915e-04
Loss = 5.7175e-02, PNorm = 107.5523, GNorm = 1.8923, lr_0 = 4.4049e-04
Validation binary_cross_entropy = 0.050252
Epoch 230
Loss = 5.7767e-02, PNorm = 107.6173, GNorm = 2.7629, lr_0 = 4.4184e-04
Validation binary_cross_entropy = 0.117424
Epoch 231
Loss = 3.6548e-02, PNorm = 107.6750, GNorm = 1.5088, lr_0 = 4.4318e-04
Validation binary_cross_entropy = 0.070299
Epoch 232
Loss = 4.2750e-02, PNorm = 107.7318, GNorm = 1.6198, lr_0 = 4.4453e-04
Validation binary_cross_entropy = 0.078931
Epoch 233
Loss = 4.1453e-02, PNorm = 107.7786, GNorm = 1.7035, lr_0 = 4.4587e-04
Validation binary_cross_entropy = 0.121255
Epoch 234
Loss = 8.9116e-02, PNorm = 107.8216, GNorm = 2.9042, lr_0 = 4.4722e-04
Validation binary_cross_entropy = 0.175967
Epoch 235
Loss = 3.1663e-02, PNorm = 107.8728, GNorm = 0.0980, lr_0 = 4.4857e-04
Validation binary_cross_entropy = 0.086665
Epoch 236
Loss = 7.6747e-02, PNorm = 107.9377, GNorm = 2.3777, lr_0 = 4.4991e-04
Validation binary_cross_entropy = 0.130945
Epoch 237
Loss = 6.5788e-02, PNorm = 108.0005, GNorm = 2.6958, lr_0 = 4.5126e-04
Validation binary_cross_entropy = 0.157128
Epoch 238
Loss = 1.2028e-01, PNorm = 108.0617, GNorm = 1.3053, lr_0 = 4.5260e-04
Validation binary_cross_entropy = 0.078216
Epoch 239
Loss = 7.6315e-02, PNorm = 108.1238, GNorm = 3.0637, lr_0 = 4.5395e-04
Loss = 9.1636e-02, PNorm = 108.1919, GNorm = 1.0393, lr_0 = 4.5529e-04
Validation binary_cross_entropy = 0.197576
Epoch 240
Loss = 6.1526e-02, PNorm = 108.2658, GNorm = 1.7455, lr_0 = 4.5664e-04
Validation binary_cross_entropy = 0.085276
Epoch 241
Loss = 4.9804e-02, PNorm = 108.3392, GNorm = 1.5536, lr_0 = 4.5798e-04
Validation binary_cross_entropy = 0.092979
Epoch 242
Loss = 3.8949e-02, PNorm = 108.4035, GNorm = 1.3710, lr_0 = 4.5933e-04
Validation binary_cross_entropy = 0.130775
Epoch 243
Loss = 4.5867e-02, PNorm = 108.4461, GNorm = 0.3945, lr_0 = 4.6067e-04
Validation binary_cross_entropy = 0.075232
Epoch 244
Loss = 8.1681e-02, PNorm = 108.4912, GNorm = 1.6411, lr_0 = 4.6202e-04
Validation binary_cross_entropy = 0.071994
Epoch 245
Loss = 1.0052e-01, PNorm = 108.5367, GNorm = 1.5601, lr_0 = 4.6336e-04
Validation binary_cross_entropy = 0.054120
Epoch 246
Loss = 3.7674e-02, PNorm = 108.5949, GNorm = 0.9492, lr_0 = 4.6471e-04
Validation binary_cross_entropy = 0.080186
Epoch 247
Loss = 3.5091e-02, PNorm = 108.6506, GNorm = 1.1158, lr_0 = 4.6605e-04
Validation binary_cross_entropy = 0.091904
Epoch 248
Loss = 5.6543e-02, PNorm = 108.7042, GNorm = 1.8547, lr_0 = 4.6740e-04
Validation binary_cross_entropy = 0.151176
Epoch 249
Loss = 7.0957e-02, PNorm = 108.7517, GNorm = 1.2747, lr_0 = 4.6874e-04
Loss = 1.3109e-01, PNorm = 108.7884, GNorm = 0.9786, lr_0 = 4.7009e-04
Validation binary_cross_entropy = 0.106341
Epoch 250
Loss = 8.3913e-02, PNorm = 108.8404, GNorm = 2.7370, lr_0 = 4.7143e-04
Validation binary_cross_entropy = 0.112787
Epoch 251
Loss = 3.9176e-02, PNorm = 108.9068, GNorm = 0.7265, lr_0 = 4.7278e-04
Validation binary_cross_entropy = 0.047458
Epoch 252
Loss = 2.7538e-02, PNorm = 108.9741, GNorm = 0.0341, lr_0 = 4.7413e-04
Validation binary_cross_entropy = 0.080420
Epoch 253
Loss = 1.1289e-01, PNorm = 109.0217, GNorm = 1.5992, lr_0 = 4.7547e-04
Validation binary_cross_entropy = 0.179008
Epoch 254
Loss = 5.6008e-02, PNorm = 109.0963, GNorm = 0.4284, lr_0 = 4.7682e-04
Validation binary_cross_entropy = 0.094455
Epoch 255
Loss = 2.8244e-02, PNorm = 109.1684, GNorm = 2.2103, lr_0 = 4.7816e-04
Validation binary_cross_entropy = 0.089415
Epoch 256
Loss = 3.0850e-02, PNorm = 109.2291, GNorm = 0.3654, lr_0 = 4.7951e-04
Validation binary_cross_entropy = 0.103909
Epoch 257
Loss = 4.8841e-02, PNorm = 109.2848, GNorm = 0.6729, lr_0 = 4.8085e-04
Validation binary_cross_entropy = 0.057675
Epoch 258
Loss = 6.3753e-02, PNorm = 109.3280, GNorm = 1.5286, lr_0 = 4.8220e-04
Validation binary_cross_entropy = 0.099283
Epoch 259
Loss = 8.9313e-03, PNorm = 109.3823, GNorm = 0.1799, lr_0 = 4.8354e-04
Loss = 7.5256e-02, PNorm = 109.4389, GNorm = 0.2860, lr_0 = 4.8489e-04
Validation binary_cross_entropy = 0.166995
Epoch 260
Loss = 8.4273e-02, PNorm = 109.5013, GNorm = 1.1653, lr_0 = 4.8623e-04
Validation binary_cross_entropy = 0.044455
Epoch 261
Loss = 4.5214e-02, PNorm = 109.5824, GNorm = 0.2867, lr_0 = 4.8758e-04
Validation binary_cross_entropy = 0.067276
Epoch 262
Loss = 7.8361e-02, PNorm = 109.6356, GNorm = 2.2013, lr_0 = 4.8892e-04
Validation binary_cross_entropy = 0.103975
Epoch 263
Loss = 9.2397e-02, PNorm = 109.6863, GNorm = 0.7925, lr_0 = 4.9027e-04
Validation binary_cross_entropy = 0.059822
Epoch 264
Loss = 5.3295e-02, PNorm = 109.7338, GNorm = 0.6176, lr_0 = 4.9161e-04
Validation binary_cross_entropy = 0.036685
Epoch 265
Loss = 3.3152e-02, PNorm = 109.7993, GNorm = 1.1098, lr_0 = 4.9296e-04
Validation binary_cross_entropy = 0.069132
Epoch 266
Loss = 6.8441e-02, PNorm = 109.8655, GNorm = 1.7084, lr_0 = 4.9430e-04
Validation binary_cross_entropy = 0.094096
Epoch 267
Loss = 1.6316e-02, PNorm = 109.9279, GNorm = 0.2398, lr_0 = 4.9565e-04
Validation binary_cross_entropy = 0.050017
Epoch 268
Loss = 3.0572e-02, PNorm = 109.9979, GNorm = 0.6108, lr_0 = 4.9700e-04
Validation binary_cross_entropy = 0.086974
Epoch 269
Loss = 4.6990e-02, PNorm = 110.0930, GNorm = 1.1592, lr_0 = 4.9834e-04
Loss = 1.1178e-01, PNorm = 110.1786, GNorm = 1.1604, lr_0 = 4.9969e-04
Validation binary_cross_entropy = 0.116415
Epoch 270
Loss = 6.3948e-02, PNorm = 110.2679, GNorm = 0.7081, lr_0 = 5.0103e-04
Validation binary_cross_entropy = 0.039964
Epoch 271
Loss = 7.3940e-02, PNorm = 110.3445, GNorm = 0.2178, lr_0 = 5.0238e-04
Validation binary_cross_entropy = 0.074881
Epoch 272
Loss = 5.5192e-02, PNorm = 110.4057, GNorm = 0.7039, lr_0 = 5.0372e-04
Validation binary_cross_entropy = 0.076290
Epoch 273
Loss = 3.0543e-02, PNorm = 110.4749, GNorm = 2.5510, lr_0 = 5.0507e-04
Validation binary_cross_entropy = 0.097813
Epoch 274
Loss = 5.3228e-02, PNorm = 110.5357, GNorm = 0.7707, lr_0 = 5.0641e-04
Validation binary_cross_entropy = 0.065747
Epoch 275
Loss = 4.3092e-02, PNorm = 110.6129, GNorm = 1.2511, lr_0 = 5.0776e-04
Validation binary_cross_entropy = 0.106501
Epoch 276
Loss = 3.6304e-02, PNorm = 110.6840, GNorm = 0.5149, lr_0 = 5.0910e-04
Validation binary_cross_entropy = 0.059690
Epoch 277
Loss = 3.0053e-02, PNorm = 110.7514, GNorm = 1.5981, lr_0 = 5.1045e-04
Validation binary_cross_entropy = 0.064599
Epoch 278
Loss = 4.5364e-03, PNorm = 110.8110, GNorm = 0.1399, lr_0 = 5.1179e-04
Validation binary_cross_entropy = 0.043022
Epoch 279
Loss = 1.7253e-01, PNorm = 110.8579, GNorm = 3.6742, lr_0 = 5.1314e-04
Loss = 6.9574e-02, PNorm = 110.9278, GNorm = 1.7192, lr_0 = 5.1448e-04
Validation binary_cross_entropy = 0.083462
Epoch 280
Loss = 5.7050e-02, PNorm = 111.0118, GNorm = 0.2338, lr_0 = 5.1583e-04
Validation binary_cross_entropy = 0.067899
Epoch 281
Loss = 5.4355e-02, PNorm = 111.0896, GNorm = 1.0263, lr_0 = 5.1717e-04
Validation binary_cross_entropy = 0.052303
Epoch 282
Loss = 2.1805e-02, PNorm = 111.1682, GNorm = 2.7137, lr_0 = 5.1852e-04
Validation binary_cross_entropy = 0.081559
Epoch 283
Loss = 8.2847e-02, PNorm = 111.2352, GNorm = 3.5455, lr_0 = 5.1987e-04
Validation binary_cross_entropy = 0.130584
Epoch 284
Loss = 8.4155e-02, PNorm = 111.2937, GNorm = 0.1979, lr_0 = 5.2121e-04
Validation binary_cross_entropy = 0.145551
Epoch 285
Loss = 6.8155e-02, PNorm = 111.3738, GNorm = 0.5539, lr_0 = 5.2256e-04
Validation binary_cross_entropy = 0.074823
Epoch 286
Loss = 4.8730e-02, PNorm = 111.4426, GNorm = 1.3112, lr_0 = 5.2390e-04
Validation binary_cross_entropy = 0.170733
Epoch 287
Loss = 1.1075e-01, PNorm = 111.5188, GNorm = 0.6976, lr_0 = 5.2525e-04
Validation binary_cross_entropy = 0.183780
Epoch 288
Loss = 1.0042e-01, PNorm = 111.6231, GNorm = 2.7773, lr_0 = 5.2659e-04
Validation binary_cross_entropy = 0.257934
Epoch 289
Loss = 1.6685e-01, PNorm = 111.7496, GNorm = 2.1033, lr_0 = 5.2794e-04
Loss = 1.1595e-01, PNorm = 111.8626, GNorm = 0.6608, lr_0 = 5.2928e-04
Validation binary_cross_entropy = 0.090313
Epoch 290
Loss = 5.7878e-02, PNorm = 111.9557, GNorm = 1.4440, lr_0 = 5.3063e-04
Validation binary_cross_entropy = 0.051205
Epoch 291
Loss = 4.0549e-02, PNorm = 112.0354, GNorm = 0.2581, lr_0 = 5.3197e-04
Validation binary_cross_entropy = 0.072426
Epoch 292
Loss = 4.8483e-02, PNorm = 112.1008, GNorm = 1.3689, lr_0 = 5.3332e-04
Validation binary_cross_entropy = 0.046180
Epoch 293
Loss = 4.8342e-02, PNorm = 112.1703, GNorm = 0.5952, lr_0 = 5.3466e-04
Validation binary_cross_entropy = 0.070917
Epoch 294
Loss = 3.8101e-02, PNorm = 112.2413, GNorm = 0.2268, lr_0 = 5.3601e-04
Validation binary_cross_entropy = 0.111697
Epoch 295
Loss = 1.3777e-02, PNorm = 112.3046, GNorm = 0.5238, lr_0 = 5.3735e-04
Validation binary_cross_entropy = 0.052107
Epoch 296
Loss = 7.8514e-02, PNorm = 112.3696, GNorm = 3.2580, lr_0 = 5.3870e-04
Validation binary_cross_entropy = 0.072272
Epoch 297
Loss = 2.8911e-02, PNorm = 112.4581, GNorm = 0.8889, lr_0 = 5.4004e-04
Validation binary_cross_entropy = 0.103369
Epoch 298
Loss = 7.6234e-02, PNorm = 112.5598, GNorm = 3.0302, lr_0 = 5.4139e-04
Validation binary_cross_entropy = 0.112904
Epoch 299
Loss = 1.0636e-01, PNorm = 112.6661, GNorm = 1.5535, lr_0 = 5.4274e-04
Loss = 8.7625e-02, PNorm = 112.7593, GNorm = 0.2004, lr_0 = 5.4408e-04
Validation binary_cross_entropy = 0.089022
Epoch 300
Loss = 7.2535e-02, PNorm = 112.8653, GNorm = 0.4041, lr_0 = 5.4543e-04
Validation binary_cross_entropy = 0.061657
Epoch 301
Loss = 4.8470e-02, PNorm = 112.9722, GNorm = 1.6430, lr_0 = 5.4677e-04
Validation binary_cross_entropy = 0.151331
Epoch 302
Loss = 5.5902e-02, PNorm = 113.0522, GNorm = 2.0723, lr_0 = 5.4812e-04
Validation binary_cross_entropy = 0.069834
Epoch 303
Loss = 4.2697e-02, PNorm = 113.1202, GNorm = 1.6342, lr_0 = 5.4946e-04
Validation binary_cross_entropy = 0.052359
Epoch 304
Loss = 1.6162e-01, PNorm = 113.1718, GNorm = 3.5287, lr_0 = 5.5081e-04
Validation binary_cross_entropy = 0.073125
Epoch 305
Loss = 8.4820e-02, PNorm = 113.2321, GNorm = 0.6814, lr_0 = 5.5215e-04
Validation binary_cross_entropy = 0.058287
Epoch 306
Loss = 4.2264e-02, PNorm = 113.3290, GNorm = 1.3913, lr_0 = 5.5350e-04
Validation binary_cross_entropy = 0.080676
Epoch 307
Loss = 7.8966e-03, PNorm = 113.4288, GNorm = 0.7613, lr_0 = 5.5484e-04
Validation binary_cross_entropy = 0.053270
Epoch 308
Loss = 5.8065e-02, PNorm = 113.5085, GNorm = 1.4337, lr_0 = 5.5619e-04
Validation binary_cross_entropy = 0.113558
Epoch 309
Loss = 3.9187e-02, PNorm = 113.5570, GNorm = 1.8307, lr_0 = 5.5753e-04
Loss = 4.9227e-02, PNorm = 113.6383, GNorm = 0.3027, lr_0 = 5.5888e-04
Validation binary_cross_entropy = 0.104271
Epoch 310
Loss = 4.3644e-02, PNorm = 113.7311, GNorm = 2.2993, lr_0 = 5.6022e-04
Validation binary_cross_entropy = 0.108955
Epoch 311
Loss = 3.3350e-02, PNorm = 113.8290, GNorm = 0.0942, lr_0 = 5.6157e-04
Validation binary_cross_entropy = 0.147863
Epoch 312
Loss = 8.6027e-02, PNorm = 113.9173, GNorm = 0.6098, lr_0 = 5.6291e-04
Validation binary_cross_entropy = 0.105763
Epoch 313
Loss = 5.5519e-02, PNorm = 114.0406, GNorm = 1.5949, lr_0 = 5.6426e-04
Validation binary_cross_entropy = 0.058684
Epoch 314
Loss = 3.8290e-02, PNorm = 114.1588, GNorm = 0.1753, lr_0 = 5.6561e-04
Validation binary_cross_entropy = 0.082342
Epoch 315
Loss = 7.4134e-02, PNorm = 114.2484, GNorm = 0.4091, lr_0 = 5.6695e-04
Validation binary_cross_entropy = 0.038756
Epoch 316
Loss = 2.1413e-02, PNorm = 114.3371, GNorm = 0.1515, lr_0 = 5.6830e-04
Validation binary_cross_entropy = 0.065271
Epoch 317
Loss = 8.4893e-03, PNorm = 114.4177, GNorm = 0.0403, lr_0 = 5.6964e-04
Validation binary_cross_entropy = 0.143464
Epoch 318
Loss = 1.7818e-02, PNorm = 114.4741, GNorm = 0.0236, lr_0 = 5.7099e-04
Validation binary_cross_entropy = 0.067790
Epoch 319
Loss = 3.8427e-02, PNorm = 114.5853, GNorm = 0.8750, lr_0 = 5.7233e-04
Loss = 6.9948e-02, PNorm = 114.7131, GNorm = 3.4851, lr_0 = 5.7368e-04
Validation binary_cross_entropy = 0.077129
Epoch 320
Loss = 1.9968e-02, PNorm = 114.8335, GNorm = 0.1488, lr_0 = 5.7502e-04
Validation binary_cross_entropy = 0.104591
Epoch 321
Loss = 5.6698e-02, PNorm = 114.9339, GNorm = 1.5781, lr_0 = 5.7637e-04
Validation binary_cross_entropy = 0.050631
Epoch 322
Loss = 7.3166e-02, PNorm = 115.0882, GNorm = 1.5467, lr_0 = 5.7771e-04
Validation binary_cross_entropy = 0.073721
Epoch 323
Loss = 1.3724e-01, PNorm = 115.2998, GNorm = 1.9442, lr_0 = 5.7906e-04
Validation binary_cross_entropy = 0.111918
Epoch 324
Loss = 7.5497e-02, PNorm = 115.4920, GNorm = 0.5738, lr_0 = 5.8040e-04
Validation binary_cross_entropy = 0.096140
Epoch 325
Loss = 7.4350e-02, PNorm = 115.6367, GNorm = 1.7219, lr_0 = 5.8175e-04
Validation binary_cross_entropy = 0.163041
Epoch 326
Loss = 4.8568e-02, PNorm = 115.7674, GNorm = 1.4096, lr_0 = 5.8309e-04
Validation binary_cross_entropy = 0.093405
Epoch 327
Loss = 6.2291e-02, PNorm = 115.8812, GNorm = 1.4746, lr_0 = 5.8444e-04
Validation binary_cross_entropy = 0.071032
Epoch 328
Loss = 1.6953e-02, PNorm = 115.9635, GNorm = 0.7415, lr_0 = 5.8578e-04
Validation binary_cross_entropy = 0.052389
Epoch 329
Loss = 3.8019e-02, PNorm = 116.0433, GNorm = 1.2442, lr_0 = 5.8713e-04
Loss = 4.1167e-02, PNorm = 116.1339, GNorm = 0.2203, lr_0 = 5.8848e-04
Validation binary_cross_entropy = 0.088901
Epoch 330
Loss = 6.6596e-02, PNorm = 116.2413, GNorm = 2.2382, lr_0 = 5.8982e-04
Validation binary_cross_entropy = 0.071876
Epoch 331
Loss = 6.7869e-02, PNorm = 116.3427, GNorm = 1.5999, lr_0 = 5.9117e-04
Validation binary_cross_entropy = 0.079616
Epoch 332
Loss = 6.2814e-02, PNorm = 116.4496, GNorm = 0.1454, lr_0 = 5.9251e-04
Validation binary_cross_entropy = 0.071877
Epoch 333
Loss = 4.1889e-02, PNorm = 116.5505, GNorm = 1.2431, lr_0 = 5.9386e-04
Validation binary_cross_entropy = 0.098389
Epoch 334
Loss = 3.5722e-02, PNorm = 116.6406, GNorm = 1.6042, lr_0 = 5.9520e-04
Validation binary_cross_entropy = 0.109185
Epoch 335
Loss = 1.7132e-02, PNorm = 116.7432, GNorm = 1.3171, lr_0 = 5.9655e-04
Validation binary_cross_entropy = 0.074689
Epoch 336
Loss = 4.3399e-02, PNorm = 116.8301, GNorm = 1.2959, lr_0 = 5.9789e-04
Validation binary_cross_entropy = 0.092542
Epoch 337
Loss = 7.4586e-02, PNorm = 116.8840, GNorm = 3.2825, lr_0 = 5.9924e-04
Validation binary_cross_entropy = 0.064224
Epoch 338
Loss = 6.8330e-02, PNorm = 116.9777, GNorm = 0.1544, lr_0 = 6.0058e-04
Validation binary_cross_entropy = 0.086952
Epoch 339
Loss = 7.2033e-02, PNorm = 117.0764, GNorm = 1.1265, lr_0 = 6.0193e-04
Loss = 6.0853e-02, PNorm = 117.1817, GNorm = 0.7281, lr_0 = 6.0327e-04
Validation binary_cross_entropy = 0.110692
Epoch 340
Loss = 4.9045e-02, PNorm = 117.2692, GNorm = 0.5743, lr_0 = 6.0462e-04
Validation binary_cross_entropy = 0.041288
Epoch 341
Loss = 1.6337e-02, PNorm = 117.3472, GNorm = 0.2040, lr_0 = 6.0596e-04
Validation binary_cross_entropy = 0.061082
Epoch 342
Loss = 4.2083e-02, PNorm = 117.4205, GNorm = 2.5873, lr_0 = 6.0731e-04
Validation binary_cross_entropy = 0.059521
Epoch 343
Loss = 7.9635e-02, PNorm = 117.5108, GNorm = 2.1278, lr_0 = 6.0865e-04
Validation binary_cross_entropy = 0.055728
Epoch 344
Loss = 7.4301e-02, PNorm = 117.6164, GNorm = 0.9222, lr_0 = 6.1000e-04
Validation binary_cross_entropy = 0.101067
Epoch 345
Loss = 7.3900e-02, PNorm = 117.7325, GNorm = 1.1657, lr_0 = 6.1135e-04
Validation binary_cross_entropy = 0.120659
Epoch 346
Loss = 4.2900e-02, PNorm = 117.8355, GNorm = 2.4319, lr_0 = 6.1269e-04
Validation binary_cross_entropy = 0.049051
Epoch 347
Loss = 2.2748e-02, PNorm = 117.9408, GNorm = 1.5252, lr_0 = 6.1404e-04
Validation binary_cross_entropy = 0.078436
Epoch 348
Loss = 4.9840e-02, PNorm = 118.0376, GNorm = 1.0474, lr_0 = 6.1538e-04
Validation binary_cross_entropy = 0.187058
Epoch 349
Loss = 7.6740e-02, PNorm = 118.1363, GNorm = 0.8509, lr_0 = 6.1673e-04
Loss = 4.8813e-02, PNorm = 118.2625, GNorm = 1.5007, lr_0 = 6.1807e-04
Validation binary_cross_entropy = 0.099812
Epoch 350
Loss = 4.0521e-02, PNorm = 118.3892, GNorm = 0.0655, lr_0 = 6.1942e-04
Validation binary_cross_entropy = 0.094467
Epoch 351
Loss = 8.8162e-02, PNorm = 118.4839, GNorm = 0.8715, lr_0 = 6.2076e-04
Validation binary_cross_entropy = 0.101804
Epoch 352
Loss = 8.1309e-02, PNorm = 118.5821, GNorm = 0.9290, lr_0 = 6.2211e-04
Validation binary_cross_entropy = 0.111281
Epoch 353
Loss = 1.7361e-02, PNorm = 118.6882, GNorm = 0.1629, lr_0 = 6.2345e-04
Validation binary_cross_entropy = 0.046219
Epoch 354
Loss = 3.0535e-02, PNorm = 118.7848, GNorm = 0.5998, lr_0 = 6.2480e-04
Validation binary_cross_entropy = 0.056459
Epoch 355
Loss = 9.0171e-02, PNorm = 118.8739, GNorm = 4.4099, lr_0 = 6.2614e-04
Validation binary_cross_entropy = 0.073841
Epoch 356
Loss = 5.1743e-02, PNorm = 118.9689, GNorm = 1.3606, lr_0 = 6.2749e-04
Validation binary_cross_entropy = 0.123220
Epoch 357
Loss = 8.3060e-02, PNorm = 119.0916, GNorm = 2.0421, lr_0 = 6.2883e-04
Validation binary_cross_entropy = 0.143839
Epoch 358
Loss = 6.4782e-02, PNorm = 119.2162, GNorm = 2.3250, lr_0 = 6.3018e-04
Validation binary_cross_entropy = 0.064337
Epoch 359
Loss = 7.1931e-03, PNorm = 119.3509, GNorm = 0.1938, lr_0 = 6.3152e-04
Loss = 4.1927e-02, PNorm = 119.4654, GNorm = 1.7097, lr_0 = 6.3287e-04
Validation binary_cross_entropy = 0.068238
Epoch 360
Loss = 6.2010e-02, PNorm = 119.5800, GNorm = 6.5460, lr_0 = 6.3422e-04
Validation binary_cross_entropy = 0.089541
Epoch 361
Loss = 1.0840e-01, PNorm = 119.6821, GNorm = 1.1687, lr_0 = 6.3556e-04
Validation binary_cross_entropy = 0.038520
Epoch 362
Loss = 6.2915e-02, PNorm = 119.8230, GNorm = 1.2235, lr_0 = 6.3691e-04
Validation binary_cross_entropy = 0.039228
Epoch 363
Loss = 3.6764e-02, PNorm = 119.9518, GNorm = 1.3589, lr_0 = 6.3825e-04
Validation binary_cross_entropy = 0.109405
Epoch 364
Loss = 3.0920e-02, PNorm = 120.0545, GNorm = 0.0898, lr_0 = 6.3960e-04
Validation binary_cross_entropy = 0.061601
Epoch 365
Loss = 8.1064e-03, PNorm = 120.1365, GNorm = 1.2788, lr_0 = 6.4094e-04
Validation binary_cross_entropy = 0.053436
Epoch 366
Loss = 3.5696e-02, PNorm = 120.2174, GNorm = 1.9394, lr_0 = 6.4229e-04
Validation binary_cross_entropy = 0.068214
Epoch 367
Loss = 9.8851e-02, PNorm = 120.2979, GNorm = 4.1317, lr_0 = 6.4363e-04
Validation binary_cross_entropy = 0.078788
Epoch 368
Loss = 3.6030e-02, PNorm = 120.4211, GNorm = 0.4897, lr_0 = 6.4498e-04
Validation binary_cross_entropy = 0.076434
Epoch 369
Loss = 8.0649e-02, PNorm = 120.5552, GNorm = 1.3823, lr_0 = 6.4632e-04
Loss = 4.8878e-02, PNorm = 120.6634, GNorm = 0.9571, lr_0 = 6.4767e-04
Validation binary_cross_entropy = 0.092371
Epoch 370
Loss = 4.6171e-02, PNorm = 120.7653, GNorm = 1.5852, lr_0 = 6.4901e-04
Validation binary_cross_entropy = 0.075394
Epoch 371
Loss = 7.2499e-02, PNorm = 120.8626, GNorm = 0.4849, lr_0 = 6.5036e-04
Validation binary_cross_entropy = 0.110736
Epoch 372
Loss = 7.8330e-02, PNorm = 120.9784, GNorm = 1.4382, lr_0 = 6.5170e-04
Validation binary_cross_entropy = 0.112846
Epoch 373
Loss = 1.0855e-01, PNorm = 121.0961, GNorm = 1.4085, lr_0 = 6.5305e-04
Validation binary_cross_entropy = 0.237569
Epoch 374
Loss = 9.6908e-02, PNorm = 121.2365, GNorm = 1.6567, lr_0 = 6.5439e-04
Validation binary_cross_entropy = 0.060909
Epoch 375
Loss = 3.1428e-02, PNorm = 121.3646, GNorm = 0.5459, lr_0 = 6.5574e-04
Validation binary_cross_entropy = 0.072075
Epoch 376
Loss = 3.3399e-02, PNorm = 121.4689, GNorm = 2.4454, lr_0 = 6.5709e-04
Validation binary_cross_entropy = 0.078696
Epoch 377
Loss = 3.9174e-02, PNorm = 121.5527, GNorm = 2.7778, lr_0 = 6.5843e-04
Validation binary_cross_entropy = 0.087190
Epoch 378
Loss = 3.1809e-02, PNorm = 121.6977, GNorm = 1.6776, lr_0 = 6.5978e-04
Validation binary_cross_entropy = 0.048157
Epoch 379
Loss = 1.5270e-02, PNorm = 121.8394, GNorm = 0.3671, lr_0 = 6.6112e-04
Loss = 2.0750e-02, PNorm = 121.9711, GNorm = 0.1155, lr_0 = 6.6247e-04
Validation binary_cross_entropy = 0.191395
Epoch 380
Loss = 9.6611e-02, PNorm = 122.0420, GNorm = 1.6089, lr_0 = 6.6381e-04
Validation binary_cross_entropy = 0.058654
Epoch 381
Loss = 4.4973e-02, PNorm = 122.1358, GNorm = 1.0336, lr_0 = 6.6516e-04
Validation binary_cross_entropy = 0.075631
Epoch 382
Loss = 5.4122e-02, PNorm = 122.2293, GNorm = 0.2362, lr_0 = 6.6650e-04
Validation binary_cross_entropy = 0.045137
Epoch 383
Loss = 7.7607e-02, PNorm = 122.3289, GNorm = 1.4484, lr_0 = 6.6785e-04
Validation binary_cross_entropy = 0.061713
Epoch 384
Loss = 5.6775e-02, PNorm = 122.4423, GNorm = 1.0987, lr_0 = 6.6919e-04
Validation binary_cross_entropy = 0.032230
Epoch 385
Loss = 5.1895e-02, PNorm = 122.5422, GNorm = 0.2516, lr_0 = 6.7054e-04
Validation binary_cross_entropy = 0.073824
Epoch 386
Loss = 7.2324e-02, PNorm = 122.6776, GNorm = 1.4849, lr_0 = 6.7188e-04
Validation binary_cross_entropy = 0.091086
Epoch 387
Loss = 1.1138e-01, PNorm = 122.8136, GNorm = 1.4790, lr_0 = 6.7323e-04
Validation binary_cross_entropy = 0.059618
Epoch 388
Loss = 7.5820e-03, PNorm = 122.9233, GNorm = 0.0654, lr_0 = 6.7457e-04
Validation binary_cross_entropy = 0.031643
Epoch 389
Loss = 1.3206e-01, PNorm = 122.9997, GNorm = 1.6504, lr_0 = 6.7592e-04
Loss = 2.6360e-02, PNorm = 123.0973, GNorm = 0.0962, lr_0 = 6.7726e-04
Validation binary_cross_entropy = 0.080223
Epoch 390
Loss = 4.2983e-02, PNorm = 123.1680, GNorm = 1.7149, lr_0 = 6.7861e-04
Validation binary_cross_entropy = 0.047460
Epoch 391
Loss = 4.2586e-02, PNorm = 123.2275, GNorm = 1.8663, lr_0 = 6.7996e-04
Validation binary_cross_entropy = 0.067505
Epoch 392
Loss = 4.6370e-02, PNorm = 123.3127, GNorm = 1.1548, lr_0 = 6.8130e-04
Validation binary_cross_entropy = 0.048576
Epoch 393
Loss = 2.2930e-02, PNorm = 123.4184, GNorm = 0.2671, lr_0 = 6.8265e-04
Validation binary_cross_entropy = 0.076772
Epoch 394
Loss = 1.1721e-02, PNorm = 123.5067, GNorm = 1.3708, lr_0 = 6.8399e-04
Validation binary_cross_entropy = 0.051528
Epoch 395
Loss = 5.9832e-02, PNorm = 123.5881, GNorm = 0.8855, lr_0 = 6.8534e-04
Validation binary_cross_entropy = 0.051623
Epoch 396
Loss = 3.6656e-02, PNorm = 123.7000, GNorm = 0.6307, lr_0 = 6.8668e-04
Validation binary_cross_entropy = 0.049187
Epoch 397
Loss = 1.8387e-02, PNorm = 123.8237, GNorm = 0.1170, lr_0 = 6.8803e-04
Validation binary_cross_entropy = 0.071474
Epoch 398
Loss = 4.7593e-02, PNorm = 123.9140, GNorm = 0.9301, lr_0 = 6.8937e-04
Validation binary_cross_entropy = 0.042510
Epoch 399
Loss = 2.0381e-02, PNorm = 123.9883, GNorm = 0.5356, lr_0 = 6.9072e-04
Loss = 3.4926e-02, PNorm = 124.0703, GNorm = 0.2630, lr_0 = 6.9206e-04
Validation binary_cross_entropy = 0.084314
Epoch 400
Loss = 3.1529e-02, PNorm = 124.1529, GNorm = 0.3414, lr_0 = 6.9341e-04
Validation binary_cross_entropy = 0.035822
Epoch 401
Loss = 4.3238e-02, PNorm = 124.2512, GNorm = 2.7792, lr_0 = 6.9475e-04
Validation binary_cross_entropy = 0.078064
Epoch 402
Loss = 3.1954e-02, PNorm = 124.4503, GNorm = 0.4035, lr_0 = 6.9610e-04
Validation binary_cross_entropy = 0.082885
Epoch 403
Loss = 4.1466e-02, PNorm = 124.6753, GNorm = 0.6179, lr_0 = 6.9744e-04
Validation binary_cross_entropy = 0.070392
Epoch 404
Loss = 7.4889e-02, PNorm = 124.8287, GNorm = 2.4749, lr_0 = 6.9879e-04
Validation binary_cross_entropy = 0.155469
Epoch 405
Loss = 1.0736e-01, PNorm = 124.9831, GNorm = 3.0442, lr_0 = 7.0013e-04
Validation binary_cross_entropy = 0.151351
Epoch 406
Loss = 8.2078e-02, PNorm = 125.1484, GNorm = 1.2583, lr_0 = 7.0148e-04
Validation binary_cross_entropy = 0.065089
Epoch 407
Loss = 4.3033e-02, PNorm = 125.3057, GNorm = 0.1732, lr_0 = 7.0283e-04
Validation binary_cross_entropy = 0.064883
Epoch 408
Loss = 8.6772e-02, PNorm = 125.4310, GNorm = 2.1654, lr_0 = 7.0417e-04
Validation binary_cross_entropy = 0.218349
Epoch 409
Loss = 8.7766e-03, PNorm = 125.5281, GNorm = 0.3022, lr_0 = 7.0552e-04
Loss = 6.9632e-02, PNorm = 125.6605, GNorm = 1.7015, lr_0 = 7.0686e-04
Validation binary_cross_entropy = 0.086832
Epoch 410
Loss = 5.4356e-02, PNorm = 125.7964, GNorm = 0.6987, lr_0 = 7.0821e-04
Validation binary_cross_entropy = 0.047668
Epoch 411
Loss = 5.5352e-02, PNorm = 125.9200, GNorm = 2.1099, lr_0 = 7.0955e-04
Validation binary_cross_entropy = 0.074751
Epoch 412
Loss = 9.0978e-02, PNorm = 126.0460, GNorm = 0.6362, lr_0 = 7.1090e-04
Validation binary_cross_entropy = 0.115381
Epoch 413
Loss = 6.9188e-02, PNorm = 126.1852, GNorm = 0.5550, lr_0 = 7.1224e-04
Validation binary_cross_entropy = 0.077305
Epoch 414
Loss = 6.8349e-02, PNorm = 126.3320, GNorm = 1.4989, lr_0 = 7.1359e-04
Validation binary_cross_entropy = 0.080180
Epoch 415
Loss = 8.3688e-02, PNorm = 126.4729, GNorm = 2.2201, lr_0 = 7.1493e-04
Validation binary_cross_entropy = 0.087611
Epoch 416
Loss = 7.8295e-02, PNorm = 126.5902, GNorm = 1.5394, lr_0 = 7.1628e-04
Validation binary_cross_entropy = 0.038262
Epoch 417
Loss = 4.1700e-02, PNorm = 126.7421, GNorm = 0.4518, lr_0 = 7.1762e-04
Validation binary_cross_entropy = 0.037283
Epoch 418
Loss = 1.7555e-02, PNorm = 126.8976, GNorm = 0.4846, lr_0 = 7.1897e-04
Validation binary_cross_entropy = 0.047582
Epoch 419
Loss = 1.5585e-02, PNorm = 127.0378, GNorm = 0.2652, lr_0 = 7.2031e-04
Loss = 7.3065e-02, PNorm = 127.1382, GNorm = 1.5800, lr_0 = 7.2166e-04
Validation binary_cross_entropy = 0.064582
Epoch 420
Loss = 9.1655e-02, PNorm = 127.2459, GNorm = 0.4787, lr_0 = 7.2300e-04
Validation binary_cross_entropy = 0.044711
Epoch 421
Loss = 3.6634e-02, PNorm = 127.3929, GNorm = 0.5674, lr_0 = 7.2435e-04
Validation binary_cross_entropy = 0.050270
Epoch 422
Loss = 2.3049e-02, PNorm = 127.5741, GNorm = 0.7266, lr_0 = 7.2570e-04
Validation binary_cross_entropy = 0.092690
Epoch 423
Loss = 5.3647e-02, PNorm = 127.7440, GNorm = 0.0856, lr_0 = 7.2704e-04
Validation binary_cross_entropy = 0.064685
Epoch 424
Loss = 5.5818e-02, PNorm = 127.9179, GNorm = 0.7536, lr_0 = 7.2839e-04
Validation binary_cross_entropy = 0.053473
Epoch 425
Loss = 3.6076e-02, PNorm = 128.0563, GNorm = 0.8348, lr_0 = 7.2973e-04
Validation binary_cross_entropy = 0.093032
Epoch 426
Loss = 4.8160e-02, PNorm = 128.1784, GNorm = 0.0747, lr_0 = 7.3108e-04
Validation binary_cross_entropy = 0.045977
Epoch 427
Loss = 1.1003e-02, PNorm = 128.2763, GNorm = 0.1653, lr_0 = 7.3242e-04
Validation binary_cross_entropy = 0.054483
Epoch 428
Loss = 6.1043e-02, PNorm = 128.3973, GNorm = 2.6065, lr_0 = 7.3377e-04
Validation binary_cross_entropy = 0.058509
Epoch 429
Loss = 5.9843e-03, PNorm = 128.5332, GNorm = 0.1420, lr_0 = 7.3511e-04
Loss = 2.5252e-02, PNorm = 128.6621, GNorm = 3.6422, lr_0 = 7.3646e-04
Validation binary_cross_entropy = 0.063288
Epoch 430
Loss = 4.9971e-02, PNorm = 128.7996, GNorm = 4.4323, lr_0 = 7.3780e-04
Validation binary_cross_entropy = 0.069024
Epoch 431
Loss = 8.0047e-02, PNorm = 129.0212, GNorm = 1.5185, lr_0 = 7.3915e-04
Validation binary_cross_entropy = 0.109858
Epoch 432
Loss = 1.1054e-01, PNorm = 129.2532, GNorm = 0.8087, lr_0 = 7.4049e-04
Validation binary_cross_entropy = 0.100287
Epoch 433
Loss = 7.8165e-02, PNorm = 129.5079, GNorm = 1.4615, lr_0 = 7.4184e-04
Validation binary_cross_entropy = 0.054230
Epoch 434
Loss = 8.4743e-02, PNorm = 129.6825, GNorm = 2.8530, lr_0 = 7.4318e-04
Validation binary_cross_entropy = 0.049887
Epoch 435
Loss = 2.1437e-02, PNorm = 129.8740, GNorm = 1.2950, lr_0 = 7.4453e-04
Validation binary_cross_entropy = 0.077780
Epoch 436
Loss = 5.5461e-02, PNorm = 130.0162, GNorm = 0.9406, lr_0 = 7.4587e-04
Validation binary_cross_entropy = 0.067825
Epoch 437
Loss = 1.7432e-02, PNorm = 130.1419, GNorm = 0.8074, lr_0 = 7.4722e-04
Validation binary_cross_entropy = 0.069979
Epoch 438
Loss = 5.5616e-02, PNorm = 130.2802, GNorm = 2.2646, lr_0 = 7.4857e-04
Validation binary_cross_entropy = 0.093420
Epoch 439
Loss = 3.3824e-02, PNorm = 130.4503, GNorm = 2.6093, lr_0 = 7.4991e-04
Loss = 8.9078e-02, PNorm = 130.6701, GNorm = 1.0942, lr_0 = 7.5126e-04
Validation binary_cross_entropy = 0.083555
Epoch 440
Loss = 7.0345e-02, PNorm = 130.9088, GNorm = 1.8360, lr_0 = 7.5260e-04
Validation binary_cross_entropy = 0.101410
Epoch 441
Loss = 7.0585e-02, PNorm = 131.1257, GNorm = 0.3769, lr_0 = 7.5395e-04
Validation binary_cross_entropy = 0.138787
Epoch 442
Loss = 5.0056e-02, PNorm = 131.3284, GNorm = 0.2170, lr_0 = 7.5529e-04
Validation binary_cross_entropy = 0.045410
Epoch 443
Loss = 3.3563e-02, PNorm = 131.4872, GNorm = 0.0678, lr_0 = 7.5664e-04
Validation binary_cross_entropy = 0.062860
Epoch 444
Loss = 5.7456e-02, PNorm = 131.6196, GNorm = 0.7486, lr_0 = 7.5798e-04
Validation binary_cross_entropy = 0.054864
Epoch 445
Loss = 4.5770e-02, PNorm = 131.7585, GNorm = 1.1015, lr_0 = 7.5933e-04
Validation binary_cross_entropy = 0.139627
Epoch 446
Loss = 6.9846e-02, PNorm = 131.8974, GNorm = 0.8502, lr_0 = 7.6067e-04
Validation binary_cross_entropy = 0.051073
Epoch 447
Loss = 6.4242e-02, PNorm = 132.0639, GNorm = 0.9180, lr_0 = 7.6202e-04
Validation binary_cross_entropy = 0.040496
Epoch 448
Loss = 6.2294e-02, PNorm = 132.1972, GNorm = 0.9281, lr_0 = 7.6336e-04
Validation binary_cross_entropy = 0.050352
Epoch 449
Loss = 2.0678e-01, PNorm = 132.3021, GNorm = 2.3806, lr_0 = 7.6471e-04
Loss = 6.0347e-02, PNorm = 132.3978, GNorm = 1.7800, lr_0 = 7.6605e-04
Validation binary_cross_entropy = 0.071519
Epoch 450
Loss = 3.2960e-02, PNorm = 132.5023, GNorm = 1.6651, lr_0 = 7.6740e-04
Validation binary_cross_entropy = 0.052993
Epoch 451
Loss = 1.9888e-02, PNorm = 132.6251, GNorm = 0.3030, lr_0 = 7.6874e-04
Validation binary_cross_entropy = 0.057721
Epoch 452
Loss = 7.2589e-02, PNorm = 132.7577, GNorm = 2.9677, lr_0 = 7.7009e-04
Validation binary_cross_entropy = 0.028044
Epoch 453
Loss = 8.2935e-02, PNorm = 133.0364, GNorm = 0.2726, lr_0 = 7.7143e-04
Validation binary_cross_entropy = 0.101378
Epoch 454
Loss = 9.9436e-02, PNorm = 133.3296, GNorm = 1.3776, lr_0 = 7.7278e-04
Validation binary_cross_entropy = 0.090561
Epoch 455
Loss = 1.4654e-02, PNorm = 133.5504, GNorm = 0.6908, lr_0 = 7.7413e-04
Validation binary_cross_entropy = 0.046737
Epoch 456
Loss = 8.5828e-03, PNorm = 133.7053, GNorm = 0.1125, lr_0 = 7.7547e-04
Validation binary_cross_entropy = 0.053245
Epoch 457
Loss = 6.2727e-03, PNorm = 133.7968, GNorm = 0.2924, lr_0 = 7.7682e-04
Validation binary_cross_entropy = 0.056418
Epoch 458
Loss = 3.8730e-02, PNorm = 133.8707, GNorm = 0.0429, lr_0 = 7.7816e-04
Validation binary_cross_entropy = 0.112748
Epoch 459
Loss = 5.9998e-03, PNorm = 133.9368, GNorm = 0.3920, lr_0 = 7.7951e-04
Loss = 4.3398e-02, PNorm = 134.0234, GNorm = 1.8006, lr_0 = 7.8085e-04
Validation binary_cross_entropy = 0.047360
Epoch 460
Loss = 4.8801e-02, PNorm = 134.1380, GNorm = 0.5074, lr_0 = 7.8220e-04
Validation binary_cross_entropy = 0.032974
Epoch 461
Loss = 4.5559e-02, PNorm = 134.2767, GNorm = 1.1838, lr_0 = 7.8354e-04
Validation binary_cross_entropy = 0.265545
Epoch 462
Loss = 1.1471e-01, PNorm = 134.4162, GNorm = 3.4179, lr_0 = 7.8489e-04
Validation binary_cross_entropy = 0.100982
Epoch 463
Loss = 6.6086e-02, PNorm = 134.5780, GNorm = 0.8672, lr_0 = 7.8623e-04
Validation binary_cross_entropy = 0.060138
Epoch 464
Loss = 4.2943e-02, PNorm = 134.7203, GNorm = 1.5340, lr_0 = 7.8758e-04
Validation binary_cross_entropy = 0.038126
Epoch 465
Loss = 9.6009e-02, PNorm = 134.8328, GNorm = 1.4099, lr_0 = 7.8892e-04
Validation binary_cross_entropy = 0.120887
Epoch 466
Loss = 3.6033e-02, PNorm = 134.9801, GNorm = 1.0601, lr_0 = 7.9027e-04
Validation binary_cross_entropy = 0.103085
Epoch 467
Loss = 6.7550e-02, PNorm = 135.1498, GNorm = 1.3993, lr_0 = 7.9161e-04
Validation binary_cross_entropy = 0.114592
Epoch 468
Loss = 3.5234e-02, PNorm = 135.3033, GNorm = 2.9070, lr_0 = 7.9296e-04
Validation binary_cross_entropy = 0.082584
Epoch 469
Loss = 1.3639e-01, PNorm = 135.4532, GNorm = 1.5028, lr_0 = 7.9430e-04
Loss = 1.1688e-01, PNorm = 135.5892, GNorm = 0.9161, lr_0 = 7.9565e-04
Validation binary_cross_entropy = 0.091284
Epoch 470
Loss = 8.0520e-02, PNorm = 135.7432, GNorm = 0.5080, lr_0 = 7.9700e-04
Validation binary_cross_entropy = 0.094836
Epoch 471
Loss = 4.8875e-02, PNorm = 135.9184, GNorm = 0.2589, lr_0 = 7.9834e-04
Validation binary_cross_entropy = 0.067637
Epoch 472
Loss = 6.8933e-02, PNorm = 136.0691, GNorm = 1.2252, lr_0 = 7.9969e-04
Validation binary_cross_entropy = 0.037906
Epoch 473
Loss = 3.2221e-02, PNorm = 136.2198, GNorm = 1.1304, lr_0 = 8.0103e-04
Validation binary_cross_entropy = 0.066976
Epoch 474
Loss = 6.0910e-02, PNorm = 136.3471, GNorm = 1.8774, lr_0 = 8.0238e-04
Validation binary_cross_entropy = 0.101313
Epoch 475
Loss = 2.4119e-02, PNorm = 136.4893, GNorm = 0.4277, lr_0 = 8.0372e-04
Validation binary_cross_entropy = 0.067490
Epoch 476
Loss = 6.1326e-02, PNorm = 136.5981, GNorm = 1.6827, lr_0 = 8.0507e-04
Validation binary_cross_entropy = 0.033236
Epoch 477
Loss = 4.7146e-02, PNorm = 136.7088, GNorm = 0.7908, lr_0 = 8.0641e-04
Validation binary_cross_entropy = 0.051975
Epoch 478
Loss = 2.4740e-02, PNorm = 136.8247, GNorm = 0.0346, lr_0 = 8.0776e-04
Validation binary_cross_entropy = 0.066024
Epoch 479
Loss = 9.1648e-02, PNorm = 136.9435, GNorm = 1.3671, lr_0 = 8.0910e-04
Loss = 5.1597e-02, PNorm = 137.0767, GNorm = 0.8519, lr_0 = 8.1045e-04
Validation binary_cross_entropy = 0.047602
Epoch 480
Loss = 5.4423e-02, PNorm = 137.2244, GNorm = 0.6869, lr_0 = 8.1179e-04
Validation binary_cross_entropy = 0.089192
Epoch 481
Loss = 4.9420e-02, PNorm = 137.3572, GNorm = 1.6322, lr_0 = 8.1314e-04
Validation binary_cross_entropy = 0.066803
Epoch 482
Loss = 4.1522e-02, PNorm = 137.4855, GNorm = 1.4918, lr_0 = 8.1448e-04
Validation binary_cross_entropy = 0.047186
Epoch 483
Loss = 2.3498e-02, PNorm = 137.6120, GNorm = 1.1331, lr_0 = 8.1583e-04
Validation binary_cross_entropy = 0.079919
Epoch 484
Loss = 1.2622e-01, PNorm = 137.7291, GNorm = 1.1124, lr_0 = 8.1717e-04
Validation binary_cross_entropy = 0.125168
Epoch 485
Loss = 4.3248e-02, PNorm = 137.8761, GNorm = 1.1406, lr_0 = 8.1852e-04
Validation binary_cross_entropy = 0.040316
Epoch 486
Loss = 1.3693e-02, PNorm = 138.0400, GNorm = 0.2060, lr_0 = 8.1987e-04
Validation binary_cross_entropy = 0.064673
Epoch 487
Loss = 8.6122e-02, PNorm = 138.1852, GNorm = 0.0848, lr_0 = 8.2121e-04
Validation binary_cross_entropy = 0.071483
Epoch 488
Loss = 9.0455e-02, PNorm = 138.3057, GNorm = 0.4951, lr_0 = 8.2256e-04
Validation binary_cross_entropy = 0.069779
Epoch 489
Loss = 1.5060e-02, PNorm = 138.4146, GNorm = 0.7203, lr_0 = 8.2390e-04
Loss = 2.0800e-02, PNorm = 138.5308, GNorm = 0.1319, lr_0 = 8.2525e-04
Validation binary_cross_entropy = 0.074866
Epoch 490
Loss = 3.7425e-02, PNorm = 138.6341, GNorm = 1.7225, lr_0 = 8.2659e-04
Validation binary_cross_entropy = 0.054729
Epoch 491
Loss = 5.8899e-02, PNorm = 138.7746, GNorm = 0.6540, lr_0 = 8.2794e-04
Validation binary_cross_entropy = 0.135456
Epoch 492
Loss = 6.6137e-02, PNorm = 138.9617, GNorm = 2.1983, lr_0 = 8.2928e-04
Validation binary_cross_entropy = 0.103870
Epoch 493
Loss = 3.8036e-02, PNorm = 139.1285, GNorm = 0.3103, lr_0 = 8.3063e-04
Validation binary_cross_entropy = 0.120772
Epoch 494
Loss = 2.9012e-02, PNorm = 139.3142, GNorm = 1.2582, lr_0 = 8.3197e-04
Validation binary_cross_entropy = 0.102854
Epoch 495
Loss = 4.7075e-02, PNorm = 139.4771, GNorm = 1.6109, lr_0 = 8.3332e-04
Validation binary_cross_entropy = 0.094031
Epoch 496
Loss = 6.3035e-02, PNorm = 139.6317, GNorm = 0.1663, lr_0 = 8.3466e-04
Validation binary_cross_entropy = 0.128637
Epoch 497
Loss = 1.1932e-01, PNorm = 139.8108, GNorm = 2.7447, lr_0 = 8.3601e-04
Validation binary_cross_entropy = 0.076140
Epoch 498
Loss = 2.8187e-02, PNorm = 140.0044, GNorm = 0.6870, lr_0 = 8.3735e-04
Validation binary_cross_entropy = 0.053358
Epoch 499
Loss = 4.6512e-02, PNorm = 140.1663, GNorm = 1.2578, lr_0 = 8.3870e-04
Loss = 3.3254e-02, PNorm = 140.3094, GNorm = 0.5869, lr_0 = 8.4004e-04
Validation binary_cross_entropy = 0.071687
Epoch 500
Loss = 2.2594e-02, PNorm = 140.4407, GNorm = 2.1855, lr_0 = 8.4139e-04
Validation binary_cross_entropy = 0.071046
Epoch 501
Loss = 1.9124e-02, PNorm = 140.5692, GNorm = 1.1896, lr_0 = 8.4274e-04
Validation binary_cross_entropy = 0.079635
Epoch 502
Loss = 6.8659e-02, PNorm = 140.7239, GNorm = 2.7002, lr_0 = 8.4408e-04
Validation binary_cross_entropy = 0.045148
Epoch 503
Loss = 5.5745e-02, PNorm = 140.8985, GNorm = 1.2280, lr_0 = 8.4543e-04
Validation binary_cross_entropy = 0.068588
Epoch 504
Loss = 5.4435e-02, PNorm = 141.0682, GNorm = 3.4876, lr_0 = 8.4677e-04
Validation binary_cross_entropy = 0.163291
Epoch 505
Loss = 3.6581e-02, PNorm = 141.2611, GNorm = 0.7871, lr_0 = 8.4812e-04
Validation binary_cross_entropy = 0.147287
Epoch 506
Loss = 5.3167e-02, PNorm = 141.4584, GNorm = 1.6765, lr_0 = 8.4946e-04
Validation binary_cross_entropy = 0.311528
Epoch 507
Loss = 1.3108e-01, PNorm = 141.7020, GNorm = 2.4672, lr_0 = 8.5081e-04
Validation binary_cross_entropy = 0.110694
Epoch 508
Loss = 7.9099e-02, PNorm = 142.0646, GNorm = 1.2739, lr_0 = 8.5215e-04
Validation binary_cross_entropy = 0.055767
Epoch 509
Loss = 1.0216e-01, PNorm = 142.4095, GNorm = 0.9338, lr_0 = 8.5350e-04
Loss = 6.4979e-02, PNorm = 142.6923, GNorm = 2.0357, lr_0 = 8.5484e-04
Validation binary_cross_entropy = 0.048713
Epoch 510
Loss = 3.9701e-02, PNorm = 142.9200, GNorm = 0.9512, lr_0 = 8.5619e-04
Validation binary_cross_entropy = 0.072496
Epoch 511
Loss = 5.6284e-02, PNorm = 143.1109, GNorm = 1.4137, lr_0 = 8.5753e-04
Validation binary_cross_entropy = 0.097365
Epoch 512
Loss = 6.1492e-02, PNorm = 143.2769, GNorm = 0.2951, lr_0 = 8.5888e-04
Validation binary_cross_entropy = 0.081987
Epoch 513
Loss = 4.7209e-02, PNorm = 143.4701, GNorm = 1.0469, lr_0 = 8.6022e-04
Validation binary_cross_entropy = 0.083489
Epoch 514
Loss = 6.5780e-02, PNorm = 143.6700, GNorm = 2.0692, lr_0 = 8.6157e-04
Validation binary_cross_entropy = 0.053776
Epoch 515
Loss = 5.2856e-02, PNorm = 143.8262, GNorm = 0.9314, lr_0 = 8.6291e-04
Validation binary_cross_entropy = 0.105923
Epoch 516
Loss = 5.1964e-02, PNorm = 143.9922, GNorm = 1.1554, lr_0 = 8.6426e-04
Validation binary_cross_entropy = 0.051668
Epoch 517
Loss = 1.5818e-02, PNorm = 144.1746, GNorm = 0.6384, lr_0 = 8.6561e-04
Validation binary_cross_entropy = 0.068294
Epoch 518
Loss = 1.3874e-02, PNorm = 144.3298, GNorm = 0.1721, lr_0 = 8.6695e-04
Validation binary_cross_entropy = 0.121531
Epoch 519
Loss = 3.5167e-03, PNorm = 144.4979, GNorm = 0.1294, lr_0 = 8.6830e-04
Loss = 6.0214e-02, PNorm = 144.6938, GNorm = 1.1256, lr_0 = 8.6964e-04
Validation binary_cross_entropy = 0.067019
Epoch 520
Loss = 9.8983e-02, PNorm = 144.9803, GNorm = 2.1486, lr_0 = 8.7099e-04
Validation binary_cross_entropy = 0.040155
Epoch 521
Loss = 7.3361e-02, PNorm = 145.2595, GNorm = 0.8123, lr_0 = 8.7233e-04
Validation binary_cross_entropy = 0.057258
Epoch 522
Loss = 6.1392e-02, PNorm = 145.4834, GNorm = 1.0259, lr_0 = 8.7368e-04
Validation binary_cross_entropy = 0.057170
Epoch 523
Loss = 5.7205e-02, PNorm = 145.6593, GNorm = 1.6400, lr_0 = 8.7502e-04
Validation binary_cross_entropy = 0.090500
Epoch 524
Loss = 5.3961e-02, PNorm = 145.8063, GNorm = 0.9256, lr_0 = 8.7637e-04
Validation binary_cross_entropy = 0.067301
Epoch 525
Loss = 5.7759e-02, PNorm = 145.9374, GNorm = 2.4153, lr_0 = 8.7771e-04
Validation binary_cross_entropy = 0.028828
Epoch 526
Loss = 4.6064e-02, PNorm = 146.1007, GNorm = 0.7987, lr_0 = 8.7906e-04
Validation binary_cross_entropy = 0.065362
Epoch 527
Loss = 5.8623e-02, PNorm = 146.2558, GNorm = 0.5592, lr_0 = 8.8040e-04
Validation binary_cross_entropy = 0.078834
Epoch 528
Loss = 7.2478e-02, PNorm = 146.3833, GNorm = 0.0343, lr_0 = 8.8175e-04
Validation binary_cross_entropy = 0.066944
Epoch 529
Loss = 1.1180e-01, PNorm = 146.5102, GNorm = 1.8576, lr_0 = 8.8309e-04
Loss = 4.2755e-02, PNorm = 146.6728, GNorm = 0.1483, lr_0 = 8.8444e-04
Validation binary_cross_entropy = 0.094740
Epoch 530
Loss = 6.8699e-02, PNorm = 146.8309, GNorm = 2.3938, lr_0 = 8.8578e-04
Validation binary_cross_entropy = 0.062106
Epoch 531
Loss = 6.6435e-02, PNorm = 146.9759, GNorm = 0.8507, lr_0 = 8.8713e-04
Validation binary_cross_entropy = 0.056864
Epoch 532
Loss = 3.9175e-02, PNorm = 147.1355, GNorm = 0.3692, lr_0 = 8.8848e-04
Validation binary_cross_entropy = 0.058626
Epoch 533
Loss = 4.9254e-02, PNorm = 147.2808, GNorm = 0.9973, lr_0 = 8.8982e-04
Validation binary_cross_entropy = 0.063596
Epoch 534
Loss = 4.3291e-02, PNorm = 147.4138, GNorm = 1.7644, lr_0 = 8.9117e-04
Validation binary_cross_entropy = 0.067951
Epoch 535
Loss = 1.3882e-01, PNorm = 147.5902, GNorm = 1.3834, lr_0 = 8.9251e-04
Validation binary_cross_entropy = 0.085914
Epoch 536
Loss = 9.1826e-02, PNorm = 147.8773, GNorm = 0.7912, lr_0 = 8.9386e-04
Validation binary_cross_entropy = 0.054664
Epoch 537
Loss = 5.0429e-02, PNorm = 148.1350, GNorm = 1.5457, lr_0 = 8.9520e-04
Validation binary_cross_entropy = 0.070247
Epoch 538
Loss = 5.6834e-03, PNorm = 148.3272, GNorm = 0.3883, lr_0 = 8.9655e-04
Validation binary_cross_entropy = 0.042626
Epoch 539
Loss = 9.5002e-03, PNorm = 148.4937, GNorm = 0.2134, lr_0 = 8.9789e-04
Loss = 4.1245e-02, PNorm = 148.6578, GNorm = 0.6301, lr_0 = 8.9924e-04
Validation binary_cross_entropy = 0.071181
Epoch 540
Loss = 3.7337e-02, PNorm = 148.8003, GNorm = 1.0910, lr_0 = 9.0058e-04
Validation binary_cross_entropy = 0.067001
Epoch 541
Loss = 5.5263e-02, PNorm = 148.9189, GNorm = 2.2687, lr_0 = 9.0193e-04
Validation binary_cross_entropy = 0.109621
Epoch 542
Loss = 6.0660e-02, PNorm = 149.0591, GNorm = 0.9377, lr_0 = 9.0327e-04
Validation binary_cross_entropy = 0.042809
Epoch 543
Loss = 5.8189e-02, PNorm = 149.2062, GNorm = 2.4443, lr_0 = 9.0462e-04
Validation binary_cross_entropy = 0.094741
Epoch 544
Loss = 3.1991e-02, PNorm = 149.3403, GNorm = 0.5034, lr_0 = 9.0596e-04
Validation binary_cross_entropy = 0.101768
Epoch 545
Loss = 5.0924e-02, PNorm = 149.4854, GNorm = 0.1927, lr_0 = 9.0731e-04
Validation binary_cross_entropy = 0.038158
Epoch 546
Loss = 2.3980e-02, PNorm = 149.6560, GNorm = 0.1501, lr_0 = 9.0865e-04
Validation binary_cross_entropy = 0.078695
Epoch 547
Loss = 2.5807e-02, PNorm = 149.7878, GNorm = 1.0110, lr_0 = 9.1000e-04
Validation binary_cross_entropy = 0.049765
Epoch 548
Loss = 9.8777e-03, PNorm = 149.8909, GNorm = 0.7040, lr_0 = 9.1135e-04
Validation binary_cross_entropy = 0.056868
Epoch 549
Loss = 2.8490e-03, PNorm = 149.9970, GNorm = 0.0807, lr_0 = 9.1269e-04
Loss = 4.9199e-03, PNorm = 150.0802, GNorm = 0.1164, lr_0 = 9.1404e-04
Validation binary_cross_entropy = 0.087565
Epoch 550
Loss = 2.8069e-02, PNorm = 150.1307, GNorm = 0.9679, lr_0 = 9.1538e-04
Validation binary_cross_entropy = 0.098829
Epoch 551
Loss = 2.4546e-02, PNorm = 150.2028, GNorm = 0.2009, lr_0 = 9.1673e-04
Validation binary_cross_entropy = 0.079943
Epoch 552
Loss = 4.9665e-02, PNorm = 150.3385, GNorm = 3.4118, lr_0 = 9.1807e-04
Validation binary_cross_entropy = 0.127618
Epoch 553
Loss = 5.1286e-02, PNorm = 150.5234, GNorm = 1.6051, lr_0 = 9.1942e-04
Validation binary_cross_entropy = 0.060069
Epoch 554
Loss = 3.4246e-02, PNorm = 150.7419, GNorm = 1.2318, lr_0 = 9.2076e-04
Validation binary_cross_entropy = 0.070255
Epoch 555
Loss = 5.0375e-02, PNorm = 150.9067, GNorm = 0.1528, lr_0 = 9.2211e-04
Validation binary_cross_entropy = 0.114990
Epoch 556
Loss = 4.6345e-02, PNorm = 151.0436, GNorm = 1.2460, lr_0 = 9.2345e-04
Validation binary_cross_entropy = 0.056409
Epoch 557
Loss = 4.0638e-02, PNorm = 151.1780, GNorm = 0.0957, lr_0 = 9.2480e-04
Validation binary_cross_entropy = 0.090011
Epoch 558
Loss = 4.9569e-03, PNorm = 151.2980, GNorm = 0.1846, lr_0 = 9.2614e-04
Validation binary_cross_entropy = 0.085109
Epoch 559
Loss = 4.5426e-03, PNorm = 151.4193, GNorm = 0.1374, lr_0 = 9.2749e-04
Loss = 3.1289e-02, PNorm = 151.5751, GNorm = 1.2526, lr_0 = 9.2883e-04
Validation binary_cross_entropy = 0.051510
Epoch 560
Loss = 4.0318e-02, PNorm = 151.7302, GNorm = 1.5553, lr_0 = 9.3018e-04
Validation binary_cross_entropy = 0.178917
Epoch 561
Loss = 4.0558e-02, PNorm = 151.9360, GNorm = 1.2381, lr_0 = 9.3152e-04
Validation binary_cross_entropy = 0.094509
Epoch 562
Loss = 3.3526e-02, PNorm = 152.1639, GNorm = 0.4609, lr_0 = 9.3287e-04
Validation binary_cross_entropy = 0.084274
Epoch 563
Loss = 7.9065e-02, PNorm = 152.3729, GNorm = 0.7930, lr_0 = 9.3422e-04
Validation binary_cross_entropy = 0.047149
Epoch 564
Loss = 6.6381e-02, PNorm = 152.5942, GNorm = 0.2194, lr_0 = 9.3556e-04
Validation binary_cross_entropy = 0.170155
Epoch 565
Loss = 5.0712e-02, PNorm = 152.8195, GNorm = 0.8679, lr_0 = 9.3691e-04
Validation binary_cross_entropy = 0.147500
Epoch 566
Loss = 3.2915e-02, PNorm = 153.0147, GNorm = 0.3828, lr_0 = 9.3825e-04
Validation binary_cross_entropy = 0.110474
Epoch 567
Loss = 1.4692e-01, PNorm = 153.1884, GNorm = 1.4086, lr_0 = 9.3960e-04
Validation binary_cross_entropy = 0.066863
Epoch 568
Loss = 1.4863e-02, PNorm = 153.4036, GNorm = 0.2318, lr_0 = 9.4094e-04
Validation binary_cross_entropy = 0.050006
Epoch 569
Loss = 7.7824e-02, PNorm = 153.6224, GNorm = 0.8930, lr_0 = 9.4229e-04
Loss = 3.8957e-02, PNorm = 153.8287, GNorm = 0.0333, lr_0 = 9.4363e-04
Validation binary_cross_entropy = 0.120055
Epoch 570
Loss = 4.6823e-02, PNorm = 154.0336, GNorm = 3.5621, lr_0 = 9.4498e-04
Validation binary_cross_entropy = 0.055200
Epoch 571
Loss = 2.4032e-02, PNorm = 154.2461, GNorm = 1.3388, lr_0 = 9.4632e-04
Validation binary_cross_entropy = 0.114424
Epoch 572
Loss = 1.2922e-01, PNorm = 154.4550, GNorm = 0.4914, lr_0 = 9.4767e-04
Validation binary_cross_entropy = 0.112909
Epoch 573
Loss = 8.9734e-02, PNorm = 154.7419, GNorm = 1.0931, lr_0 = 9.4901e-04
Validation binary_cross_entropy = 0.063268
Epoch 574
Loss = 6.6560e-02, PNorm = 154.9840, GNorm = 0.1514, lr_0 = 9.5036e-04
Validation binary_cross_entropy = 0.043956
Epoch 575
Loss = 4.2273e-02, PNorm = 155.1835, GNorm = 3.7730, lr_0 = 9.5170e-04
Validation binary_cross_entropy = 0.051799
Epoch 576
Loss = 5.8554e-02, PNorm = 155.3773, GNorm = 0.9985, lr_0 = 9.5305e-04
Validation binary_cross_entropy = 0.087731
Epoch 577
Loss = 1.0432e-02, PNorm = 155.5634, GNorm = 1.6074, lr_0 = 9.5439e-04
Validation binary_cross_entropy = 0.062657
Epoch 578
Loss = 8.8436e-02, PNorm = 155.7275, GNorm = 0.7793, lr_0 = 9.5574e-04
Validation binary_cross_entropy = 0.044154
Epoch 579
Loss = 1.0246e-01, PNorm = 155.9195, GNorm = 1.8160, lr_0 = 9.5709e-04
Loss = 3.2128e-02, PNorm = 156.1243, GNorm = 1.3710, lr_0 = 9.5843e-04
Validation binary_cross_entropy = 0.070360
Epoch 580
Loss = 7.4290e-02, PNorm = 156.3312, GNorm = 1.0737, lr_0 = 9.5978e-04
Validation binary_cross_entropy = 0.083229
Epoch 581
Loss = 2.9997e-02, PNorm = 156.5380, GNorm = 0.9629, lr_0 = 9.6112e-04
Validation binary_cross_entropy = 0.049348
Epoch 582
Loss = 2.0707e-02, PNorm = 156.7151, GNorm = 0.3626, lr_0 = 9.6247e-04
Validation binary_cross_entropy = 0.091399
Epoch 583
Loss = 8.6348e-02, PNorm = 156.8885, GNorm = 0.0933, lr_0 = 9.6381e-04
Validation binary_cross_entropy = 0.115635
Epoch 584
Loss = 6.4630e-02, PNorm = 157.1385, GNorm = 0.4298, lr_0 = 9.6516e-04
Validation binary_cross_entropy = 0.064269
Epoch 585
Loss = 7.5881e-02, PNorm = 157.3812, GNorm = 1.3273, lr_0 = 9.6650e-04
Validation binary_cross_entropy = 0.033946
Epoch 586
Loss = 6.8242e-02, PNorm = 157.6364, GNorm = 0.4230, lr_0 = 9.6785e-04
Validation binary_cross_entropy = 0.050861
Epoch 587
Loss = 4.6190e-02, PNorm = 157.8777, GNorm = 1.2731, lr_0 = 9.6919e-04
Validation binary_cross_entropy = 0.065744
Epoch 588
Loss = 1.9030e-02, PNorm = 158.1031, GNorm = 0.8842, lr_0 = 9.7054e-04
Validation binary_cross_entropy = 0.038328
Epoch 589
Loss = 4.3969e-02, PNorm = 158.2993, GNorm = 0.8242, lr_0 = 9.7188e-04
Loss = 4.2815e-02, PNorm = 158.4881, GNorm = 2.2544, lr_0 = 9.7323e-04
Validation binary_cross_entropy = 0.046012
Epoch 590
Loss = 7.9672e-02, PNorm = 158.6339, GNorm = 1.3154, lr_0 = 9.7457e-04
Validation binary_cross_entropy = 0.072954
Epoch 591
Loss = 4.0934e-02, PNorm = 158.8112, GNorm = 0.2819, lr_0 = 9.7592e-04
Validation binary_cross_entropy = 0.063474
Epoch 592
Loss = 8.5115e-02, PNorm = 158.9803, GNorm = 0.6294, lr_0 = 9.7726e-04
Validation binary_cross_entropy = 0.052336
Epoch 593
Loss = 2.9714e-02, PNorm = 159.1693, GNorm = 0.2688, lr_0 = 9.7861e-04
Validation binary_cross_entropy = 0.075606
Epoch 594
Loss = 3.8869e-02, PNorm = 159.3424, GNorm = 1.5766, lr_0 = 9.7996e-04
Validation binary_cross_entropy = 0.057573
Epoch 595
Loss = 5.9182e-03, PNorm = 159.5407, GNorm = 0.6148, lr_0 = 9.8130e-04
Validation binary_cross_entropy = 0.044397
Epoch 596
Loss = 7.6382e-02, PNorm = 159.6976, GNorm = 0.2841, lr_0 = 9.8265e-04
Validation binary_cross_entropy = 0.064922
Epoch 597
Loss = 3.3352e-02, PNorm = 159.8698, GNorm = 0.3097, lr_0 = 9.8399e-04
Validation binary_cross_entropy = 0.051061
Epoch 598
Loss = 3.1311e-02, PNorm = 160.0582, GNorm = 1.6767, lr_0 = 9.8534e-04
Validation binary_cross_entropy = 0.102849
Epoch 599
Loss = 3.5298e-03, PNorm = 160.3622, GNorm = 0.2938, lr_0 = 9.8668e-04
Loss = 9.2552e-02, PNorm = 160.7712, GNorm = 1.0470, lr_0 = 9.8803e-04
Validation binary_cross_entropy = 0.077643
Epoch 600
Loss = 7.1031e-02, PNorm = 161.2124, GNorm = 0.7647, lr_0 = 9.8937e-04
Validation binary_cross_entropy = 0.155054
Epoch 601
Loss = 5.1205e-02, PNorm = 161.5744, GNorm = 1.4027, lr_0 = 9.9072e-04
Validation binary_cross_entropy = 0.068227
Epoch 602
Loss = 6.3662e-02, PNorm = 161.8764, GNorm = 3.2617, lr_0 = 9.9206e-04
Validation binary_cross_entropy = 0.067258
Epoch 603
Loss = 9.8015e-02, PNorm = 162.2197, GNorm = 0.8053, lr_0 = 9.9341e-04
Validation binary_cross_entropy = 0.184565
Epoch 604
Loss = 1.0956e-01, PNorm = 162.5491, GNorm = 2.6218, lr_0 = 9.9475e-04
Validation binary_cross_entropy = 0.121967
Epoch 605
Loss = 3.7373e-02, PNorm = 162.8477, GNorm = 1.8110, lr_0 = 9.9610e-04
Validation binary_cross_entropy = 0.045527
Epoch 606
Loss = 6.2661e-02, PNorm = 163.1154, GNorm = 1.7828, lr_0 = 9.9744e-04
Validation binary_cross_entropy = 0.047527
Epoch 607
Loss = 3.8238e-02, PNorm = 163.3998, GNorm = 0.4075, lr_0 = 9.9879e-04
Validation binary_cross_entropy = 0.239438
Epoch 608
Loss = 1.2461e-01, PNorm = 163.6564, GNorm = 1.8824, lr_0 = 1.0000e-03
Validation binary_cross_entropy = 0.051824
Epoch 609
Loss = 2.5150e-02, PNorm = 163.9163, GNorm = 0.2909, lr_0 = 1.0000e-03
Loss = 4.3786e-02, PNorm = 164.1776, GNorm = 0.3022, lr_0 = 1.0000e-03
Validation binary_cross_entropy = 0.075254
Epoch 610
Loss = 4.1785e-02, PNorm = 164.3953, GNorm = 2.8293, lr_0 = 1.0000e-03
Validation binary_cross_entropy = 0.161873
Epoch 611
Loss = 9.6908e-02, PNorm = 164.6266, GNorm = 2.1562, lr_0 = 1.0000e-03
Validation binary_cross_entropy = 0.061042
Epoch 612
Loss = 5.2999e-02, PNorm = 164.8776, GNorm = 0.9460, lr_0 = 1.0000e-03
Validation binary_cross_entropy = 0.049906
Epoch 613
Loss = 5.4205e-02, PNorm = 165.0931, GNorm = 1.6310, lr_0 = 1.0000e-03
Validation binary_cross_entropy = 0.071204
Epoch 614
Loss = 3.3044e-02, PNorm = 165.2698, GNorm = 0.1545, lr_0 = 1.0000e-03
Validation binary_cross_entropy = 0.061517
Epoch 615
Loss = 5.7296e-02, PNorm = 165.4054, GNorm = 0.2447, lr_0 = 9.9999e-04
Validation binary_cross_entropy = 0.053457
Epoch 616
Loss = 3.0405e-02, PNorm = 165.5694, GNorm = 0.1491, lr_0 = 9.9999e-04
Validation binary_cross_entropy = 0.089533
Epoch 617
Loss = 4.5486e-02, PNorm = 165.7293, GNorm = 0.1055, lr_0 = 9.9999e-04
Validation binary_cross_entropy = 0.104053
Epoch 618
Loss = 1.2304e-02, PNorm = 165.8949, GNorm = 0.1551, lr_0 = 9.9999e-04
Validation binary_cross_entropy = 0.077008
Epoch 619
Loss = 4.9944e-02, PNorm = 166.0623, GNorm = 1.2089, lr_0 = 9.9999e-04
Loss = 4.5199e-02, PNorm = 166.2890, GNorm = 0.6376, lr_0 = 9.9999e-04
Validation binary_cross_entropy = 0.067317
Epoch 620
Loss = 1.2082e-02, PNorm = 166.4858, GNorm = 1.5051, lr_0 = 9.9999e-04
Validation binary_cross_entropy = 0.110852
Epoch 621
Loss = 7.7120e-02, PNorm = 166.6752, GNorm = 4.3388, lr_0 = 9.9999e-04
Validation binary_cross_entropy = 0.057732
Epoch 622
Loss = 8.6047e-02, PNorm = 166.8885, GNorm = 1.8456, lr_0 = 9.9999e-04
Validation binary_cross_entropy = 0.101812
Epoch 623
Loss = 7.1305e-02, PNorm = 167.1389, GNorm = 2.6022, lr_0 = 9.9999e-04
Validation binary_cross_entropy = 0.066682
Epoch 624
Loss = 6.0957e-02, PNorm = 167.3498, GNorm = 0.8839, lr_0 = 9.9999e-04
Validation binary_cross_entropy = 0.056521
Epoch 625
Loss = 1.9807e-02, PNorm = 167.5276, GNorm = 0.3890, lr_0 = 9.9999e-04
Validation binary_cross_entropy = 0.049263
Epoch 626
Loss = 7.0907e-02, PNorm = 167.6958, GNorm = 1.1085, lr_0 = 9.9999e-04
Validation binary_cross_entropy = 0.101508
Epoch 627
Loss = 4.1645e-02, PNorm = 167.8858, GNorm = 0.8191, lr_0 = 9.9999e-04
Validation binary_cross_entropy = 0.069569
Epoch 628
Loss = 2.3901e-02, PNorm = 168.0798, GNorm = 0.4510, lr_0 = 9.9998e-04
Validation binary_cross_entropy = 0.072715
Epoch 629
Loss = 1.8832e-02, PNorm = 168.2513, GNorm = 0.5859, lr_0 = 9.9998e-04
Loss = 2.1080e-02, PNorm = 168.3770, GNorm = 0.8706, lr_0 = 9.9998e-04
Validation binary_cross_entropy = 0.060477
Epoch 630
Loss = 4.1096e-02, PNorm = 168.4851, GNorm = 1.2261, lr_0 = 9.9998e-04
Validation binary_cross_entropy = 0.067996
Epoch 631
Loss = 2.2141e-02, PNorm = 168.6271, GNorm = 0.2833, lr_0 = 9.9998e-04
Validation binary_cross_entropy = 0.134695
Epoch 632
Loss = 4.3059e-02, PNorm = 168.7602, GNorm = 1.3415, lr_0 = 9.9998e-04
Validation binary_cross_entropy = 0.093927
Epoch 633
Loss = 3.8106e-02, PNorm = 168.8943, GNorm = 0.3088, lr_0 = 9.9998e-04
Validation binary_cross_entropy = 0.125318
Epoch 634
Loss = 3.0446e-02, PNorm = 169.0856, GNorm = 1.0151, lr_0 = 9.9998e-04
Validation binary_cross_entropy = 0.040462
Epoch 635
Loss = 3.9498e-02, PNorm = 169.2949, GNorm = 2.2432, lr_0 = 9.9998e-04
Validation binary_cross_entropy = 0.106077
Epoch 636
Loss = 1.6008e-01, PNorm = 169.5201, GNorm = 0.1385, lr_0 = 9.9998e-04
Validation binary_cross_entropy = 0.133681
Epoch 637
Loss = 1.1230e-01, PNorm = 169.7668, GNorm = 0.9861, lr_0 = 9.9998e-04
Validation binary_cross_entropy = 0.077135
Epoch 638
Loss = 1.1016e-01, PNorm = 170.0893, GNorm = 2.0948, lr_0 = 9.9998e-04
Validation binary_cross_entropy = 0.063852
Epoch 639
Loss = 5.1706e-03, PNorm = 170.3877, GNorm = 0.1204, lr_0 = 9.9998e-04
Loss = 2.0111e-02, PNorm = 170.6399, GNorm = 1.1643, lr_0 = 9.9998e-04
Validation binary_cross_entropy = 0.122270
Epoch 640
Loss = 6.6778e-02, PNorm = 170.8493, GNorm = 1.9277, lr_0 = 9.9998e-04
Validation binary_cross_entropy = 0.140876
Epoch 641
Loss = 6.4962e-02, PNorm = 171.0521, GNorm = 0.6330, lr_0 = 9.9997e-04
Validation binary_cross_entropy = 0.056517
Epoch 642
Loss = 3.2076e-02, PNorm = 171.2602, GNorm = 0.0885, lr_0 = 9.9997e-04
Validation binary_cross_entropy = 0.153249
Epoch 643
Loss = 4.8533e-02, PNorm = 171.4440, GNorm = 1.4397, lr_0 = 9.9997e-04
Validation binary_cross_entropy = 0.080412
Epoch 644
Loss = 3.2366e-02, PNorm = 171.6329, GNorm = 0.3819, lr_0 = 9.9997e-04
Validation binary_cross_entropy = 0.054080
Epoch 645
Loss = 2.7670e-02, PNorm = 171.8099, GNorm = 0.2105, lr_0 = 9.9997e-04
Validation binary_cross_entropy = 0.063931
Epoch 646
Loss = 3.7549e-02, PNorm = 171.9402, GNorm = 2.0206, lr_0 = 9.9997e-04
Validation binary_cross_entropy = 0.064918
Epoch 647
Loss = 6.6126e-02, PNorm = 172.1258, GNorm = 2.9200, lr_0 = 9.9997e-04
Validation binary_cross_entropy = 0.086151
Epoch 648
Loss = 1.4545e-02, PNorm = 172.3594, GNorm = 0.5176, lr_0 = 9.9997e-04
Validation binary_cross_entropy = 0.132303
Epoch 649
Loss = 2.8685e-02, PNorm = 172.6077, GNorm = 0.9932, lr_0 = 9.9997e-04
Loss = 8.4254e-02, PNorm = 172.8296, GNorm = 0.2353, lr_0 = 9.9997e-04
Validation binary_cross_entropy = 0.041621
Epoch 650
Loss = 4.1201e-02, PNorm = 173.0421, GNorm = 0.6287, lr_0 = 9.9997e-04
Validation binary_cross_entropy = 0.071439
Epoch 651
Loss = 4.6745e-02, PNorm = 173.2444, GNorm = 2.3433, lr_0 = 9.9997e-04
Validation binary_cross_entropy = 0.100675
Epoch 652
Loss = 4.1768e-02, PNorm = 173.4563, GNorm = 3.5265, lr_0 = 9.9997e-04
Validation binary_cross_entropy = 0.106065
Epoch 653
Loss = 5.9454e-02, PNorm = 173.6861, GNorm = 1.0640, lr_0 = 9.9997e-04
Validation binary_cross_entropy = 0.042138
Epoch 654
Loss = 9.0777e-02, PNorm = 173.9203, GNorm = 1.3400, lr_0 = 9.9996e-04
Validation binary_cross_entropy = 0.073370
Epoch 655
Loss = 3.4649e-02, PNorm = 174.1652, GNorm = 0.3760, lr_0 = 9.9996e-04
Validation binary_cross_entropy = 0.101437
Epoch 656
Loss = 1.5214e-02, PNorm = 174.3552, GNorm = 3.2859, lr_0 = 9.9996e-04
Validation binary_cross_entropy = 0.310189
Epoch 657
Loss = 1.2398e-01, PNorm = 174.5240, GNorm = 1.0922, lr_0 = 9.9996e-04
Validation binary_cross_entropy = 0.059455
Epoch 658
Loss = 4.5473e-02, PNorm = 174.8004, GNorm = 0.8773, lr_0 = 9.9996e-04
Validation binary_cross_entropy = 0.065962
Epoch 659
Loss = 3.1729e-02, PNorm = 175.0817, GNorm = 0.3912, lr_0 = 9.9996e-04
Loss = 2.3269e-02, PNorm = 175.2961, GNorm = 0.8438, lr_0 = 9.9996e-04
Validation binary_cross_entropy = 0.108378
Epoch 660
Loss = 4.5556e-02, PNorm = 175.4592, GNorm = 3.2369, lr_0 = 9.9996e-04
Validation binary_cross_entropy = 0.122848
Epoch 661
Loss = 4.0048e-02, PNorm = 175.6428, GNorm = 2.0851, lr_0 = 9.9996e-04
Validation binary_cross_entropy = 0.081390
Epoch 662
Loss = 5.9586e-02, PNorm = 175.8244, GNorm = 1.4665, lr_0 = 9.9996e-04
Validation binary_cross_entropy = 0.058366
Epoch 663
Loss = 4.9834e-02, PNorm = 175.9935, GNorm = 1.1366, lr_0 = 9.9996e-04
Validation binary_cross_entropy = 0.053213
Epoch 664
Loss = 2.6387e-02, PNorm = 176.1465, GNorm = 0.3021, lr_0 = 9.9996e-04
Validation binary_cross_entropy = 0.099413
Epoch 665
Loss = 2.7953e-02, PNorm = 176.2971, GNorm = 0.2327, lr_0 = 9.9996e-04
Validation binary_cross_entropy = 0.040518
Epoch 666
Loss = 4.8294e-02, PNorm = 176.4774, GNorm = 1.6404, lr_0 = 9.9996e-04
Validation binary_cross_entropy = 0.064890
Epoch 667
Loss = 4.6475e-02, PNorm = 176.6406, GNorm = 0.7635, lr_0 = 9.9996e-04
Validation binary_cross_entropy = 0.086121
Epoch 668
Loss = 6.0674e-03, PNorm = 176.7929, GNorm = 0.6978, lr_0 = 9.9995e-04
Validation binary_cross_entropy = 0.059742
Epoch 669
Loss = 4.2939e-03, PNorm = 176.9356, GNorm = 0.1068, lr_0 = 9.9995e-04
Loss = 4.4929e-02, PNorm = 177.0811, GNorm = 1.2321, lr_0 = 9.9995e-04
Validation binary_cross_entropy = 0.063489
Epoch 670
Loss = 7.9073e-02, PNorm = 177.2265, GNorm = 0.5903, lr_0 = 9.9995e-04
Validation binary_cross_entropy = 0.054322
Epoch 671
Loss = 3.8318e-02, PNorm = 177.3825, GNorm = 0.6970, lr_0 = 9.9995e-04
Validation binary_cross_entropy = 0.066111
Epoch 672
Loss = 3.2665e-02, PNorm = 177.5292, GNorm = 2.2111, lr_0 = 9.9995e-04
Validation binary_cross_entropy = 0.085474
Epoch 673
Loss = 6.9899e-02, PNorm = 177.6635, GNorm = 0.2225, lr_0 = 9.9995e-04
Validation binary_cross_entropy = 0.095802
Epoch 674
Loss = 4.2787e-02, PNorm = 177.8526, GNorm = 0.7860, lr_0 = 9.9995e-04
Validation binary_cross_entropy = 0.082602
Epoch 675
Loss = 4.9918e-02, PNorm = 178.0651, GNorm = 0.6218, lr_0 = 9.9995e-04
Validation binary_cross_entropy = 0.057770
Epoch 676
Loss = 1.8634e-02, PNorm = 178.2551, GNorm = 1.9495, lr_0 = 9.9995e-04
Validation binary_cross_entropy = 0.084243
Epoch 677
Loss = 7.6012e-02, PNorm = 178.4155, GNorm = 0.9567, lr_0 = 9.9995e-04
Validation binary_cross_entropy = 0.045449
Epoch 678
Loss = 2.3215e-02, PNorm = 178.5602, GNorm = 1.3267, lr_0 = 9.9995e-04
Validation binary_cross_entropy = 0.081751
Epoch 679
Loss = 3.6065e-02, PNorm = 178.6913, GNorm = 1.2271, lr_0 = 9.9995e-04
Loss = 1.6497e-02, PNorm = 178.8187, GNorm = 0.0484, lr_0 = 9.9995e-04
Validation binary_cross_entropy = 0.082999
Epoch 680
Loss = 4.9368e-02, PNorm = 178.9354, GNorm = 0.1532, lr_0 = 9.9994e-04
Validation binary_cross_entropy = 0.076146
Epoch 681
Loss = 5.2850e-02, PNorm = 179.0944, GNorm = 1.8600, lr_0 = 9.9994e-04
Validation binary_cross_entropy = 0.075307
Epoch 682
Loss = 8.2847e-02, PNorm = 179.3025, GNorm = 1.1968, lr_0 = 9.9994e-04
Validation binary_cross_entropy = 0.078393
Epoch 683
Loss = 3.8267e-02, PNorm = 179.5016, GNorm = 0.1998, lr_0 = 9.9994e-04
Validation binary_cross_entropy = 0.060369
Epoch 684
Loss = 7.2093e-03, PNorm = 179.7028, GNorm = 0.2298, lr_0 = 9.9994e-04
Validation binary_cross_entropy = 0.087792
Epoch 685
Loss = 5.7270e-02, PNorm = 179.8375, GNorm = 1.6866, lr_0 = 9.9994e-04
Validation binary_cross_entropy = 0.066811
Epoch 686
Loss = 8.2270e-03, PNorm = 180.0008, GNorm = 0.3461, lr_0 = 9.9994e-04
Validation binary_cross_entropy = 0.078306
Epoch 687
Loss = 3.0659e-02, PNorm = 180.1487, GNorm = 1.5060, lr_0 = 9.9994e-04
Validation binary_cross_entropy = 0.053420
Epoch 688
Loss = 1.6862e-02, PNorm = 180.3195, GNorm = 0.9268, lr_0 = 9.9994e-04
Validation binary_cross_entropy = 0.043024
Epoch 689
Loss = 2.7624e-02, PNorm = 180.4784, GNorm = 0.8287, lr_0 = 9.9994e-04
Loss = 2.1828e-02, PNorm = 180.6204, GNorm = 0.0422, lr_0 = 9.9994e-04
Validation binary_cross_entropy = 0.147552
Epoch 690
Loss = 4.3195e-02, PNorm = 180.7579, GNorm = 0.3835, lr_0 = 9.9994e-04
Validation binary_cross_entropy = 0.048807
Epoch 691
Loss = 1.2194e-02, PNorm = 180.9054, GNorm = 0.0332, lr_0 = 9.9994e-04
Validation binary_cross_entropy = 0.110759
Epoch 692
Loss = 5.0269e-02, PNorm = 181.0175, GNorm = 1.2992, lr_0 = 9.9994e-04
Validation binary_cross_entropy = 0.053072
Epoch 693
Loss = 4.9371e-02, PNorm = 181.1495, GNorm = 2.4247, lr_0 = 9.9994e-04
Validation binary_cross_entropy = 0.053444
Epoch 694
Loss = 6.0402e-02, PNorm = 181.3344, GNorm = 1.7518, lr_0 = 9.9993e-04
Validation binary_cross_entropy = 0.079845
Epoch 695
Loss = 9.0009e-02, PNorm = 181.5058, GNorm = 1.7764, lr_0 = 9.9993e-04
Validation binary_cross_entropy = 0.067350
Epoch 696
Loss = 4.3625e-02, PNorm = 181.6807, GNorm = 0.6319, lr_0 = 9.9993e-04
Validation binary_cross_entropy = 0.062610
Epoch 697
Loss = 3.2331e-02, PNorm = 181.8360, GNorm = 0.9027, lr_0 = 9.9993e-04
Validation binary_cross_entropy = 0.063575
Epoch 698
Loss = 5.0606e-02, PNorm = 181.9857, GNorm = 1.8747, lr_0 = 9.9993e-04
Validation binary_cross_entropy = 0.073031
Epoch 699
Loss = 7.2827e-03, PNorm = 182.1382, GNorm = 0.1606, lr_0 = 9.9993e-04
Loss = 3.5541e-02, PNorm = 182.2698, GNorm = 1.8269, lr_0 = 9.9993e-04
Validation binary_cross_entropy = 0.052180
Epoch 700
Loss = 2.2132e-02, PNorm = 182.4188, GNorm = 0.0957, lr_0 = 9.9993e-04
Validation binary_cross_entropy = 0.126880
Epoch 701
Loss = 4.1638e-02, PNorm = 182.6339, GNorm = 0.1168, lr_0 = 9.9993e-04
Validation binary_cross_entropy = 0.090192
Epoch 702
Loss = 2.4020e-02, PNorm = 182.9609, GNorm = 2.0274, lr_0 = 9.9993e-04
Validation binary_cross_entropy = 0.103074
Epoch 703
Loss = 4.4241e-02, PNorm = 183.2790, GNorm = 0.8522, lr_0 = 9.9993e-04
Validation binary_cross_entropy = 0.069138
Epoch 704
Loss = 5.2875e-02, PNorm = 183.5790, GNorm = 1.3158, lr_0 = 9.9993e-04
Validation binary_cross_entropy = 0.080461
Epoch 705
Loss = 3.1757e-02, PNorm = 183.8663, GNorm = 0.8975, lr_0 = 9.9993e-04
Validation binary_cross_entropy = 0.145970
Epoch 706
Loss = 4.8217e-02, PNorm = 184.0883, GNorm = 0.0566, lr_0 = 9.9993e-04
Validation binary_cross_entropy = 0.058618
Epoch 707
Loss = 3.1802e-02, PNorm = 184.3084, GNorm = 0.1246, lr_0 = 9.9992e-04
Validation binary_cross_entropy = 0.083460
Epoch 708
Loss = 4.6786e-02, PNorm = 184.5357, GNorm = 2.4423, lr_0 = 9.9992e-04
Validation binary_cross_entropy = 0.130980
Epoch 709
Loss = 6.6398e-03, PNorm = 184.7133, GNorm = 0.2141, lr_0 = 9.9992e-04
Loss = 6.0780e-02, PNorm = 184.8586, GNorm = 2.8729, lr_0 = 9.9992e-04
Validation binary_cross_entropy = 0.169871
Epoch 710
Loss = 4.8065e-02, PNorm = 185.0356, GNorm = 1.1155, lr_0 = 9.9992e-04
Validation binary_cross_entropy = 0.067152
Epoch 711
Loss = 5.7725e-02, PNorm = 185.2036, GNorm = 1.0549, lr_0 = 9.9992e-04
Validation binary_cross_entropy = 0.109261
Epoch 712
Loss = 6.9473e-02, PNorm = 185.3652, GNorm = 2.7197, lr_0 = 9.9992e-04
Validation binary_cross_entropy = 0.062027
Epoch 713
Loss = 4.1687e-02, PNorm = 185.5564, GNorm = 0.8744, lr_0 = 9.9992e-04
Validation binary_cross_entropy = 0.054686
Epoch 714
Loss = 3.9407e-02, PNorm = 185.7220, GNorm = 0.1197, lr_0 = 9.9992e-04
Validation binary_cross_entropy = 0.043008
Epoch 715
Loss = 5.3431e-02, PNorm = 185.8688, GNorm = 1.8609, lr_0 = 9.9992e-04
Validation binary_cross_entropy = 0.181703
Epoch 716
Loss = 8.3196e-02, PNorm = 186.0520, GNorm = 1.0044, lr_0 = 9.9992e-04
Validation binary_cross_entropy = 0.052584
Epoch 717
Loss = 2.3197e-02, PNorm = 186.3037, GNorm = 0.8540, lr_0 = 9.9992e-04
Validation binary_cross_entropy = 0.055412
Epoch 718
Loss = 1.9138e-02, PNorm = 186.5223, GNorm = 0.6515, lr_0 = 9.9992e-04
Validation binary_cross_entropy = 0.062429
Epoch 719
Loss = 2.0179e-02, PNorm = 186.6907, GNorm = 0.8276, lr_0 = 9.9992e-04
Loss = 6.7897e-02, PNorm = 186.8341, GNorm = 0.4544, lr_0 = 9.9992e-04
Validation binary_cross_entropy = 0.051204
Epoch 720
Loss = 3.9941e-02, PNorm = 186.9941, GNorm = 0.3063, lr_0 = 9.9991e-04
Validation binary_cross_entropy = 0.074069
Epoch 721
Loss = 4.8287e-02, PNorm = 187.1288, GNorm = 0.9059, lr_0 = 9.9991e-04
Validation binary_cross_entropy = 0.047963
Epoch 722
Loss = 2.0357e-02, PNorm = 187.2923, GNorm = 0.1404, lr_0 = 9.9991e-04
Validation binary_cross_entropy = 0.055888
Epoch 723
Loss = 1.1543e-02, PNorm = 187.4396, GNorm = 0.2334, lr_0 = 9.9991e-04
Validation binary_cross_entropy = 0.074125
Epoch 724
Loss = 3.1295e-02, PNorm = 187.5499, GNorm = 0.6233, lr_0 = 9.9991e-04
Validation binary_cross_entropy = 0.114808
Epoch 725
Loss = 3.7211e-02, PNorm = 187.6844, GNorm = 0.1445, lr_0 = 9.9991e-04
Validation binary_cross_entropy = 0.045540
Epoch 726
Loss = 3.2692e-02, PNorm = 187.8481, GNorm = 0.1319, lr_0 = 9.9991e-04
Validation binary_cross_entropy = 0.072509
Epoch 727
Loss = 2.9574e-02, PNorm = 188.0541, GNorm = 1.3307, lr_0 = 9.9991e-04
Validation binary_cross_entropy = 0.073778
Epoch 728
Loss = 1.0798e-01, PNorm = 188.2981, GNorm = 0.8255, lr_0 = 9.9991e-04
Validation binary_cross_entropy = 0.040751
Epoch 729
Loss = 2.5043e-01, PNorm = 188.5076, GNorm = 2.8210, lr_0 = 9.9991e-04
Loss = 4.2657e-02, PNorm = 188.7115, GNorm = 1.0849, lr_0 = 9.9991e-04
Validation binary_cross_entropy = 0.048766
Epoch 730
Loss = 3.5527e-02, PNorm = 188.9076, GNorm = 0.1737, lr_0 = 9.9991e-04
Validation binary_cross_entropy = 0.069043
Epoch 731
Loss = 4.4522e-02, PNorm = 189.0851, GNorm = 1.1060, lr_0 = 9.9991e-04
Validation binary_cross_entropy = 0.057209
Epoch 732
Loss = 6.0731e-02, PNorm = 189.2452, GNorm = 0.4284, lr_0 = 9.9991e-04
Validation binary_cross_entropy = 0.043859
Epoch 733
Loss = 1.9848e-02, PNorm = 189.4311, GNorm = 1.3928, lr_0 = 9.9990e-04
Validation binary_cross_entropy = 0.043192
Epoch 734
Loss = 3.1779e-02, PNorm = 189.5878, GNorm = 1.5938, lr_0 = 9.9990e-04
Validation binary_cross_entropy = 0.060870
Epoch 735
Loss = 3.4040e-02, PNorm = 189.7215, GNorm = 1.3321, lr_0 = 9.9990e-04
Validation binary_cross_entropy = 0.063216
Epoch 736
Loss = 2.0103e-03, PNorm = 189.8356, GNorm = 0.0363, lr_0 = 9.9990e-04
Validation binary_cross_entropy = 0.085987
Epoch 737
Loss = 1.1364e-02, PNorm = 189.9359, GNorm = 0.0175, lr_0 = 9.9990e-04
Validation binary_cross_entropy = 0.159720
Epoch 738
Loss = 2.5314e-02, PNorm = 190.0345, GNorm = 0.2181, lr_0 = 9.9990e-04
Validation binary_cross_entropy = 0.052679
Epoch 739
Loss = 1.9453e-02, PNorm = 190.1546, GNorm = 0.5856, lr_0 = 9.9990e-04
Loss = 5.1208e-02, PNorm = 190.2898, GNorm = 1.3011, lr_0 = 9.9990e-04
Validation binary_cross_entropy = 0.072916
Epoch 740
Loss = 2.2923e-02, PNorm = 190.4356, GNorm = 0.2094, lr_0 = 9.9990e-04
Validation binary_cross_entropy = 0.084240
Epoch 741
Loss = 1.9487e-02, PNorm = 190.5665, GNorm = 0.0569, lr_0 = 9.9990e-04
Validation binary_cross_entropy = 0.081342
Epoch 742
Loss = 4.1943e-02, PNorm = 190.6756, GNorm = 0.0462, lr_0 = 9.9990e-04
Validation binary_cross_entropy = 0.150771
Epoch 743
Loss = 7.6332e-02, PNorm = 190.8136, GNorm = 0.2437, lr_0 = 9.9990e-04
Validation binary_cross_entropy = 0.054663
Epoch 744
Loss = 3.4990e-02, PNorm = 190.9945, GNorm = 0.7009, lr_0 = 9.9990e-04
Validation binary_cross_entropy = 0.070308
Epoch 745
Loss = 1.8948e-02, PNorm = 191.1628, GNorm = 2.0977, lr_0 = 9.9990e-04
Validation binary_cross_entropy = 0.078099
Epoch 746
Loss = 5.8512e-02, PNorm = 191.3068, GNorm = 0.8599, lr_0 = 9.9990e-04
Validation binary_cross_entropy = 0.052830
Epoch 747
Loss = 2.6302e-02, PNorm = 191.5104, GNorm = 0.3032, lr_0 = 9.9989e-04
Validation binary_cross_entropy = 0.075975
Epoch 748
Loss = 5.3916e-02, PNorm = 191.6923, GNorm = 0.7222, lr_0 = 9.9989e-04
Validation binary_cross_entropy = 0.058406
Epoch 749
Loss = 2.2006e-02, PNorm = 191.8259, GNorm = 0.9521, lr_0 = 9.9989e-04
Loss = 5.0969e-02, PNorm = 191.9530, GNorm = 0.8988, lr_0 = 9.9989e-04
Validation binary_cross_entropy = 0.044703
Epoch 750
Loss = 5.7754e-02, PNorm = 192.0796, GNorm = 0.6155, lr_0 = 9.9989e-04
Validation binary_cross_entropy = 0.044587
Epoch 751
Loss = 2.9683e-02, PNorm = 192.2287, GNorm = 0.0454, lr_0 = 9.9989e-04
Validation binary_cross_entropy = 0.046271
Epoch 752
Loss = 2.4081e-02, PNorm = 192.3720, GNorm = 0.0799, lr_0 = 9.9989e-04
Validation binary_cross_entropy = 0.084539
Epoch 753
Loss = 3.3153e-02, PNorm = 192.5204, GNorm = 0.1984, lr_0 = 9.9989e-04
Validation binary_cross_entropy = 0.048108
Epoch 754
Loss = 6.1601e-02, PNorm = 192.6832, GNorm = 1.5242, lr_0 = 9.9989e-04
Validation binary_cross_entropy = 0.073592
Epoch 755
Loss = 3.2132e-02, PNorm = 192.8589, GNorm = 1.6661, lr_0 = 9.9989e-04
Validation binary_cross_entropy = 0.142683
Epoch 756
Loss = 7.9399e-02, PNorm = 193.0450, GNorm = 0.2739, lr_0 = 9.9989e-04
Validation binary_cross_entropy = 0.040162
Epoch 757
Loss = 2.4753e-02, PNorm = 193.2976, GNorm = 0.2043, lr_0 = 9.9989e-04
Validation binary_cross_entropy = 0.079880
Epoch 758
Loss = 3.1349e-03, PNorm = 193.5007, GNorm = 0.0233, lr_0 = 9.9989e-04
Validation binary_cross_entropy = 0.075016
Epoch 759
Loss = 1.0151e-03, PNorm = 193.6480, GNorm = 0.0409, lr_0 = 9.9989e-04
Loss = 6.3685e-02, PNorm = 193.7902, GNorm = 1.7066, lr_0 = 9.9988e-04
Validation binary_cross_entropy = 0.149728
Epoch 760
Loss = 6.2171e-02, PNorm = 193.9661, GNorm = 0.7246, lr_0 = 9.9988e-04
Validation binary_cross_entropy = 0.046802
Epoch 761
Loss = 5.0586e-02, PNorm = 194.1664, GNorm = 0.3231, lr_0 = 9.9988e-04
Validation binary_cross_entropy = 0.069692
Epoch 762
Loss = 2.8899e-02, PNorm = 194.3213, GNorm = 1.0432, lr_0 = 9.9988e-04
Validation binary_cross_entropy = 0.043783
Epoch 763
Loss = 5.9715e-02, PNorm = 194.4522, GNorm = 0.7253, lr_0 = 9.9988e-04
Validation binary_cross_entropy = 0.054933
Epoch 764
Loss = 2.5572e-02, PNorm = 194.6522, GNorm = 0.3550, lr_0 = 9.9988e-04
Validation binary_cross_entropy = 0.058481
Epoch 765
Loss = 3.9632e-02, PNorm = 194.8495, GNorm = 0.7503, lr_0 = 9.9988e-04
Validation binary_cross_entropy = 0.073978
Epoch 766
Loss = 4.1901e-02, PNorm = 195.0564, GNorm = 0.2457, lr_0 = 9.9988e-04
Validation binary_cross_entropy = 0.051338
Epoch 767
Loss = 3.7252e-02, PNorm = 195.2491, GNorm = 0.5599, lr_0 = 9.9988e-04
Validation binary_cross_entropy = 0.051957
Epoch 768
Loss = 5.5909e-03, PNorm = 195.4297, GNorm = 0.1362, lr_0 = 9.9988e-04
Validation binary_cross_entropy = 0.100003
Epoch 769
Loss = 6.1732e-02, PNorm = 195.5874, GNorm = 0.7758, lr_0 = 9.9988e-04
Loss = 2.2119e-02, PNorm = 195.7654, GNorm = 0.3769, lr_0 = 9.9988e-04
Validation binary_cross_entropy = 0.050972
Epoch 770
Loss = 3.3443e-02, PNorm = 195.9262, GNorm = 0.0961, lr_0 = 9.9988e-04
Validation binary_cross_entropy = 0.063715
Epoch 771
Loss = 8.9305e-02, PNorm = 196.0757, GNorm = 1.8146, lr_0 = 9.9988e-04
Validation binary_cross_entropy = 0.063741
Epoch 772
Loss = 6.0944e-02, PNorm = 196.2699, GNorm = 0.8724, lr_0 = 9.9988e-04
Validation binary_cross_entropy = 0.042739
Epoch 773
Loss = 2.4224e-02, PNorm = 196.4481, GNorm = 0.1788, lr_0 = 9.9987e-04
Validation binary_cross_entropy = 0.058128
Epoch 774
Loss = 2.0435e-02, PNorm = 196.5812, GNorm = 1.0948, lr_0 = 9.9987e-04
Validation binary_cross_entropy = 0.055651
Epoch 775
Loss = 1.2963e-02, PNorm = 196.7080, GNorm = 0.1200, lr_0 = 9.9987e-04
Validation binary_cross_entropy = 0.065223
Epoch 776
Loss = 4.7097e-03, PNorm = 196.8533, GNorm = 0.0290, lr_0 = 9.9987e-04
Validation binary_cross_entropy = 0.061067
Epoch 777
Loss = 4.7313e-02, PNorm = 196.9574, GNorm = 0.1000, lr_0 = 9.9987e-04
Validation binary_cross_entropy = 0.061256
Epoch 778
Loss = 2.9565e-02, PNorm = 197.0496, GNorm = 0.1722, lr_0 = 9.9987e-04
Validation binary_cross_entropy = 0.080371
Epoch 779
Loss = 1.9950e-03, PNorm = 197.1478, GNorm = 0.1046, lr_0 = 9.9987e-04
Loss = 1.8787e-02, PNorm = 197.2110, GNorm = 0.1051, lr_0 = 9.9987e-04
Validation binary_cross_entropy = 0.054329
Epoch 780
Loss = 4.7976e-02, PNorm = 197.2875, GNorm = 0.1220, lr_0 = 9.9987e-04
Validation binary_cross_entropy = 0.052313
Epoch 781
Loss = 1.3113e-02, PNorm = 197.3920, GNorm = 0.4122, lr_0 = 9.9987e-04
Validation binary_cross_entropy = 0.058081
Epoch 782
Loss = 2.4407e-02, PNorm = 197.4918, GNorm = 0.8791, lr_0 = 9.9987e-04
Validation binary_cross_entropy = 0.083532
Epoch 783
Loss = 6.4550e-02, PNorm = 197.6027, GNorm = 1.0795, lr_0 = 9.9987e-04
Validation binary_cross_entropy = 0.042984
Epoch 784
Loss = 2.6729e-02, PNorm = 197.7217, GNorm = 0.2352, lr_0 = 9.9987e-04
Validation binary_cross_entropy = 0.057240
Epoch 785
Loss = 4.3973e-02, PNorm = 197.8501, GNorm = 0.3038, lr_0 = 9.9987e-04
Validation binary_cross_entropy = 0.055339
Epoch 786
Loss = 1.6218e-02, PNorm = 197.9699, GNorm = 0.6472, lr_0 = 9.9986e-04
Validation binary_cross_entropy = 0.048899
Epoch 787
Loss = 3.7125e-02, PNorm = 198.0628, GNorm = 0.6979, lr_0 = 9.9986e-04
Validation binary_cross_entropy = 0.059508
Epoch 788
Loss = 1.2269e-02, PNorm = 198.1902, GNorm = 0.3483, lr_0 = 9.9986e-04
Validation binary_cross_entropy = 0.048795
Epoch 789
Loss = 7.9486e-02, PNorm = 198.3132, GNorm = 0.8349, lr_0 = 9.9986e-04
Loss = 2.0070e-02, PNorm = 198.4088, GNorm = 0.0971, lr_0 = 9.9986e-04
Validation binary_cross_entropy = 0.056267
Epoch 790
Loss = 1.9895e-02, PNorm = 198.4947, GNorm = 1.0173, lr_0 = 9.9986e-04
Validation binary_cross_entropy = 0.070092
Epoch 791
Loss = 6.4449e-02, PNorm = 198.5994, GNorm = 5.9949, lr_0 = 9.9986e-04
Validation binary_cross_entropy = 0.054685
Epoch 792
Loss = 2.0415e-02, PNorm = 198.7628, GNorm = 0.1076, lr_0 = 9.9986e-04
Validation binary_cross_entropy = 0.089364
Epoch 793
Loss = 6.3032e-02, PNorm = 198.9382, GNorm = 0.8804, lr_0 = 9.9986e-04
Validation binary_cross_entropy = 0.060292
Epoch 794
Loss = 5.0831e-02, PNorm = 199.1191, GNorm = 0.4249, lr_0 = 9.9986e-04
Validation binary_cross_entropy = 0.070802
Epoch 795
Loss = 4.5465e-02, PNorm = 199.2838, GNorm = 0.3199, lr_0 = 9.9986e-04
Validation binary_cross_entropy = 0.056973
Epoch 796
Loss = 6.1898e-02, PNorm = 199.4238, GNorm = 0.1715, lr_0 = 9.9986e-04
Validation binary_cross_entropy = 0.041155
Epoch 797
Loss = 3.7969e-02, PNorm = 199.5961, GNorm = 0.1240, lr_0 = 9.9986e-04
Validation binary_cross_entropy = 0.057794
Epoch 798
Loss = 9.2105e-03, PNorm = 199.7569, GNorm = 0.6321, lr_0 = 9.9986e-04
Validation binary_cross_entropy = 0.039926
Epoch 799
Loss = 4.0920e-02, PNorm = 199.9900, GNorm = 0.7282, lr_0 = 9.9986e-04
Loss = 3.3267e-02, PNorm = 200.2421, GNorm = 0.1122, lr_0 = 9.9985e-04
Validation binary_cross_entropy = 0.077569
Epoch 800
Loss = 4.6245e-02, PNorm = 200.4441, GNorm = 1.2136, lr_0 = 9.9985e-04
Validation binary_cross_entropy = 0.050454
Epoch 801
Loss = 3.8268e-02, PNorm = 200.6319, GNorm = 1.4600, lr_0 = 9.9985e-04
Validation binary_cross_entropy = 0.046463
Epoch 802
Loss = 3.8395e-02, PNorm = 200.8030, GNorm = 0.4249, lr_0 = 9.9985e-04
Validation binary_cross_entropy = 0.088021
Epoch 803
Loss = 5.1443e-02, PNorm = 200.9705, GNorm = 0.0779, lr_0 = 9.9985e-04
Validation binary_cross_entropy = 0.048978
Epoch 804
Loss = 2.6868e-02, PNorm = 201.1462, GNorm = 0.4663, lr_0 = 9.9985e-04
Validation binary_cross_entropy = 0.089513
Epoch 805
Loss = 2.5596e-02, PNorm = 201.2679, GNorm = 0.5235, lr_0 = 9.9985e-04
Validation binary_cross_entropy = 0.035218
Epoch 806
Loss = 7.3901e-02, PNorm = 201.3894, GNorm = 0.1849, lr_0 = 9.9985e-04
Validation binary_cross_entropy = 0.070033
Epoch 807
Loss = 2.9023e-02, PNorm = 201.5847, GNorm = 1.2907, lr_0 = 9.9985e-04
Validation binary_cross_entropy = 0.101663
Epoch 808
Loss = 8.3144e-03, PNorm = 201.7643, GNorm = 0.3044, lr_0 = 9.9985e-04
Validation binary_cross_entropy = 0.034883
Epoch 809
Loss = 1.5982e-02, PNorm = 201.9774, GNorm = 0.6086, lr_0 = 9.9985e-04
Loss = 5.9600e-02, PNorm = 202.1868, GNorm = 2.5195, lr_0 = 9.9985e-04
Validation binary_cross_entropy = 0.100987
Epoch 810
Loss = 4.2223e-02, PNorm = 202.3787, GNorm = 0.2112, lr_0 = 9.9985e-04
Validation binary_cross_entropy = 0.072397
Epoch 811
Loss = 3.2697e-02, PNorm = 202.5771, GNorm = 1.0578, lr_0 = 9.9985e-04
Validation binary_cross_entropy = 0.082502
Epoch 812
Loss = 4.7555e-02, PNorm = 202.7348, GNorm = 1.4577, lr_0 = 9.9985e-04
Validation binary_cross_entropy = 0.039777
Epoch 813
Loss = 4.2872e-02, PNorm = 202.8915, GNorm = 1.1236, lr_0 = 9.9984e-04
Validation binary_cross_entropy = 0.079598
Epoch 814
Loss = 7.4027e-02, PNorm = 203.0627, GNorm = 1.2940, lr_0 = 9.9984e-04
Validation binary_cross_entropy = 0.032656
Epoch 815
Loss = 2.3766e-02, PNorm = 203.2426, GNorm = 0.1040, lr_0 = 9.9984e-04
Validation binary_cross_entropy = 0.096570
Epoch 816
Loss = 1.5909e-02, PNorm = 203.4042, GNorm = 0.3645, lr_0 = 9.9984e-04
Validation binary_cross_entropy = 0.045154
Epoch 817
Loss = 1.2593e-01, PNorm = 203.5259, GNorm = 0.7731, lr_0 = 9.9984e-04
Validation binary_cross_entropy = 0.061107
Epoch 818
Loss = 1.1985e-02, PNorm = 203.6562, GNorm = 0.3504, lr_0 = 9.9984e-04
Validation binary_cross_entropy = 0.035571
Epoch 819
Loss = 7.7615e-03, PNorm = 203.7892, GNorm = 0.1426, lr_0 = 9.9984e-04
Loss = 4.6522e-02, PNorm = 203.9393, GNorm = 1.2158, lr_0 = 9.9984e-04
Validation binary_cross_entropy = 0.048522
Epoch 820
Loss = 1.8465e-02, PNorm = 204.1066, GNorm = 0.5000, lr_0 = 9.9984e-04
Validation binary_cross_entropy = 0.047781
Epoch 821
Loss = 3.6494e-02, PNorm = 204.2839, GNorm = 2.9198, lr_0 = 9.9984e-04
Validation binary_cross_entropy = 0.052223
Epoch 822
Loss = 7.1883e-02, PNorm = 204.4833, GNorm = 0.9653, lr_0 = 9.9984e-04
Validation binary_cross_entropy = 0.075316
Epoch 823
Loss = 6.0455e-02, PNorm = 204.7201, GNorm = 2.1720, lr_0 = 9.9984e-04
Validation binary_cross_entropy = 0.057316
Epoch 824
Loss = 3.0775e-02, PNorm = 204.9173, GNorm = 0.6582, lr_0 = 9.9984e-04
Validation binary_cross_entropy = 0.066350
Epoch 825
Loss = 4.7365e-02, PNorm = 205.1000, GNorm = 0.2637, lr_0 = 9.9984e-04
Validation binary_cross_entropy = 0.050075
Epoch 826
Loss = 7.2659e-02, PNorm = 205.2755, GNorm = 0.1811, lr_0 = 9.9983e-04
Validation binary_cross_entropy = 0.032882
Epoch 827
Loss = 1.3678e-01, PNorm = 205.5175, GNorm = 2.3058, lr_0 = 9.9983e-04
Validation binary_cross_entropy = 0.088604
Epoch 828
Loss = 7.4762e-02, PNorm = 205.8161, GNorm = 0.8440, lr_0 = 9.9983e-04
Validation binary_cross_entropy = 0.061029
Epoch 829
Loss = 5.7196e-02, PNorm = 206.0654, GNorm = 1.7914, lr_0 = 9.9983e-04
Loss = 1.3756e-02, PNorm = 206.2617, GNorm = 2.4965, lr_0 = 9.9983e-04
Validation binary_cross_entropy = 0.057591
Epoch 830
Loss = 4.5543e-02, PNorm = 206.4253, GNorm = 0.8890, lr_0 = 9.9983e-04
Validation binary_cross_entropy = 0.050401
Epoch 831
Loss = 5.2757e-02, PNorm = 206.6699, GNorm = 0.9653, lr_0 = 9.9983e-04
Validation binary_cross_entropy = 0.035917
Epoch 832
Loss = 5.4592e-02, PNorm = 206.9289, GNorm = 0.6758, lr_0 = 9.9983e-04
Validation binary_cross_entropy = 0.050920
Epoch 833
Loss = 9.8537e-02, PNorm = 207.1608, GNorm = 0.5896, lr_0 = 9.9983e-04
Validation binary_cross_entropy = 0.086663
Epoch 834
Loss = 5.3170e-02, PNorm = 207.3985, GNorm = 1.3556, lr_0 = 9.9983e-04
Validation binary_cross_entropy = 0.047393
Epoch 835
Loss = 2.0569e-02, PNorm = 207.5961, GNorm = 1.1267, lr_0 = 9.9983e-04
Validation binary_cross_entropy = 0.062050
Epoch 836
Loss = 2.3921e-02, PNorm = 207.7741, GNorm = 1.0743, lr_0 = 9.9983e-04
Validation binary_cross_entropy = 0.060507
Epoch 837
Loss = 4.6805e-02, PNorm = 207.9687, GNorm = 0.1801, lr_0 = 9.9983e-04
Validation binary_cross_entropy = 0.057839
Epoch 838
Loss = 1.5979e-02, PNorm = 208.1677, GNorm = 0.9117, lr_0 = 9.9983e-04
Validation binary_cross_entropy = 0.067478
Epoch 839
Loss = 9.6230e-03, PNorm = 208.3986, GNorm = 0.1670, lr_0 = 9.9983e-04
Loss = 2.5049e-02, PNorm = 208.6336, GNorm = 0.4833, lr_0 = 9.9982e-04
Validation binary_cross_entropy = 0.087940
Epoch 840
Loss = 3.8268e-02, PNorm = 208.8385, GNorm = 1.8654, lr_0 = 9.9982e-04
Validation binary_cross_entropy = 0.045892
Epoch 841
Loss = 2.7431e-02, PNorm = 209.0370, GNorm = 0.2488, lr_0 = 9.9982e-04
Validation binary_cross_entropy = 0.054771
Epoch 842
Loss = 3.8054e-02, PNorm = 209.2128, GNorm = 1.0276, lr_0 = 9.9982e-04
Validation binary_cross_entropy = 0.042706
Epoch 843
Loss = 5.5890e-02, PNorm = 209.4095, GNorm = 0.5228, lr_0 = 9.9982e-04
Validation binary_cross_entropy = 0.047965
Epoch 844
Loss = 2.5570e-02, PNorm = 209.5924, GNorm = 1.2521, lr_0 = 9.9982e-04
Validation binary_cross_entropy = 0.041958
Epoch 845
Loss = 1.4910e-02, PNorm = 209.7602, GNorm = 0.5405, lr_0 = 9.9982e-04
Validation binary_cross_entropy = 0.078311
Epoch 846
Loss = 3.7965e-02, PNorm = 209.9099, GNorm = 1.0468, lr_0 = 9.9982e-04
Validation binary_cross_entropy = 0.140289
Epoch 847
Loss = 8.4695e-02, PNorm = 210.0419, GNorm = 1.3719, lr_0 = 9.9982e-04
Validation binary_cross_entropy = 0.053581
Epoch 848
Loss = 2.3454e-02, PNorm = 210.1987, GNorm = 0.7719, lr_0 = 9.9982e-04
Validation binary_cross_entropy = 0.046577
Epoch 849
Loss = 7.1039e-02, PNorm = 210.3731, GNorm = 0.6283, lr_0 = 9.9982e-04
Loss = 1.7707e-02, PNorm = 210.5378, GNorm = 0.1124, lr_0 = 9.9982e-04
Validation binary_cross_entropy = 0.061989
Epoch 850
Loss = 1.5678e-02, PNorm = 210.6633, GNorm = 0.0776, lr_0 = 9.9982e-04
Validation binary_cross_entropy = 0.086251
Epoch 851
Loss = 3.6966e-02, PNorm = 210.7901, GNorm = 0.8821, lr_0 = 9.9982e-04
Validation binary_cross_entropy = 0.075976
Epoch 852
Loss = 2.6550e-02, PNorm = 210.9151, GNorm = 1.5847, lr_0 = 9.9981e-04
Validation binary_cross_entropy = 0.067437
Epoch 853
Loss = 6.5022e-02, PNorm = 211.0637, GNorm = 1.1370, lr_0 = 9.9981e-04
Validation binary_cross_entropy = 0.050173
Epoch 854
Loss = 1.5800e-02, PNorm = 211.2383, GNorm = 0.1159, lr_0 = 9.9981e-04
Validation binary_cross_entropy = 0.090318
Epoch 855
Loss = 1.0611e-02, PNorm = 211.3776, GNorm = 0.2315, lr_0 = 9.9981e-04
Validation binary_cross_entropy = 0.060556
Epoch 856
Loss = 3.1375e-02, PNorm = 211.4971, GNorm = 1.6523, lr_0 = 9.9981e-04
Validation binary_cross_entropy = 0.060904
Epoch 857
Loss = 8.7769e-03, PNorm = 211.6306, GNorm = 0.3712, lr_0 = 9.9981e-04
Validation binary_cross_entropy = 0.088240
Epoch 858
Loss = 7.5813e-02, PNorm = 211.7446, GNorm = 0.9672, lr_0 = 9.9981e-04
Validation binary_cross_entropy = 0.052822
Epoch 859
Loss = 8.2249e-03, PNorm = 211.8898, GNorm = 0.3084, lr_0 = 9.9981e-04
Loss = 5.2039e-02, PNorm = 212.0102, GNorm = 0.9421, lr_0 = 9.9981e-04
Validation binary_cross_entropy = 0.066860
Epoch 860
Loss = 4.6161e-02, PNorm = 212.1521, GNorm = 1.2642, lr_0 = 9.9981e-04
Validation binary_cross_entropy = 0.070558
Epoch 861
Loss = 5.2853e-02, PNorm = 212.3048, GNorm = 2.4755, lr_0 = 9.9981e-04
Validation binary_cross_entropy = 0.047402
Epoch 862
Loss = 2.0203e-02, PNorm = 212.4649, GNorm = 0.1269, lr_0 = 9.9981e-04
Validation binary_cross_entropy = 0.062653
Epoch 863
Loss = 3.1189e-02, PNorm = 212.5909, GNorm = 0.1887, lr_0 = 9.9981e-04
Validation binary_cross_entropy = 0.052220
Epoch 864
Loss = 2.6824e-02, PNorm = 212.7100, GNorm = 0.0533, lr_0 = 9.9981e-04
Validation binary_cross_entropy = 0.064256
Epoch 865
Loss = 1.2268e-02, PNorm = 212.8219, GNorm = 1.3060, lr_0 = 9.9981e-04
Validation binary_cross_entropy = 0.073416
Epoch 866
Loss = 2.8451e-02, PNorm = 212.9338, GNorm = 0.1100, lr_0 = 9.9980e-04
Validation binary_cross_entropy = 0.066438
Epoch 867
Loss = 1.4284e-02, PNorm = 213.0825, GNorm = 0.7647, lr_0 = 9.9980e-04
Validation binary_cross_entropy = 0.071837
Epoch 868
Loss = 3.4438e-03, PNorm = 213.2330, GNorm = 0.0600, lr_0 = 9.9980e-04
Validation binary_cross_entropy = 0.052587
Epoch 869
Loss = 8.8839e-03, PNorm = 213.3441, GNorm = 0.4022, lr_0 = 9.9980e-04
Loss = 3.3851e-02, PNorm = 213.4671, GNorm = 0.7413, lr_0 = 9.9980e-04
Validation binary_cross_entropy = 0.041195
Epoch 870
Loss = 3.2046e-02, PNorm = 213.6233, GNorm = 0.9427, lr_0 = 9.9980e-04
Validation binary_cross_entropy = 0.063898
Epoch 871
Loss = 2.6887e-02, PNorm = 213.7428, GNorm = 0.1356, lr_0 = 9.9980e-04
Validation binary_cross_entropy = 0.065553
Epoch 872
Loss = 2.3800e-02, PNorm = 213.8350, GNorm = 0.0465, lr_0 = 9.9980e-04
Validation binary_cross_entropy = 0.054163
Epoch 873
Loss = 4.3459e-02, PNorm = 213.9176, GNorm = 1.5777, lr_0 = 9.9980e-04
Validation binary_cross_entropy = 0.046154
Epoch 874
Loss = 4.9102e-02, PNorm = 214.0287, GNorm = 0.2611, lr_0 = 9.9980e-04
Validation binary_cross_entropy = 0.051604
Epoch 875
Loss = 1.7983e-02, PNorm = 214.1727, GNorm = 0.0861, lr_0 = 9.9980e-04
Validation binary_cross_entropy = 0.110097
Epoch 876
Loss = 4.2114e-02, PNorm = 214.3086, GNorm = 1.2918, lr_0 = 9.9980e-04
Validation binary_cross_entropy = 0.081353
Epoch 877
Loss = 5.4680e-02, PNorm = 214.4265, GNorm = 0.4560, lr_0 = 9.9980e-04
Validation binary_cross_entropy = 0.122080
Epoch 878
Loss = 3.2950e-03, PNorm = 214.5889, GNorm = 0.4062, lr_0 = 9.9980e-04
Validation binary_cross_entropy = 0.099435
Epoch 879
Loss = 2.1419e-02, PNorm = 214.7684, GNorm = 0.8732, lr_0 = 9.9979e-04
Loss = 7.8388e-02, PNorm = 214.9684, GNorm = 1.2103, lr_0 = 9.9979e-04
Validation binary_cross_entropy = 0.059816
Epoch 880
Loss = 6.8726e-02, PNorm = 215.2098, GNorm = 0.9437, lr_0 = 9.9979e-04
Validation binary_cross_entropy = 0.063923
Epoch 881
Loss = 4.2253e-02, PNorm = 215.4452, GNorm = 0.5381, lr_0 = 9.9979e-04
Validation binary_cross_entropy = 0.038122
Epoch 882
Loss = 9.4898e-03, PNorm = 215.6437, GNorm = 0.0563, lr_0 = 9.9979e-04
Validation binary_cross_entropy = 0.100444
Epoch 883
Loss = 5.5644e-02, PNorm = 215.7916, GNorm = 0.0214, lr_0 = 9.9979e-04
Validation binary_cross_entropy = 0.043684
Epoch 884
Loss = 4.5979e-02, PNorm = 215.9815, GNorm = 1.0155, lr_0 = 9.9979e-04
Validation binary_cross_entropy = 0.056490
Epoch 885
Loss = 3.8434e-02, PNorm = 216.1941, GNorm = 0.3715, lr_0 = 9.9979e-04
Validation binary_cross_entropy = 0.133445
Epoch 886
Loss = 5.0048e-02, PNorm = 216.3979, GNorm = 0.2170, lr_0 = 9.9979e-04
Validation binary_cross_entropy = 0.032234
Epoch 887
Loss = 6.6370e-02, PNorm = 216.6537, GNorm = 1.1483, lr_0 = 9.9979e-04
Validation binary_cross_entropy = 0.096393
Epoch 888
Loss = 1.7461e-02, PNorm = 216.8539, GNorm = 0.0465, lr_0 = 9.9979e-04
Validation binary_cross_entropy = 0.045466
Epoch 889
Loss = 7.9102e-03, PNorm = 217.0413, GNorm = 0.1649, lr_0 = 9.9979e-04
Loss = 4.7616e-02, PNorm = 217.2314, GNorm = 1.4050, lr_0 = 9.9979e-04
Validation binary_cross_entropy = 0.064860
Epoch 890
Loss = 1.4113e-02, PNorm = 217.3923, GNorm = 0.6860, lr_0 = 9.9979e-04
Validation binary_cross_entropy = 0.045334
Epoch 891
Loss = 4.7921e-02, PNorm = 217.5343, GNorm = 1.2572, lr_0 = 9.9979e-04
Validation binary_cross_entropy = 0.083094
Epoch 892
Loss = 4.2721e-02, PNorm = 217.7137, GNorm = 2.1328, lr_0 = 9.9978e-04
Validation binary_cross_entropy = 0.053098
Epoch 893
Loss = 3.5333e-02, PNorm = 217.8970, GNorm = 0.7100, lr_0 = 9.9978e-04
Validation binary_cross_entropy = 0.076487
Epoch 894
Loss = 9.6213e-02, PNorm = 218.0732, GNorm = 2.0889, lr_0 = 9.9978e-04
Validation binary_cross_entropy = 0.056151
Epoch 895
Loss = 1.6612e-02, PNorm = 218.2493, GNorm = 0.1903, lr_0 = 9.9978e-04
Validation binary_cross_entropy = 0.058132
Epoch 896
Loss = 2.2292e-02, PNorm = 218.4185, GNorm = 0.1083, lr_0 = 9.9978e-04
Validation binary_cross_entropy = 0.048398
Epoch 897
Loss = 4.3557e-03, PNorm = 218.5620, GNorm = 0.1552, lr_0 = 9.9978e-04
Validation binary_cross_entropy = 0.112565
Epoch 898
Loss = 2.0223e-03, PNorm = 218.6968, GNorm = 0.0236, lr_0 = 9.9978e-04
Validation binary_cross_entropy = 0.055706
Epoch 899
Loss = 2.9994e-03, PNorm = 218.8423, GNorm = 0.2374, lr_0 = 9.9978e-04
Loss = 3.5635e-02, PNorm = 218.9913, GNorm = 0.2524, lr_0 = 9.9978e-04
Validation binary_cross_entropy = 0.060400
Epoch 900
Loss = 1.9850e-02, PNorm = 219.1312, GNorm = 2.4832, lr_0 = 9.9978e-04
Validation binary_cross_entropy = 0.068899
Epoch 901
Loss = 4.4116e-02, PNorm = 219.2726, GNorm = 0.0828, lr_0 = 9.9978e-04
Validation binary_cross_entropy = 0.104907
Epoch 902
Loss = 6.2597e-02, PNorm = 219.4641, GNorm = 1.1646, lr_0 = 9.9978e-04
Validation binary_cross_entropy = 0.039619
Epoch 903
Loss = 4.8710e-02, PNorm = 219.6717, GNorm = 0.5114, lr_0 = 9.9978e-04
Validation binary_cross_entropy = 0.064158
Epoch 904
Loss = 3.0940e-02, PNorm = 219.8880, GNorm = 0.1830, lr_0 = 9.9978e-04
Validation binary_cross_entropy = 0.062032
Epoch 905
Loss = 3.5278e-02, PNorm = 220.0581, GNorm = 1.1059, lr_0 = 9.9977e-04
Validation binary_cross_entropy = 0.077705
Epoch 906
Loss = 1.2307e-01, PNorm = 220.1539, GNorm = 0.3072, lr_0 = 9.9977e-04
Validation binary_cross_entropy = 0.032853
Epoch 907
Loss = 5.3941e-02, PNorm = 220.2582, GNorm = 0.9025, lr_0 = 9.9977e-04
Validation binary_cross_entropy = 0.053196
Epoch 908
Loss = 5.4645e-02, PNorm = 220.3954, GNorm = 1.0323, lr_0 = 9.9977e-04
Validation binary_cross_entropy = 0.050150
Epoch 909
Loss = 1.5893e-01, PNorm = 220.5443, GNorm = 1.8789, lr_0 = 9.9977e-04
Loss = 2.6340e-02, PNorm = 220.6930, GNorm = 0.5686, lr_0 = 9.9977e-04
Validation binary_cross_entropy = 0.042117
Epoch 910
Loss = 3.7334e-02, PNorm = 220.8561, GNorm = 0.8167, lr_0 = 9.9977e-04
Validation binary_cross_entropy = 0.053990
Epoch 911
Loss = 2.8424e-02, PNorm = 221.0209, GNorm = 0.6297, lr_0 = 9.9977e-04
Validation binary_cross_entropy = 0.043581
Epoch 912
Loss = 2.5238e-02, PNorm = 221.1906, GNorm = 0.0084, lr_0 = 9.9977e-04
Validation binary_cross_entropy = 0.095670
Epoch 913
Loss = 3.2098e-02, PNorm = 221.3075, GNorm = 0.3790, lr_0 = 9.9977e-04
Validation binary_cross_entropy = 0.041835
Epoch 914
Loss = 1.8859e-02, PNorm = 221.4374, GNorm = 0.3147, lr_0 = 9.9977e-04
Validation binary_cross_entropy = 0.051532
Epoch 915
Loss = 4.8765e-02, PNorm = 221.5683, GNorm = 1.7284, lr_0 = 9.9977e-04
Validation binary_cross_entropy = 0.039636
Epoch 916
Loss = 6.7230e-02, PNorm = 221.6945, GNorm = 0.2522, lr_0 = 9.9977e-04
Validation binary_cross_entropy = 0.047058
Epoch 917
Loss = 1.9276e-02, PNorm = 221.8326, GNorm = 0.4719, lr_0 = 9.9977e-04
Validation binary_cross_entropy = 0.054732
Epoch 918
Loss = 2.8968e-02, PNorm = 221.9706, GNorm = 0.1140, lr_0 = 9.9977e-04
Validation binary_cross_entropy = 0.038932
Epoch 919
Loss = 3.6686e-02, PNorm = 222.0966, GNorm = 0.9609, lr_0 = 9.9976e-04
Loss = 3.7825e-02, PNorm = 222.1976, GNorm = 0.8169, lr_0 = 9.9976e-04
Validation binary_cross_entropy = 0.061668
Epoch 920
Loss = 7.2686e-02, PNorm = 222.3147, GNorm = 2.3844, lr_0 = 9.9976e-04
Validation binary_cross_entropy = 0.045669
Epoch 921
Loss = 4.1688e-02, PNorm = 222.4704, GNorm = 0.8075, lr_0 = 9.9976e-04
Validation binary_cross_entropy = 0.076892
Epoch 922
Loss = 4.7092e-02, PNorm = 222.6756, GNorm = 1.3927, lr_0 = 9.9976e-04
Validation binary_cross_entropy = 0.065999
Epoch 923
Loss = 1.7547e-02, PNorm = 222.8105, GNorm = 0.0828, lr_0 = 9.9976e-04
Validation binary_cross_entropy = 0.063019
Epoch 924
Loss = 2.6409e-02, PNorm = 222.9129, GNorm = 0.2583, lr_0 = 9.9976e-04
Validation binary_cross_entropy = 0.076281
Epoch 925
Loss = 2.0876e-02, PNorm = 223.0669, GNorm = 0.6522, lr_0 = 9.9976e-04
Validation binary_cross_entropy = 0.071042
Epoch 926
Loss = 5.6426e-02, PNorm = 223.2151, GNorm = 1.0669, lr_0 = 9.9976e-04
Validation binary_cross_entropy = 0.079738
Epoch 927
Loss = 3.2821e-02, PNorm = 223.4211, GNorm = 1.4271, lr_0 = 9.9976e-04
Validation binary_cross_entropy = 0.073453
Epoch 928
Loss = 3.2493e-02, PNorm = 223.6679, GNorm = 0.1496, lr_0 = 9.9976e-04
Validation binary_cross_entropy = 0.058554
Epoch 929
Loss = 1.4496e-01, PNorm = 223.9062, GNorm = 2.3169, lr_0 = 9.9976e-04
Loss = 3.6035e-02, PNorm = 224.1495, GNorm = 1.8503, lr_0 = 9.9976e-04
Validation binary_cross_entropy = 0.070836
Epoch 930
Loss = 9.9685e-03, PNorm = 224.3762, GNorm = 0.0056, lr_0 = 9.9976e-04
Validation binary_cross_entropy = 0.114293
Epoch 931
Loss = 7.7017e-02, PNorm = 224.5379, GNorm = 0.3096, lr_0 = 9.9975e-04
Validation binary_cross_entropy = 0.072709
Epoch 932
Loss = 2.2389e-02, PNorm = 224.7225, GNorm = 1.2879, lr_0 = 9.9975e-04
Validation binary_cross_entropy = 0.062608
Epoch 933
Loss = 2.1386e-02, PNorm = 224.9289, GNorm = 0.6786, lr_0 = 9.9975e-04
Validation binary_cross_entropy = 0.107486
Epoch 934
Loss = 2.9115e-02, PNorm = 225.1023, GNorm = 0.7977, lr_0 = 9.9975e-04
Validation binary_cross_entropy = 0.067926
Epoch 935
Loss = 2.5743e-02, PNorm = 225.2827, GNorm = 0.8335, lr_0 = 9.9975e-04
Validation binary_cross_entropy = 0.057695
Epoch 936
Loss = 2.4146e-02, PNorm = 225.4827, GNorm = 0.7950, lr_0 = 9.9975e-04
Validation binary_cross_entropy = 0.056339
Epoch 937
Loss = 3.6338e-02, PNorm = 225.7143, GNorm = 0.7573, lr_0 = 9.9975e-04
Validation binary_cross_entropy = 0.075618
Epoch 938
Loss = 1.7611e-02, PNorm = 225.9433, GNorm = 0.5800, lr_0 = 9.9975e-04
Validation binary_cross_entropy = 0.073870
Epoch 939
Loss = 1.3427e-02, PNorm = 226.1885, GNorm = 0.3809, lr_0 = 9.9975e-04
Loss = 2.4484e-02, PNorm = 226.4161, GNorm = 0.4930, lr_0 = 9.9975e-04
Validation binary_cross_entropy = 0.060794
Epoch 940
Loss = 2.0462e-02, PNorm = 226.5900, GNorm = 1.0371, lr_0 = 9.9975e-04
Validation binary_cross_entropy = 0.067653
Epoch 941
Loss = 5.0756e-02, PNorm = 226.9068, GNorm = 3.1293, lr_0 = 9.9975e-04
Validation binary_cross_entropy = 0.106568
Epoch 942
Loss = 3.3115e-02, PNorm = 227.2657, GNorm = 0.4665, lr_0 = 9.9975e-04
Validation binary_cross_entropy = 0.071597
Epoch 943
Loss = 4.5318e-02, PNorm = 227.5876, GNorm = 1.7880, lr_0 = 9.9975e-04
Validation binary_cross_entropy = 0.171758
Epoch 944
Loss = 8.1300e-02, PNorm = 227.8789, GNorm = 1.0739, lr_0 = 9.9975e-04
Validation binary_cross_entropy = 0.047409
Epoch 945
Loss = 3.9850e-02, PNorm = 228.1557, GNorm = 0.2064, lr_0 = 9.9974e-04
Validation binary_cross_entropy = 0.077419
Epoch 946
Loss = 1.7636e-02, PNorm = 228.3571, GNorm = 1.2226, lr_0 = 9.9974e-04
Validation binary_cross_entropy = 0.075242
Epoch 947
Loss = 2.5041e-02, PNorm = 228.5232, GNorm = 0.7540, lr_0 = 9.9974e-04
Validation binary_cross_entropy = 0.065491
Epoch 948
Loss = 3.5964e-03, PNorm = 228.6581, GNorm = 0.0518, lr_0 = 9.9974e-04
Validation binary_cross_entropy = 0.044425
Epoch 949
Loss = 1.3688e-01, PNorm = 228.7861, GNorm = 2.2348, lr_0 = 9.9974e-04
Loss = 1.4889e-02, PNorm = 228.9491, GNorm = 0.0324, lr_0 = 9.9974e-04
Validation binary_cross_entropy = 0.100554
Epoch 950
Loss = 6.3248e-02, PNorm = 229.1441, GNorm = 0.0823, lr_0 = 9.9974e-04
Validation binary_cross_entropy = 0.540276
Epoch 951
Loss = 1.1270e-01, PNorm = 229.5144, GNorm = 0.2067, lr_0 = 9.9974e-04
Validation binary_cross_entropy = 0.073438
Epoch 952
Loss = 5.6574e-02, PNorm = 229.8533, GNorm = 1.7919, lr_0 = 9.9974e-04
Validation binary_cross_entropy = 0.096788
Epoch 953
Loss = 4.5530e-02, PNorm = 230.0984, GNorm = 0.8386, lr_0 = 9.9974e-04
Validation binary_cross_entropy = 0.072214
Epoch 954
Loss = 3.2809e-02, PNorm = 230.2717, GNorm = 0.9637, lr_0 = 9.9974e-04
Validation binary_cross_entropy = 0.041551
Epoch 955
Loss = 1.1404e-01, PNorm = 230.4453, GNorm = 0.3895, lr_0 = 9.9974e-04
Validation binary_cross_entropy = 0.114746
Epoch 956
Loss = 7.4773e-02, PNorm = 230.6263, GNorm = 1.0489, lr_0 = 9.9974e-04
Validation binary_cross_entropy = 0.042420
Epoch 957
Loss = 5.0102e-02, PNorm = 230.8080, GNorm = 1.9846, lr_0 = 9.9974e-04
Validation binary_cross_entropy = 0.116590
Epoch 958
Loss = 2.5593e-02, PNorm = 231.0010, GNorm = 2.4692, lr_0 = 9.9973e-04
Validation binary_cross_entropy = 0.053029
Epoch 959
Loss = 6.7797e-03, PNorm = 231.1538, GNorm = 0.1526, lr_0 = 9.9973e-04
Loss = 1.8846e-02, PNorm = 231.3253, GNorm = 2.0051, lr_0 = 9.9973e-04
Validation binary_cross_entropy = 0.076598
Epoch 960
Loss = 5.6323e-02, PNorm = 231.4549, GNorm = 0.1970, lr_0 = 9.9973e-04
Validation binary_cross_entropy = 0.054578
Epoch 961
Loss = 2.6274e-02, PNorm = 231.5772, GNorm = 0.3579, lr_0 = 9.9973e-04
Validation binary_cross_entropy = 0.074904
Epoch 962
Loss = 1.7753e-02, PNorm = 231.7027, GNorm = 0.1163, lr_0 = 9.9973e-04
Validation binary_cross_entropy = 0.040386
Epoch 963
Loss = 1.7067e-02, PNorm = 231.8588, GNorm = 0.7066, lr_0 = 9.9973e-04
Validation binary_cross_entropy = 0.127348
Epoch 964
Loss = 2.6731e-02, PNorm = 231.9877, GNorm = 0.0384, lr_0 = 9.9973e-04
Validation binary_cross_entropy = 0.066383
Epoch 965
Loss = 1.4354e-02, PNorm = 232.1367, GNorm = 1.4936, lr_0 = 9.9973e-04
Validation binary_cross_entropy = 0.080622
Epoch 966
Loss = 1.6681e-02, PNorm = 232.2615, GNorm = 1.0716, lr_0 = 9.9973e-04
Validation binary_cross_entropy = 0.084093
Epoch 967
Loss = 5.3188e-02, PNorm = 232.3476, GNorm = 0.4812, lr_0 = 9.9973e-04
Validation binary_cross_entropy = 0.106950
Epoch 968
Loss = 1.4652e-02, PNorm = 232.4698, GNorm = 0.1134, lr_0 = 9.9973e-04
Validation binary_cross_entropy = 0.043938
Epoch 969
Loss = 1.0873e-02, PNorm = 232.6034, GNorm = 0.2561, lr_0 = 9.9973e-04
Loss = 4.9779e-02, PNorm = 232.7560, GNorm = 1.9090, lr_0 = 9.9973e-04
Validation binary_cross_entropy = 0.055082
Epoch 970
Loss = 3.0566e-02, PNorm = 232.8926, GNorm = 0.6270, lr_0 = 9.9973e-04
Validation binary_cross_entropy = 0.052502
Epoch 971
Loss = 3.9377e-02, PNorm = 233.0261, GNorm = 0.8899, lr_0 = 9.9972e-04
Validation binary_cross_entropy = 0.051071
Epoch 972
Loss = 2.6190e-02, PNorm = 233.1533, GNorm = 2.1219, lr_0 = 9.9972e-04
Validation binary_cross_entropy = 0.061508
Epoch 973
Loss = 6.5739e-02, PNorm = 233.3062, GNorm = 0.1820, lr_0 = 9.9972e-04
Validation binary_cross_entropy = 0.085225
Epoch 974
Loss = 3.7459e-02, PNorm = 233.5045, GNorm = 0.1796, lr_0 = 9.9972e-04
Validation binary_cross_entropy = 0.059630
Epoch 975
Loss = 4.9599e-02, PNorm = 233.6726, GNorm = 2.5176, lr_0 = 9.9972e-04
Validation binary_cross_entropy = 0.064712
Epoch 976
Loss = 4.4098e-02, PNorm = 233.8345, GNorm = 0.7964, lr_0 = 9.9972e-04
Validation binary_cross_entropy = 0.051789
Epoch 977
Loss = 3.6145e-02, PNorm = 234.0079, GNorm = 1.9060, lr_0 = 9.9972e-04
Validation binary_cross_entropy = 0.076363
Epoch 978
Loss = 5.7429e-03, PNorm = 234.1662, GNorm = 0.0223, lr_0 = 9.9972e-04
Validation binary_cross_entropy = 0.061803
Epoch 979
Loss = 1.1627e-02, PNorm = 234.3198, GNorm = 0.4211, lr_0 = 9.9972e-04
Loss = 3.0690e-02, PNorm = 234.5188, GNorm = 1.2244, lr_0 = 9.9972e-04
Validation binary_cross_entropy = 0.045864
Epoch 980
Loss = 2.4127e-02, PNorm = 234.6734, GNorm = 0.5062, lr_0 = 9.9972e-04
Validation binary_cross_entropy = 0.060727
Epoch 981
Loss = 3.7751e-02, PNorm = 234.7955, GNorm = 2.7538, lr_0 = 9.9972e-04
Validation binary_cross_entropy = 0.043013
Epoch 982
Loss = 1.8091e-02, PNorm = 234.9472, GNorm = 0.0462, lr_0 = 9.9972e-04
Validation binary_cross_entropy = 0.148450
Epoch 983
Loss = 5.0044e-02, PNorm = 235.0776, GNorm = 1.5455, lr_0 = 9.9972e-04
Validation binary_cross_entropy = 0.048873
Epoch 984
Loss = 3.9168e-02, PNorm = 235.2177, GNorm = 1.3827, lr_0 = 9.9971e-04
Validation binary_cross_entropy = 0.073708
Epoch 985
Loss = 1.0113e-01, PNorm = 235.3425, GNorm = 0.4899, lr_0 = 9.9971e-04
Validation binary_cross_entropy = 0.038396
Epoch 986
Loss = 2.3259e-02, PNorm = 235.4634, GNorm = 0.1526, lr_0 = 9.9971e-04
Validation binary_cross_entropy = 0.069541
Epoch 987
Loss = 1.4515e-02, PNorm = 235.5744, GNorm = 3.7354, lr_0 = 9.9971e-04
Validation binary_cross_entropy = 0.051240
Epoch 988
Loss = 1.3045e-01, PNorm = 235.6810, GNorm = 0.1259, lr_0 = 9.9971e-04
Validation binary_cross_entropy = 0.076285
Epoch 989
Loss = 4.8910e-03, PNorm = 235.8134, GNorm = 0.1300, lr_0 = 9.9971e-04
Loss = 1.6118e-02, PNorm = 235.9167, GNorm = 0.0468, lr_0 = 9.9971e-04
Validation binary_cross_entropy = 0.074069
Epoch 990
Loss = 4.3647e-02, PNorm = 236.0074, GNorm = 0.4903, lr_0 = 9.9971e-04
Validation binary_cross_entropy = 0.041766
Epoch 991
Loss = 1.0657e-02, PNorm = 236.1595, GNorm = 0.0271, lr_0 = 9.9971e-04
Validation binary_cross_entropy = 0.089444
Epoch 992
Loss = 5.5176e-02, PNorm = 236.2695, GNorm = 0.9978, lr_0 = 9.9971e-04
Validation binary_cross_entropy = 0.073811
Epoch 993
Loss = 7.4368e-02, PNorm = 236.3625, GNorm = 0.7015, lr_0 = 9.9971e-04
Validation binary_cross_entropy = 0.044521
Epoch 994
Loss = 2.5927e-02, PNorm = 236.4844, GNorm = 1.0475, lr_0 = 9.9971e-04
Validation binary_cross_entropy = 0.065822
Epoch 995
Loss = 7.0916e-03, PNorm = 236.6059, GNorm = 0.0330, lr_0 = 9.9971e-04
Validation binary_cross_entropy = 0.087527
Epoch 996
Loss = 7.7554e-04, PNorm = 236.6917, GNorm = 0.0330, lr_0 = 9.9971e-04
Validation binary_cross_entropy = 0.082458
Epoch 997
Loss = 1.0716e-01, PNorm = 236.7599, GNorm = 1.7453, lr_0 = 9.9971e-04
Validation binary_cross_entropy = 0.050734
Epoch 998
Loss = 3.7303e-03, PNorm = 236.8601, GNorm = 0.0511, lr_0 = 9.9970e-04
Validation binary_cross_entropy = 0.069955
Epoch 999
Loss = 5.9243e-03, PNorm = 237.0156, GNorm = 0.1832, lr_0 = 9.9970e-04
Loss = 2.9547e-02, PNorm = 237.1558, GNorm = 0.9074, lr_0 = 9.9970e-04
Validation binary_cross_entropy = 0.067726
Epoch 1000
Loss = 2.1561e-02, PNorm = 237.2859, GNorm = 0.1674, lr_0 = 9.9970e-04
Validation binary_cross_entropy = 0.062597
Epoch 1001
Loss = 1.5866e-02, PNorm = 237.3856, GNorm = 0.0354, lr_0 = 9.9970e-04
Validation binary_cross_entropy = 0.086537
Epoch 1002
Loss = 4.1370e-02, PNorm = 237.4921, GNorm = 1.6788, lr_0 = 9.9970e-04
Validation binary_cross_entropy = 0.039590
Epoch 1003
Loss = 5.8641e-02, PNorm = 237.6610, GNorm = 0.1643, lr_0 = 9.9970e-04
Validation binary_cross_entropy = 0.086521
Epoch 1004
Loss = 5.4461e-02, PNorm = 237.8202, GNorm = 1.0691, lr_0 = 9.9970e-04
Validation binary_cross_entropy = 0.045326
Epoch 1005
Loss = 2.8017e-02, PNorm = 237.9533, GNorm = 0.1569, lr_0 = 9.9970e-04
Validation binary_cross_entropy = 0.058058
Epoch 1006
Loss = 3.4057e-03, PNorm = 238.0621, GNorm = 0.1042, lr_0 = 9.9970e-04
Validation binary_cross_entropy = 0.053420
Epoch 1007
Loss = 4.4547e-03, PNorm = 238.1608, GNorm = 0.1135, lr_0 = 9.9970e-04
Validation binary_cross_entropy = 0.087766
Epoch 1008
Loss = 5.7033e-03, PNorm = 238.2747, GNorm = 0.2431, lr_0 = 9.9970e-04
Validation binary_cross_entropy = 0.061688
Epoch 1009
Loss = 1.6789e-01, PNorm = 238.4301, GNorm = 1.7922, lr_0 = 9.9970e-04
Loss = 2.5244e-02, PNorm = 238.5629, GNorm = 0.9093, lr_0 = 9.9970e-04
Validation binary_cross_entropy = 0.058078
Epoch 1010
Loss = 2.5968e-02, PNorm = 238.6723, GNorm = 0.3127, lr_0 = 9.9969e-04
Validation binary_cross_entropy = 0.053195
Epoch 1011
Loss = 4.3487e-02, PNorm = 238.7769, GNorm = 0.1702, lr_0 = 9.9969e-04
Validation binary_cross_entropy = 0.051230
Epoch 1012
Loss = 2.0709e-02, PNorm = 238.9173, GNorm = 0.0481, lr_0 = 9.9969e-04
Validation binary_cross_entropy = 0.087074
Epoch 1013
Loss = 9.0947e-02, PNorm = 239.0238, GNorm = 0.0441, lr_0 = 9.9969e-04
Validation binary_cross_entropy = 0.044115
Epoch 1014
Loss = 4.8340e-02, PNorm = 239.1467, GNorm = 0.4904, lr_0 = 9.9969e-04
Validation binary_cross_entropy = 0.096803
Epoch 1015
Loss = 2.9380e-02, PNorm = 239.3088, GNorm = 0.3556, lr_0 = 9.9969e-04
Validation binary_cross_entropy = 0.055785
Epoch 1016
Loss = 7.0266e-03, PNorm = 239.4590, GNorm = 0.1484, lr_0 = 9.9969e-04
Validation binary_cross_entropy = 0.055603
Epoch 1017
Loss = 6.9433e-03, PNorm = 239.5861, GNorm = 0.8702, lr_0 = 9.9969e-04
Validation binary_cross_entropy = 0.041089
Epoch 1018
Loss = 5.5310e-02, PNorm = 239.7091, GNorm = 1.7845, lr_0 = 9.9969e-04
Validation binary_cross_entropy = 0.061966
Epoch 1019
Loss = 1.9834e-03, PNorm = 239.8569, GNorm = 0.0911, lr_0 = 9.9969e-04
Loss = 1.9246e-02, PNorm = 239.9700, GNorm = 2.1914, lr_0 = 9.9969e-04
Validation binary_cross_entropy = 0.093446
Epoch 1020
Loss = 5.8234e-02, PNorm = 240.1237, GNorm = 0.2980, lr_0 = 9.9969e-04
Validation binary_cross_entropy = 0.045548
Epoch 1021
Loss = 5.3388e-02, PNorm = 240.3134, GNorm = 1.4525, lr_0 = 9.9969e-04
Validation binary_cross_entropy = 0.057065
Epoch 1022
Loss = 2.0791e-02, PNorm = 240.4765, GNorm = 0.5014, lr_0 = 9.9969e-04
Validation binary_cross_entropy = 0.050376
Epoch 1023
Loss = 5.1370e-03, PNorm = 240.6266, GNorm = 0.0797, lr_0 = 9.9969e-04
Validation binary_cross_entropy = 0.074511
Epoch 1024
Loss = 1.1787e-02, PNorm = 240.7180, GNorm = 0.1172, lr_0 = 9.9968e-04
Validation binary_cross_entropy = 0.051157
Epoch 1025
Loss = 4.3339e-02, PNorm = 240.8292, GNorm = 1.2529, lr_0 = 9.9968e-04
Validation binary_cross_entropy = 0.062972
Epoch 1026
Loss = 5.3958e-03, PNorm = 240.9570, GNorm = 0.5780, lr_0 = 9.9968e-04
Validation binary_cross_entropy = 0.181768
Epoch 1027
Loss = 2.4496e-02, PNorm = 241.0924, GNorm = 0.1327, lr_0 = 9.9968e-04
Validation binary_cross_entropy = 0.057294
Epoch 1028
Loss = 2.3816e-02, PNorm = 241.2749, GNorm = 0.1274, lr_0 = 9.9968e-04
Validation binary_cross_entropy = 0.075330
Epoch 1029
Loss = 6.9336e-02, PNorm = 241.4849, GNorm = 1.5811, lr_0 = 9.9968e-04
Loss = 2.0451e-02, PNorm = 241.6432, GNorm = 0.2492, lr_0 = 9.9968e-04
Validation binary_cross_entropy = 0.078316
Epoch 1030
Loss = 1.4235e-02, PNorm = 241.7669, GNorm = 0.1945, lr_0 = 9.9968e-04
Validation binary_cross_entropy = 0.090794
Epoch 1031
Loss = 4.7048e-02, PNorm = 241.8674, GNorm = 0.4729, lr_0 = 9.9968e-04
Validation binary_cross_entropy = 0.064267
Epoch 1032
Loss = 2.5559e-02, PNorm = 241.9966, GNorm = 1.1877, lr_0 = 9.9968e-04
Validation binary_cross_entropy = 0.052410
Epoch 1033
Loss = 3.3214e-02, PNorm = 242.1735, GNorm = 0.1002, lr_0 = 9.9968e-04
Validation binary_cross_entropy = 0.065936
Epoch 1034
Loss = 3.0967e-02, PNorm = 242.3517, GNorm = 0.4577, lr_0 = 9.9968e-04
Validation binary_cross_entropy = 0.063133
Epoch 1035
Loss = 3.8609e-02, PNorm = 242.5392, GNorm = 2.3115, lr_0 = 9.9968e-04
Validation binary_cross_entropy = 0.101462
Epoch 1036
Loss = 2.4130e-02, PNorm = 242.7168, GNorm = 0.2289, lr_0 = 9.9968e-04
Validation binary_cross_entropy = 0.108313
Epoch 1037
Loss = 1.1132e-02, PNorm = 242.8668, GNorm = 0.2929, lr_0 = 9.9968e-04
Validation binary_cross_entropy = 0.083497
Epoch 1038
Loss = 1.8695e-02, PNorm = 243.0282, GNorm = 0.7707, lr_0 = 9.9967e-04
Validation binary_cross_entropy = 0.080502
Epoch 1039
Loss = 3.0936e-02, PNorm = 243.1832, GNorm = 1.0718, lr_0 = 9.9967e-04
Loss = 2.5435e-02, PNorm = 243.3333, GNorm = 0.1935, lr_0 = 9.9967e-04
Validation binary_cross_entropy = 0.061305
Epoch 1040
Loss = 1.8164e-02, PNorm = 243.4563, GNorm = 0.4397, lr_0 = 9.9967e-04
Validation binary_cross_entropy = 0.083549
Epoch 1041
Loss = 8.4477e-03, PNorm = 243.5520, GNorm = 0.0516, lr_0 = 9.9967e-04
Validation binary_cross_entropy = 0.071116
Epoch 1042
Loss = 1.6120e-03, PNorm = 243.6220, GNorm = 0.0288, lr_0 = 9.9967e-04
Validation binary_cross_entropy = 0.089931
Epoch 1043
Loss = 1.9416e-02, PNorm = 243.6941, GNorm = 0.8053, lr_0 = 9.9967e-04
Validation binary_cross_entropy = 0.256084
Epoch 1044
Loss = 1.5800e-01, PNorm = 243.9707, GNorm = 2.7185, lr_0 = 9.9967e-04
Validation binary_cross_entropy = 0.056581
Epoch 1045
Loss = 3.8854e-02, PNorm = 244.3472, GNorm = 0.7152, lr_0 = 9.9967e-04
Validation binary_cross_entropy = 0.095783
Epoch 1046
Loss = 2.9329e-02, PNorm = 244.6426, GNorm = 0.1756, lr_0 = 9.9967e-04
Validation binary_cross_entropy = 0.083042
Epoch 1047
Loss = 4.6632e-02, PNorm = 244.8357, GNorm = 1.2694, lr_0 = 9.9967e-04
Validation binary_cross_entropy = 0.059148
Epoch 1048
Loss = 9.9226e-03, PNorm = 245.0022, GNorm = 0.5282, lr_0 = 9.9967e-04
Validation binary_cross_entropy = 0.088708
Epoch 1049
Loss = 1.0992e-01, PNorm = 245.1573, GNorm = 1.3827, lr_0 = 9.9967e-04
Loss = 6.7032e-02, PNorm = 245.3057, GNorm = 0.5021, lr_0 = 9.9967e-04
Validation binary_cross_entropy = 0.056179
Epoch 1050
Loss = 5.9122e-02, PNorm = 245.4913, GNorm = 0.5696, lr_0 = 9.9966e-04
Validation binary_cross_entropy = 0.061752
Epoch 1051
Loss = 2.2554e-02, PNorm = 245.6527, GNorm = 0.1272, lr_0 = 9.9966e-04
Validation binary_cross_entropy = 0.059262
Epoch 1052
Loss = 5.4522e-02, PNorm = 245.7978, GNorm = 0.2713, lr_0 = 9.9966e-04
Validation binary_cross_entropy = 0.055735
Epoch 1053
Loss = 3.2392e-02, PNorm = 245.9586, GNorm = 0.5656, lr_0 = 9.9966e-04
Validation binary_cross_entropy = 0.054043
Epoch 1054
Loss = 9.3118e-03, PNorm = 246.1095, GNorm = 0.6012, lr_0 = 9.9966e-04
Validation binary_cross_entropy = 0.099375
Epoch 1055
Loss = 5.3693e-02, PNorm = 246.2286, GNorm = 1.3403, lr_0 = 9.9966e-04
Validation binary_cross_entropy = 0.045416
Epoch 1056
Loss = 2.1783e-02, PNorm = 246.3312, GNorm = 1.0952, lr_0 = 9.9966e-04
Validation binary_cross_entropy = 0.070408
Epoch 1057
Loss = 5.2897e-03, PNorm = 246.4773, GNorm = 0.0304, lr_0 = 9.9966e-04
Validation binary_cross_entropy = 0.055914
Epoch 1058
Loss = 3.0952e-02, PNorm = 246.5953, GNorm = 0.7013, lr_0 = 9.9966e-04
Validation binary_cross_entropy = 0.078455
Epoch 1059
Loss = 8.3215e-02, PNorm = 246.6983, GNorm = 4.7110, lr_0 = 9.9966e-04
Loss = 5.6156e-02, PNorm = 246.8185, GNorm = 0.2573, lr_0 = 9.9966e-04
Validation binary_cross_entropy = 0.055447
Epoch 1060
Loss = 4.0047e-02, PNorm = 246.9796, GNorm = 0.8982, lr_0 = 9.9966e-04
Validation binary_cross_entropy = 0.059212
Epoch 1061
Loss = 5.2916e-02, PNorm = 247.1245, GNorm = 0.7019, lr_0 = 9.9966e-04
Validation binary_cross_entropy = 0.052327
Epoch 1062
Loss = 2.0193e-02, PNorm = 247.2872, GNorm = 0.1607, lr_0 = 9.9966e-04
Validation binary_cross_entropy = 0.066167
Epoch 1063
Loss = 2.3001e-02, PNorm = 247.4628, GNorm = 0.1601, lr_0 = 9.9966e-04
Validation binary_cross_entropy = 0.050636
Epoch 1064
Loss = 2.0681e-02, PNorm = 247.6418, GNorm = 0.4002, lr_0 = 9.9965e-04
Validation binary_cross_entropy = 0.067154
Epoch 1065
Loss = 2.1929e-02, PNorm = 247.8144, GNorm = 0.3408, lr_0 = 9.9965e-04
Validation binary_cross_entropy = 0.056751
Epoch 1066
Loss = 3.2299e-02, PNorm = 247.9753, GNorm = 2.9535, lr_0 = 9.9965e-04
Validation binary_cross_entropy = 0.064746
Epoch 1067
Loss = 5.0745e-03, PNorm = 248.1276, GNorm = 0.6530, lr_0 = 9.9965e-04
Validation binary_cross_entropy = 0.046527
Epoch 1068
Loss = 2.8155e-02, PNorm = 248.2562, GNorm = 0.9311, lr_0 = 9.9965e-04
Validation binary_cross_entropy = 0.075932
Epoch 1069
Loss = 2.8785e-03, PNorm = 248.4061, GNorm = 0.0982, lr_0 = 9.9965e-04
Loss = 2.6541e-02, PNorm = 248.5559, GNorm = 0.4923, lr_0 = 9.9965e-04
Validation binary_cross_entropy = 0.048326
Epoch 1070
Loss = 1.5196e-02, PNorm = 248.6965, GNorm = 0.1483, lr_0 = 9.9965e-04
Validation binary_cross_entropy = 0.082524
Epoch 1071
Loss = 3.0234e-02, PNorm = 248.8045, GNorm = 0.8189, lr_0 = 9.9965e-04
Validation binary_cross_entropy = 0.058148
Epoch 1072
Loss = 2.4763e-02, PNorm = 248.9361, GNorm = 0.9335, lr_0 = 9.9965e-04
Validation binary_cross_entropy = 0.051962
Epoch 1073
Loss = 1.2451e-02, PNorm = 249.0837, GNorm = 0.0783, lr_0 = 9.9965e-04
Validation binary_cross_entropy = 0.077121
Epoch 1074
Loss = 2.5154e-02, PNorm = 249.1925, GNorm = 3.4700, lr_0 = 9.9965e-04
Validation binary_cross_entropy = 0.076161
Epoch 1075
Loss = 5.5211e-03, PNorm = 249.3255, GNorm = 0.4853, lr_0 = 9.9965e-04
Validation binary_cross_entropy = 0.048135
Epoch 1076
Loss = 6.6796e-02, PNorm = 249.4439, GNorm = 1.2430, lr_0 = 9.9965e-04
Validation binary_cross_entropy = 0.071697
Epoch 1077
Loss = 2.4014e-03, PNorm = 249.5993, GNorm = 0.0215, lr_0 = 9.9964e-04
Validation binary_cross_entropy = 0.057086
Epoch 1078
Loss = 7.1262e-02, PNorm = 249.7197, GNorm = 0.1225, lr_0 = 9.9964e-04
Validation binary_cross_entropy = 0.056475
Epoch 1079
Loss = 1.2895e-02, PNorm = 249.8938, GNorm = 0.2995, lr_0 = 9.9964e-04
Loss = 4.6127e-02, PNorm = 250.0951, GNorm = 2.7582, lr_0 = 9.9964e-04
Validation binary_cross_entropy = 0.064039
Epoch 1080
Loss = 2.0455e-02, PNorm = 250.2686, GNorm = 0.3907, lr_0 = 9.9964e-04
Validation binary_cross_entropy = 0.049242
Epoch 1081
Loss = 3.8250e-02, PNorm = 250.3951, GNorm = 0.0371, lr_0 = 9.9964e-04
Validation binary_cross_entropy = 0.065036
Epoch 1082
Loss = 1.7686e-02, PNorm = 250.5099, GNorm = 0.1741, lr_0 = 9.9964e-04
Validation binary_cross_entropy = 0.043449
Epoch 1083
Loss = 1.8558e-02, PNorm = 250.6071, GNorm = 1.2059, lr_0 = 9.9964e-04
Validation binary_cross_entropy = 0.056558
Epoch 1084
Loss = 3.0647e-02, PNorm = 250.7285, GNorm = 1.1016, lr_0 = 9.9964e-04
Validation binary_cross_entropy = 0.085652
Epoch 1085
Loss = 5.0270e-02, PNorm = 250.8690, GNorm = 0.2696, lr_0 = 9.9964e-04
Validation binary_cross_entropy = 0.045123
Epoch 1086
Loss = 2.7297e-02, PNorm = 251.0303, GNorm = 0.5130, lr_0 = 9.9964e-04
Validation binary_cross_entropy = 0.049305
Epoch 1087
Loss = 1.6306e-02, PNorm = 251.1887, GNorm = 0.3983, lr_0 = 9.9964e-04
Validation binary_cross_entropy = 0.070906
Epoch 1088
Loss = 7.6066e-02, PNorm = 251.3183, GNorm = 1.4907, lr_0 = 9.9964e-04
Validation binary_cross_entropy = 0.049100
Epoch 1089
Loss = 2.8375e-02, PNorm = 251.4409, GNorm = 0.7628, lr_0 = 9.9964e-04
Loss = 2.5688e-02, PNorm = 251.5681, GNorm = 0.8016, lr_0 = 9.9964e-04
Validation binary_cross_entropy = 0.047038
Epoch 1090
Loss = 6.9686e-02, PNorm = 251.6821, GNorm = 2.6383, lr_0 = 9.9963e-04
Validation binary_cross_entropy = 0.033749
Epoch 1091
Loss = 3.6663e-02, PNorm = 251.8173, GNorm = 2.1704, lr_0 = 9.9963e-04
Validation binary_cross_entropy = 0.043803
Epoch 1092
Loss = 1.9114e-02, PNorm = 251.9737, GNorm = 0.1586, lr_0 = 9.9963e-04
Validation binary_cross_entropy = 0.058195
Epoch 1093
Loss = 3.4228e-02, PNorm = 252.1067, GNorm = 0.9545, lr_0 = 9.9963e-04
Validation binary_cross_entropy = 0.042748
Epoch 1094
Loss = 4.1621e-02, PNorm = 252.2585, GNorm = 0.2404, lr_0 = 9.9963e-04
Validation binary_cross_entropy = 0.073840
Epoch 1095
Loss = 1.7010e-02, PNorm = 252.4176, GNorm = 1.0717, lr_0 = 9.9963e-04
Validation binary_cross_entropy = 0.046256
Epoch 1096
Loss = 2.7729e-02, PNorm = 252.5434, GNorm = 0.9743, lr_0 = 9.9963e-04
Validation binary_cross_entropy = 0.042578
Epoch 1097
Loss = 4.1436e-03, PNorm = 252.6912, GNorm = 0.1802, lr_0 = 9.9963e-04
Validation binary_cross_entropy = 0.042930
Epoch 1098
Loss = 4.3500e-02, PNorm = 252.8017, GNorm = 0.3417, lr_0 = 9.9963e-04
Validation binary_cross_entropy = 0.034715
Epoch 1099
Loss = 1.0019e-02, PNorm = 252.8930, GNorm = 0.2698, lr_0 = 9.9963e-04
Loss = 2.4100e-02, PNorm = 253.0084, GNorm = 1.9281, lr_0 = 9.9963e-04
Validation binary_cross_entropy = 0.060129
Epoch 1100
Loss = 9.1565e-03, PNorm = 253.1120, GNorm = 0.0272, lr_0 = 9.9963e-04
Validation binary_cross_entropy = 0.047588
Epoch 1101
Loss = 3.3089e-02, PNorm = 253.1992, GNorm = 0.7759, lr_0 = 9.9963e-04
Validation binary_cross_entropy = 0.039938
Epoch 1102
Loss = 2.3606e-02, PNorm = 253.3006, GNorm = 0.0927, lr_0 = 9.9963e-04
Validation binary_cross_entropy = 0.053846
Epoch 1103
Loss = 2.2489e-02, PNorm = 253.4004, GNorm = 0.3687, lr_0 = 9.9962e-04
Validation binary_cross_entropy = 0.041486
Epoch 1104
Loss = 3.0846e-02, PNorm = 253.5021, GNorm = 0.1953, lr_0 = 9.9962e-04
Validation binary_cross_entropy = 0.042198
Epoch 1105
Loss = 7.9647e-03, PNorm = 253.6005, GNorm = 1.8141, lr_0 = 9.9962e-04
Validation binary_cross_entropy = 0.104701
Epoch 1106
Loss = 5.8455e-02, PNorm = 253.6781, GNorm = 1.1050, lr_0 = 9.9962e-04
Validation binary_cross_entropy = 0.041902
Epoch 1107
Loss = 1.6853e-02, PNorm = 253.7313, GNorm = 1.8422, lr_0 = 9.9962e-04
Validation binary_cross_entropy = 0.078306
Epoch 1108
Loss = 3.4138e-02, PNorm = 253.8623, GNorm = 1.3869, lr_0 = 9.9962e-04
Validation binary_cross_entropy = 0.043388
Epoch 1109
Loss = 1.4507e-02, PNorm = 253.9818, GNorm = 0.3243, lr_0 = 9.9962e-04
Loss = 4.1265e-02, PNorm = 254.1074, GNorm = 1.1711, lr_0 = 9.9962e-04
Validation binary_cross_entropy = 0.079624
Epoch 1110
Loss = 6.6025e-02, PNorm = 254.2596, GNorm = 3.1180, lr_0 = 9.9962e-04
Validation binary_cross_entropy = 0.145215
Epoch 1111
Loss = 9.4047e-02, PNorm = 254.5521, GNorm = 0.5617, lr_0 = 9.9962e-04
Validation binary_cross_entropy = 0.061748
Epoch 1112
Loss = 2.9445e-02, PNorm = 254.8366, GNorm = 1.6512, lr_0 = 9.9962e-04
Validation binary_cross_entropy = 0.102122
Epoch 1113
Loss = 2.0289e-02, PNorm = 255.0116, GNorm = 1.6141, lr_0 = 9.9962e-04
Validation binary_cross_entropy = 0.065489
Epoch 1114
Loss = 3.8841e-02, PNorm = 255.1427, GNorm = 0.6444, lr_0 = 9.9962e-04
Validation binary_cross_entropy = 0.055464
Epoch 1115
Loss = 1.5797e-02, PNorm = 255.2907, GNorm = 0.0492, lr_0 = 9.9962e-04
Validation binary_cross_entropy = 0.057159
Epoch 1116
Loss = 8.6258e-03, PNorm = 255.4059, GNorm = 0.2156, lr_0 = 9.9962e-04
Validation binary_cross_entropy = 0.047787
Epoch 1117
Loss = 1.5635e-02, PNorm = 255.5118, GNorm = 1.1955, lr_0 = 9.9961e-04
Validation binary_cross_entropy = 0.091347
Epoch 1118
Loss = 9.9811e-04, PNorm = 255.6585, GNorm = 0.0291, lr_0 = 9.9961e-04
Validation binary_cross_entropy = 0.073865
Epoch 1119
Loss = 2.5068e-03, PNorm = 255.7813, GNorm = 0.0738, lr_0 = 9.9961e-04
Loss = 4.7478e-02, PNorm = 255.9212, GNorm = 1.0402, lr_0 = 9.9961e-04
Validation binary_cross_entropy = 0.083771
Epoch 1120
Loss = 3.2488e-02, PNorm = 256.0692, GNorm = 0.7248, lr_0 = 9.9961e-04
Validation binary_cross_entropy = 0.050198
Epoch 1121
Loss = 3.2109e-02, PNorm = 256.2381, GNorm = 1.1749, lr_0 = 9.9961e-04
Validation binary_cross_entropy = 0.087689
Epoch 1122
Loss = 4.3253e-02, PNorm = 256.3848, GNorm = 2.4313, lr_0 = 9.9961e-04
Validation binary_cross_entropy = 0.066290
Epoch 1123
Loss = 5.0856e-02, PNorm = 256.5324, GNorm = 0.0669, lr_0 = 9.9961e-04
Validation binary_cross_entropy = 0.065073
Epoch 1124
Loss = 2.3689e-02, PNorm = 256.6852, GNorm = 0.1741, lr_0 = 9.9961e-04
Validation binary_cross_entropy = 0.060264
Epoch 1125
Loss = 1.5514e-02, PNorm = 256.8503, GNorm = 0.8357, lr_0 = 9.9961e-04
Validation binary_cross_entropy = 0.064190
Epoch 1126
Loss = 1.8082e-02, PNorm = 256.9873, GNorm = 1.1366, lr_0 = 9.9961e-04
Validation binary_cross_entropy = 0.059499
Epoch 1127
Loss = 1.8806e-02, PNorm = 257.0855, GNorm = 0.2409, lr_0 = 9.9961e-04
Validation binary_cross_entropy = 0.036781
Epoch 1128
Loss = 4.8787e-02, PNorm = 257.2381, GNorm = 0.1573, lr_0 = 9.9961e-04
Validation binary_cross_entropy = 0.050273
Epoch 1129
Loss = 1.6367e-01, PNorm = 257.4105, GNorm = 1.1903, lr_0 = 9.9961e-04
Loss = 2.5620e-02, PNorm = 257.5509, GNorm = 0.1669, lr_0 = 9.9960e-04
Validation binary_cross_entropy = 0.057450
Epoch 1130
Loss = 2.1488e-02, PNorm = 257.6625, GNorm = 0.0508, lr_0 = 9.9960e-04
Validation binary_cross_entropy = 0.058176
Epoch 1131
Loss = 2.2076e-02, PNorm = 257.7453, GNorm = 0.2431, lr_0 = 9.9960e-04
Validation binary_cross_entropy = 0.040638
Epoch 1132
Loss = 7.0614e-03, PNorm = 257.8348, GNorm = 0.1203, lr_0 = 9.9960e-04
Validation binary_cross_entropy = 0.049676
Epoch 1133
Loss = 3.7388e-03, PNorm = 257.9230, GNorm = 0.0373, lr_0 = 9.9960e-04
Validation binary_cross_entropy = 0.064450
Epoch 1134
Loss = 1.3648e-02, PNorm = 258.0042, GNorm = 1.6815, lr_0 = 9.9960e-04
Validation binary_cross_entropy = 0.051075
Epoch 1135
Loss = 3.4159e-02, PNorm = 258.1206, GNorm = 0.8061, lr_0 = 9.9960e-04
Validation binary_cross_entropy = 0.060824
Epoch 1136
Loss = 8.2720e-03, PNorm = 258.3030, GNorm = 0.1915, lr_0 = 9.9960e-04
Validation binary_cross_entropy = 0.049521
Epoch 1137
Loss = 8.8399e-03, PNorm = 258.4436, GNorm = 0.2282, lr_0 = 9.9960e-04
Validation binary_cross_entropy = 0.047378
Epoch 1138
Loss = 1.7923e-02, PNorm = 258.5635, GNorm = 0.9140, lr_0 = 9.9960e-04
Validation binary_cross_entropy = 0.081825
Epoch 1139
Loss = 2.4531e-02, PNorm = 258.7256, GNorm = 0.8274, lr_0 = 9.9960e-04
Loss = 2.4122e-02, PNorm = 258.8947, GNorm = 1.0021, lr_0 = 9.9960e-04
Validation binary_cross_entropy = 0.059565
Epoch 1140
Loss = 3.0322e-02, PNorm = 259.0644, GNorm = 0.4108, lr_0 = 9.9960e-04
Validation binary_cross_entropy = 0.045447
Epoch 1141
Loss = 3.9520e-02, PNorm = 259.2126, GNorm = 0.2266, lr_0 = 9.9960e-04
Validation binary_cross_entropy = 0.047488
Epoch 1142
Loss = 2.1289e-02, PNorm = 259.3468, GNorm = 2.1096, lr_0 = 9.9960e-04
Validation binary_cross_entropy = 0.072298
Epoch 1143
Loss = 3.1226e-02, PNorm = 259.4620, GNorm = 1.7782, lr_0 = 9.9959e-04
Validation binary_cross_entropy = 0.038785
Epoch 1144
Loss = 2.6647e-02, PNorm = 259.5897, GNorm = 1.0599, lr_0 = 9.9959e-04
Validation binary_cross_entropy = 0.055392
Epoch 1145
Loss = 3.3021e-02, PNorm = 259.7056, GNorm = 0.0776, lr_0 = 9.9959e-04
Validation binary_cross_entropy = 0.042318
Epoch 1146
Loss = 2.7291e-02, PNorm = 259.8009, GNorm = 0.4671, lr_0 = 9.9959e-04
Validation binary_cross_entropy = 0.042066
Epoch 1147
Loss = 8.9331e-03, PNorm = 259.9183, GNorm = 0.4344, lr_0 = 9.9959e-04
Validation binary_cross_entropy = 0.064206
Epoch 1148
Loss = 5.7009e-03, PNorm = 260.0388, GNorm = 0.1828, lr_0 = 9.9959e-04
Validation binary_cross_entropy = 0.090539
Epoch 1149
Loss = 5.7665e-02, PNorm = 260.1413, GNorm = 1.6906, lr_0 = 9.9959e-04
Loss = 5.5286e-03, PNorm = 260.2195, GNorm = 0.1763, lr_0 = 9.9959e-04
Validation binary_cross_entropy = 0.074341
Epoch 1150
Loss = 6.9200e-02, PNorm = 260.2852, GNorm = 2.4315, lr_0 = 9.9959e-04
Validation binary_cross_entropy = 0.056735
Epoch 1151
Loss = 1.5219e-02, PNorm = 260.4117, GNorm = 0.1972, lr_0 = 9.9959e-04
Validation binary_cross_entropy = 0.072619
Epoch 1152
Loss = 1.4175e-02, PNorm = 260.5445, GNorm = 0.0956, lr_0 = 9.9959e-04
Validation binary_cross_entropy = 0.063901
Epoch 1153
Loss = 3.9718e-02, PNorm = 260.6625, GNorm = 0.4472, lr_0 = 9.9959e-04
Validation binary_cross_entropy = 0.103608
Epoch 1154
Loss = 1.3906e-02, PNorm = 260.7918, GNorm = 1.8250, lr_0 = 9.9959e-04
Validation binary_cross_entropy = 0.157359
Epoch 1155
Loss = 1.4563e-01, PNorm = 260.9473, GNorm = 0.5730, lr_0 = 9.9959e-04
Validation binary_cross_entropy = 0.079155
Epoch 1156
Loss = 2.4651e-02, PNorm = 261.2168, GNorm = 0.2782, lr_0 = 9.9958e-04
Validation binary_cross_entropy = 0.084333
Epoch 1157
Loss = 1.6401e-02, PNorm = 261.4830, GNorm = 1.0098, lr_0 = 9.9958e-04
Validation binary_cross_entropy = 0.051735
Epoch 1158
Loss = 1.1900e-02, PNorm = 261.7003, GNorm = 0.5575, lr_0 = 9.9958e-04
Validation binary_cross_entropy = 0.082721
Epoch 1159
Loss = 1.1958e-02, PNorm = 261.8792, GNorm = 0.4861, lr_0 = 9.9958e-04
Loss = 8.6100e-02, PNorm = 262.0236, GNorm = 0.6598, lr_0 = 9.9958e-04
Validation binary_cross_entropy = 0.036881
Epoch 1160
Loss = 5.7332e-02, PNorm = 262.2131, GNorm = 0.3167, lr_0 = 9.9958e-04
Validation binary_cross_entropy = 0.067119
Epoch 1161
Loss = 3.6885e-02, PNorm = 262.3746, GNorm = 0.1242, lr_0 = 9.9958e-04
Validation binary_cross_entropy = 0.048548
Epoch 1162
Loss = 5.5037e-02, PNorm = 262.4828, GNorm = 0.2937, lr_0 = 9.9958e-04
Validation binary_cross_entropy = 0.077120
Epoch 1163
Loss = 4.8288e-02, PNorm = 262.6246, GNorm = 0.7101, lr_0 = 9.9958e-04
Validation binary_cross_entropy = 0.061130
Epoch 1164
Loss = 6.5914e-02, PNorm = 262.7794, GNorm = 0.4726, lr_0 = 9.9958e-04
Validation binary_cross_entropy = 0.092415
Epoch 1165
Loss = 4.5376e-02, PNorm = 262.9468, GNorm = 0.6077, lr_0 = 9.9958e-04
Validation binary_cross_entropy = 0.048763
Epoch 1166
Loss = 3.2256e-02, PNorm = 263.1008, GNorm = 0.3909, lr_0 = 9.9958e-04
Validation binary_cross_entropy = 0.082286
Epoch 1167
Loss = 2.9632e-02, PNorm = 263.2343, GNorm = 0.0198, lr_0 = 9.9958e-04
Validation binary_cross_entropy = 0.092400
Epoch 1168
Loss = 6.0250e-02, PNorm = 263.3633, GNorm = 0.1611, lr_0 = 9.9958e-04
Validation binary_cross_entropy = 0.081234
Epoch 1169
Loss = 6.0831e-02, PNorm = 263.5075, GNorm = 1.3724, lr_0 = 9.9958e-04
Loss = 4.0538e-02, PNorm = 263.6446, GNorm = 0.8063, lr_0 = 9.9957e-04
Validation binary_cross_entropy = 0.053973
Epoch 1170
Loss = 2.1010e-02, PNorm = 263.7899, GNorm = 0.2241, lr_0 = 9.9957e-04
Validation binary_cross_entropy = 0.069010
Epoch 1171
Loss = 2.0151e-02, PNorm = 263.9181, GNorm = 0.2653, lr_0 = 9.9957e-04
Validation binary_cross_entropy = 0.057326
Epoch 1172
Loss = 3.9424e-02, PNorm = 264.0504, GNorm = 2.7140, lr_0 = 9.9957e-04
Validation binary_cross_entropy = 0.070387
Epoch 1173
Loss = 3.1383e-02, PNorm = 264.2370, GNorm = 1.6346, lr_0 = 9.9957e-04
Validation binary_cross_entropy = 0.052174
Epoch 1174
Loss = 5.0079e-02, PNorm = 264.4668, GNorm = 0.8959, lr_0 = 9.9957e-04
Validation binary_cross_entropy = 0.076350
Epoch 1175
Loss = 2.7952e-02, PNorm = 264.6918, GNorm = 1.6255, lr_0 = 9.9957e-04
Validation binary_cross_entropy = 0.048997
Epoch 1176
Loss = 2.5996e-02, PNorm = 264.8541, GNorm = 0.1432, lr_0 = 9.9957e-04
Validation binary_cross_entropy = 0.062045
Epoch 1177
Loss = 1.8420e-03, PNorm = 264.9708, GNorm = 0.1674, lr_0 = 9.9957e-04
Validation binary_cross_entropy = 0.118631
Epoch 1178
Loss = 1.4497e-02, PNorm = 265.0867, GNorm = 0.6509, lr_0 = 9.9957e-04
Validation binary_cross_entropy = 0.055534
Epoch 1179
Loss = 1.5981e-02, PNorm = 265.2290, GNorm = 0.5616, lr_0 = 9.9957e-04
Loss = 2.2725e-02, PNorm = 265.3543, GNorm = 0.1101, lr_0 = 9.9957e-04
Validation binary_cross_entropy = 0.070195
Epoch 1180
Loss = 5.5836e-03, PNorm = 265.4517, GNorm = 0.0530, lr_0 = 9.9957e-04
Validation binary_cross_entropy = 0.073557
Epoch 1181
Loss = 1.0696e-02, PNorm = 265.5140, GNorm = 0.5479, lr_0 = 9.9957e-04
Validation binary_cross_entropy = 0.054587
Epoch 1182
Loss = 2.5090e-02, PNorm = 265.5806, GNorm = 1.1161, lr_0 = 9.9956e-04
Validation binary_cross_entropy = 0.076796
Epoch 1183
Loss = 1.7806e-02, PNorm = 265.6715, GNorm = 1.7013, lr_0 = 9.9956e-04
Validation binary_cross_entropy = 0.075564
Epoch 1184
Loss = 4.6543e-02, PNorm = 265.8108, GNorm = 0.3426, lr_0 = 9.9956e-04
Validation binary_cross_entropy = 0.102443
Epoch 1185
Loss = 7.9870e-02, PNorm = 265.9649, GNorm = 1.6482, lr_0 = 9.9956e-04
Validation binary_cross_entropy = 0.096812
Epoch 1186
Loss = 5.4816e-02, PNorm = 266.1365, GNorm = 1.7371, lr_0 = 9.9956e-04
Validation binary_cross_entropy = 0.120644
Epoch 1187
Loss = 1.3155e-02, PNorm = 266.2904, GNorm = 0.4125, lr_0 = 9.9956e-04
Validation binary_cross_entropy = 0.116808
Epoch 1188
Loss = 5.9691e-02, PNorm = 266.4198, GNorm = 2.1328, lr_0 = 9.9956e-04
Validation binary_cross_entropy = 0.081360
Epoch 1189
Loss = 1.7914e-02, PNorm = 266.5545, GNorm = 0.6583, lr_0 = 9.9956e-04
Loss = 1.4214e-02, PNorm = 266.6950, GNorm = 0.1526, lr_0 = 9.9956e-04
Validation binary_cross_entropy = 0.115323
Epoch 1190
Loss = 2.7306e-02, PNorm = 266.7820, GNorm = 0.1606, lr_0 = 9.9956e-04
Validation binary_cross_entropy = 0.079684
Epoch 1191
Loss = 1.0346e-02, PNorm = 266.8581, GNorm = 0.2484, lr_0 = 9.9956e-04
Validation binary_cross_entropy = 0.065672
Epoch 1192
Loss = 1.1299e-02, PNorm = 266.9395, GNorm = 0.0692, lr_0 = 9.9956e-04
Validation binary_cross_entropy = 0.083519
Epoch 1193
Loss = 1.5101e-02, PNorm = 267.0092, GNorm = 0.2722, lr_0 = 9.9956e-04
Validation binary_cross_entropy = 0.062710
Epoch 1194
Loss = 9.9155e-03, PNorm = 267.1090, GNorm = 0.3211, lr_0 = 9.9956e-04
Validation binary_cross_entropy = 0.053567
Epoch 1195
Loss = 1.4392e-02, PNorm = 267.2528, GNorm = 0.2330, lr_0 = 9.9956e-04
Validation binary_cross_entropy = 0.097002
Epoch 1196
Loss = 9.4985e-02, PNorm = 267.3851, GNorm = 0.7195, lr_0 = 9.9955e-04
Validation binary_cross_entropy = 0.087045
Epoch 1197
Loss = 3.4568e-02, PNorm = 267.5306, GNorm = 0.4239, lr_0 = 9.9955e-04
Validation binary_cross_entropy = 0.053960
Epoch 1198
Loss = 6.2359e-03, PNorm = 267.6538, GNorm = 0.1000, lr_0 = 9.9955e-04
Validation binary_cross_entropy = 0.061168
Epoch 1199
Loss = 1.0411e-02, PNorm = 267.7947, GNorm = 0.2501, lr_0 = 9.9955e-04
Loss = 2.5569e-02, PNorm = 267.9225, GNorm = 0.1018, lr_0 = 9.9955e-04
Validation binary_cross_entropy = 0.076620
Epoch 1200
Loss = 2.6197e-02, PNorm = 268.0365, GNorm = 0.6428, lr_0 = 9.9955e-04
Validation binary_cross_entropy = 0.050038
Epoch 1201
Loss = 3.6295e-02, PNorm = 268.1710, GNorm = 0.9216, lr_0 = 9.9955e-04
Validation binary_cross_entropy = 0.053453
Epoch 1202
Loss = 3.1998e-02, PNorm = 268.2909, GNorm = 0.3863, lr_0 = 9.9955e-04
Validation binary_cross_entropy = 0.051920
Epoch 1203
Loss = 2.3417e-02, PNorm = 268.4392, GNorm = 0.1752, lr_0 = 9.9955e-04
Validation binary_cross_entropy = 0.074828
Epoch 1204
Loss = 7.3157e-03, PNorm = 268.5517, GNorm = 0.0469, lr_0 = 9.9955e-04
Validation binary_cross_entropy = 0.093674
Epoch 1205
Loss = 4.6232e-02, PNorm = 268.6369, GNorm = 1.6131, lr_0 = 9.9955e-04
Validation binary_cross_entropy = 0.061593
Epoch 1206
Loss = 4.6176e-03, PNorm = 268.7283, GNorm = 0.1106, lr_0 = 9.9955e-04
Validation binary_cross_entropy = 0.147268
Epoch 1207
Loss = 2.8881e-03, PNorm = 268.8146, GNorm = 0.1349, lr_0 = 9.9955e-04
Validation binary_cross_entropy = 0.079431
Epoch 1208
Loss = 3.6757e-02, PNorm = 268.8835, GNorm = 1.0829, lr_0 = 9.9955e-04
Validation binary_cross_entropy = 0.083011
Epoch 1209
Loss = 9.9977e-03, PNorm = 269.0506, GNorm = 0.3681, lr_0 = 9.9954e-04
Loss = 7.5219e-02, PNorm = 269.1861, GNorm = 1.4856, lr_0 = 9.9954e-04
Validation binary_cross_entropy = 0.062767
Epoch 1210
Loss = 4.7177e-02, PNorm = 269.3837, GNorm = 0.4107, lr_0 = 9.9954e-04
Validation binary_cross_entropy = 0.051882
Epoch 1211
Loss = 2.8154e-02, PNorm = 269.5847, GNorm = 0.1636, lr_0 = 9.9954e-04
Validation binary_cross_entropy = 0.078107
Epoch 1212
Loss = 2.8516e-02, PNorm = 269.7417, GNorm = 0.1143, lr_0 = 9.9954e-04
Validation binary_cross_entropy = 0.090375
Epoch 1213
Loss = 1.4819e-02, PNorm = 269.8743, GNorm = 2.4631, lr_0 = 9.9954e-04
Validation binary_cross_entropy = 0.091445
Epoch 1214
Loss = 4.1000e-02, PNorm = 270.0170, GNorm = 3.0076, lr_0 = 9.9954e-04
Validation binary_cross_entropy = 0.112675
Epoch 1215
Loss = 3.9117e-02, PNorm = 270.1952, GNorm = 0.4837, lr_0 = 9.9954e-04
Validation binary_cross_entropy = 0.073702
Epoch 1216
Loss = 1.0096e-02, PNorm = 270.3604, GNorm = 0.0995, lr_0 = 9.9954e-04
Validation binary_cross_entropy = 0.082747
Epoch 1217
Loss = 1.0913e-02, PNorm = 270.4779, GNorm = 0.9063, lr_0 = 9.9954e-04
Validation binary_cross_entropy = 0.073494
Epoch 1218
Loss = 1.5276e-02, PNorm = 270.5908, GNorm = 0.9013, lr_0 = 9.9954e-04
Validation binary_cross_entropy = 0.055918
Epoch 1219
Loss = 3.9351e-03, PNorm = 270.7096, GNorm = 0.1206, lr_0 = 9.9954e-04
Loss = 1.5497e-02, PNorm = 270.8692, GNorm = 0.1823, lr_0 = 9.9954e-04
Validation binary_cross_entropy = 0.080745
Epoch 1220
Loss = 7.3663e-02, PNorm = 270.9991, GNorm = 0.6241, lr_0 = 9.9954e-04
Validation binary_cross_entropy = 0.050476
Epoch 1221
Loss = 3.9481e-02, PNorm = 271.1546, GNorm = 1.9780, lr_0 = 9.9954e-04
Validation binary_cross_entropy = 0.071710
Epoch 1222
Loss = 1.1071e-02, PNorm = 271.3163, GNorm = 0.3195, lr_0 = 9.9953e-04
Validation binary_cross_entropy = 0.067580
Epoch 1223
Loss = 2.0895e-02, PNorm = 271.4606, GNorm = 0.0180, lr_0 = 9.9953e-04
Validation binary_cross_entropy = 0.061616
Epoch 1224
Loss = 5.8235e-02, PNorm = 271.6198, GNorm = 1.0585, lr_0 = 9.9953e-04
Validation binary_cross_entropy = 0.058375
Epoch 1225
Loss = 5.1068e-02, PNorm = 271.8315, GNorm = 1.0795, lr_0 = 9.9953e-04
Validation binary_cross_entropy = 0.079909
Epoch 1226
Loss = 2.4456e-02, PNorm = 272.0349, GNorm = 0.1308, lr_0 = 9.9953e-04
Validation binary_cross_entropy = 0.067533
Epoch 1227
Loss = 1.3614e-02, PNorm = 272.1888, GNorm = 0.0545, lr_0 = 9.9953e-04
Validation binary_cross_entropy = 0.066694
Epoch 1228
Loss = 4.2451e-03, PNorm = 272.2987, GNorm = 0.7725, lr_0 = 9.9953e-04
Validation binary_cross_entropy = 0.114061
Epoch 1229
Loss = 5.1823e-03, PNorm = 272.4196, GNorm = 0.5443, lr_0 = 9.9953e-04
Loss = 6.6357e-02, PNorm = 272.5998, GNorm = 1.6731, lr_0 = 9.9953e-04
Validation binary_cross_entropy = 0.071868
Epoch 1230
Loss = 2.9363e-02, PNorm = 272.7906, GNorm = 0.1971, lr_0 = 9.9953e-04
Validation binary_cross_entropy = 0.105951
Epoch 1231
Loss = 6.0886e-02, PNorm = 272.9486, GNorm = 1.9181, lr_0 = 9.9953e-04
Validation binary_cross_entropy = 0.061581
Epoch 1232
Loss = 4.4634e-02, PNorm = 273.1319, GNorm = 1.2366, lr_0 = 9.9953e-04
Validation binary_cross_entropy = 0.060272
Epoch 1233
Loss = 2.5821e-02, PNorm = 273.2749, GNorm = 0.0495, lr_0 = 9.9953e-04
Validation binary_cross_entropy = 0.067481
Epoch 1234
Loss = 6.0786e-02, PNorm = 273.3956, GNorm = 0.7286, lr_0 = 9.9953e-04
Validation binary_cross_entropy = 0.058103
Epoch 1235
Loss = 2.0295e-02, PNorm = 273.5333, GNorm = 0.2237, lr_0 = 9.9952e-04
Validation binary_cross_entropy = 0.063702
Epoch 1236
Loss = 6.8371e-03, PNorm = 273.6622, GNorm = 0.1615, lr_0 = 9.9952e-04
Validation binary_cross_entropy = 0.061071
Epoch 1237
Loss = 4.1793e-02, PNorm = 273.7824, GNorm = 0.6881, lr_0 = 9.9952e-04
Validation binary_cross_entropy = 0.073382
Epoch 1238
Loss = 1.8545e-02, PNorm = 273.9131, GNorm = 1.1883, lr_0 = 9.9952e-04
Validation binary_cross_entropy = 0.076208
Epoch 1239
Loss = 4.5673e-02, PNorm = 274.0543, GNorm = 0.5806, lr_0 = 9.9952e-04
Loss = 3.2222e-02, PNorm = 274.1883, GNorm = 0.6036, lr_0 = 9.9952e-04
Validation binary_cross_entropy = 0.051551
Epoch 1240
Loss = 1.8728e-02, PNorm = 274.3051, GNorm = 1.4222, lr_0 = 9.9952e-04
Validation binary_cross_entropy = 0.063498
Epoch 1241
Loss = 4.2059e-02, PNorm = 274.3927, GNorm = 2.6234, lr_0 = 9.9952e-04
Validation binary_cross_entropy = 0.053198
Epoch 1242
Loss = 5.1815e-03, PNorm = 274.4929, GNorm = 0.6339, lr_0 = 9.9952e-04
Validation binary_cross_entropy = 0.058757
Epoch 1243
Loss = 1.5430e-02, PNorm = 274.5817, GNorm = 0.1886, lr_0 = 9.9952e-04
Validation binary_cross_entropy = 0.089339
Epoch 1244
Loss = 1.9905e-02, PNorm = 274.6819, GNorm = 0.4482, lr_0 = 9.9952e-04
Validation binary_cross_entropy = 0.061735
Epoch 1245
Loss = 3.9260e-02, PNorm = 274.7821, GNorm = 2.0187, lr_0 = 9.9952e-04
Validation binary_cross_entropy = 0.061948
Epoch 1246
Loss = 1.3651e-02, PNorm = 274.8776, GNorm = 0.0400, lr_0 = 9.9952e-04
Validation binary_cross_entropy = 0.079632
Epoch 1247
Loss = 1.3301e-02, PNorm = 275.0028, GNorm = 0.1934, lr_0 = 9.9952e-04
Validation binary_cross_entropy = 0.058514
Epoch 1248
Loss = 3.1894e-03, PNorm = 275.1474, GNorm = 0.0812, lr_0 = 9.9952e-04
Validation binary_cross_entropy = 0.086214
Epoch 1249
Loss = 2.3295e-03, PNorm = 275.2476, GNorm = 0.1116, lr_0 = 9.9951e-04
Loss = 5.8717e-02, PNorm = 275.3415, GNorm = 2.2178, lr_0 = 9.9951e-04
Validation binary_cross_entropy = 0.070051
Epoch 1250
Loss = 3.7737e-02, PNorm = 275.5143, GNorm = 0.4397, lr_0 = 9.9951e-04
Validation binary_cross_entropy = 0.049557
Epoch 1251
Loss = 8.0832e-02, PNorm = 275.7136, GNorm = 1.6382, lr_0 = 9.9951e-04
Validation binary_cross_entropy = 0.081162
Epoch 1252
Loss = 4.7036e-02, PNorm = 275.9111, GNorm = 0.1803, lr_0 = 9.9951e-04
Validation binary_cross_entropy = 0.060012
Epoch 1253
Loss = 2.7647e-02, PNorm = 276.0918, GNorm = 0.4025, lr_0 = 9.9951e-04
Validation binary_cross_entropy = 0.060751
Epoch 1254
Loss = 2.0938e-03, PNorm = 276.2197, GNorm = 0.0642, lr_0 = 9.9951e-04
Validation binary_cross_entropy = 0.082300
Epoch 1255
Loss = 2.9503e-03, PNorm = 276.3032, GNorm = 0.4519, lr_0 = 9.9951e-04
Validation binary_cross_entropy = 0.067256
Epoch 1256
Loss = 2.1800e-02, PNorm = 276.3935, GNorm = 0.7639, lr_0 = 9.9951e-04
Validation binary_cross_entropy = 0.105296
Epoch 1257
Loss = 2.0007e-02, PNorm = 276.5676, GNorm = 0.0233, lr_0 = 9.9951e-04
Validation binary_cross_entropy = 0.062685
Epoch 1258
Loss = 1.8482e-02, PNorm = 276.7691, GNorm = 0.6109, lr_0 = 9.9951e-04
Validation binary_cross_entropy = 0.099363
Epoch 1259
Loss = 1.0518e-01, PNorm = 276.9608, GNorm = 1.5050, lr_0 = 9.9951e-04
Loss = 3.9579e-02, PNorm = 277.1312, GNorm = 1.5166, lr_0 = 9.9951e-04
Validation binary_cross_entropy = 0.056169
Epoch 1260
Loss = 3.1421e-02, PNorm = 277.2854, GNorm = 0.5778, lr_0 = 9.9951e-04
Validation binary_cross_entropy = 0.059535
Epoch 1261
Loss = 1.1551e-01, PNorm = 277.4535, GNorm = 0.7224, lr_0 = 9.9951e-04
Validation binary_cross_entropy = 0.057558
Epoch 1262
Loss = 2.6045e-02, PNorm = 277.6349, GNorm = 0.5554, lr_0 = 9.9950e-04
Validation binary_cross_entropy = 0.053691
Epoch 1263
Loss = 2.0690e-02, PNorm = 277.7930, GNorm = 0.3488, lr_0 = 9.9950e-04
Validation binary_cross_entropy = 0.065004
Epoch 1264
Loss = 1.1568e-02, PNorm = 277.9281, GNorm = 0.2788, lr_0 = 9.9950e-04
Validation binary_cross_entropy = 0.063974
Epoch 1265
Loss = 1.2564e-02, PNorm = 278.0313, GNorm = 0.0261, lr_0 = 9.9950e-04
Validation binary_cross_entropy = 0.085690
Epoch 1266
Loss = 3.8322e-03, PNorm = 278.1165, GNorm = 0.2606, lr_0 = 9.9950e-04
Validation binary_cross_entropy = 0.053595
Epoch 1267
Loss = 1.7097e-02, PNorm = 278.2120, GNorm = 1.7109, lr_0 = 9.9950e-04
Validation binary_cross_entropy = 0.055709
Epoch 1268
Loss = 3.2917e-02, PNorm = 278.3069, GNorm = 0.4524, lr_0 = 9.9950e-04
Validation binary_cross_entropy = 0.054794
Epoch 1269
Loss = 1.3081e-01, PNorm = 278.4098, GNorm = 1.5610, lr_0 = 9.9950e-04
Loss = 1.9938e-02, PNorm = 278.5402, GNorm = 0.0876, lr_0 = 9.9950e-04
Validation binary_cross_entropy = 0.093757
Epoch 1270
Loss = 1.5524e-02, PNorm = 278.6535, GNorm = 0.0368, lr_0 = 9.9950e-04
Validation binary_cross_entropy = 0.063864
Epoch 1271
Loss = 3.3621e-02, PNorm = 278.7543, GNorm = 3.5909, lr_0 = 9.9950e-04
Validation binary_cross_entropy = 0.055421
Epoch 1272
Loss = 4.7402e-02, PNorm = 278.9227, GNorm = 1.1009, lr_0 = 9.9950e-04
Validation binary_cross_entropy = 0.071586
Epoch 1273
Loss = 4.2020e-02, PNorm = 279.0976, GNorm = 0.2703, lr_0 = 9.9950e-04
Validation binary_cross_entropy = 0.058576
Epoch 1274
Loss = 2.0858e-02, PNorm = 279.2248, GNorm = 0.1521, lr_0 = 9.9950e-04
Validation binary_cross_entropy = 0.062328
Epoch 1275
Loss = 1.2735e-02, PNorm = 279.3197, GNorm = 0.0555, lr_0 = 9.9949e-04
Validation binary_cross_entropy = 0.068647
Epoch 1276
Loss = 4.9566e-02, PNorm = 279.3836, GNorm = 0.6520, lr_0 = 9.9949e-04
Validation binary_cross_entropy = 0.057823
Epoch 1277
Loss = 2.4960e-02, PNorm = 279.4472, GNorm = 1.3243, lr_0 = 9.9949e-04
Validation binary_cross_entropy = 0.098911
Epoch 1278
Loss = 9.6258e-03, PNorm = 279.5268, GNorm = 0.8060, lr_0 = 9.9949e-04
Validation binary_cross_entropy = 0.066773
Epoch 1279
Loss = 2.2742e-02, PNorm = 279.5826, GNorm = 1.0446, lr_0 = 9.9949e-04
Loss = 1.2207e-02, PNorm = 279.6477, GNorm = 0.2402, lr_0 = 9.9949e-04
Validation binary_cross_entropy = 0.071554
Epoch 1280
Loss = 2.0793e-02, PNorm = 279.7134, GNorm = 0.9664, lr_0 = 9.9949e-04
Validation binary_cross_entropy = 0.073999
Epoch 1281
Loss = 2.6715e-02, PNorm = 279.7953, GNorm = 0.1633, lr_0 = 9.9949e-04
Validation binary_cross_entropy = 0.058068
Epoch 1282
Loss = 3.8201e-02, PNorm = 279.9175, GNorm = 0.0839, lr_0 = 9.9949e-04
Validation binary_cross_entropy = 0.070357
Epoch 1283
Loss = 2.1504e-02, PNorm = 280.0187, GNorm = 1.0142, lr_0 = 9.9949e-04
Validation binary_cross_entropy = 0.059270
Epoch 1284
Loss = 2.2057e-02, PNorm = 280.1095, GNorm = 0.1153, lr_0 = 9.9949e-04
Validation binary_cross_entropy = 0.055878
Epoch 1285
Loss = 6.5267e-02, PNorm = 280.1846, GNorm = 0.6468, lr_0 = 9.9949e-04
Validation binary_cross_entropy = 0.057435
Epoch 1286
Loss = 4.5316e-02, PNorm = 280.2726, GNorm = 0.8544, lr_0 = 9.9949e-04
Validation binary_cross_entropy = 0.058297
Epoch 1287
Loss = 6.1755e-03, PNorm = 280.3724, GNorm = 0.3965, lr_0 = 9.9949e-04
Validation binary_cross_entropy = 0.101247
Epoch 1288
Loss = 1.3099e-02, PNorm = 280.4563, GNorm = 0.3160, lr_0 = 9.9949e-04
Validation binary_cross_entropy = 0.062757
Epoch 1289
Loss = 3.9216e-03, PNorm = 280.5177, GNorm = 0.2836, lr_0 = 9.9948e-04
Loss = 5.4836e-03, PNorm = 280.5805, GNorm = 0.2051, lr_0 = 9.9948e-04
Validation binary_cross_entropy = 0.074376
Epoch 1290
Loss = 2.3500e-02, PNorm = 280.6803, GNorm = 0.6973, lr_0 = 9.9948e-04
Validation binary_cross_entropy = 0.057995
Epoch 1291
Loss = 2.8350e-02, PNorm = 280.8278, GNorm = 2.6904, lr_0 = 9.9948e-04
Validation binary_cross_entropy = 0.167222
Epoch 1292
Loss = 9.5981e-02, PNorm = 281.0043, GNorm = 1.7029, lr_0 = 9.9948e-04
Validation binary_cross_entropy = 0.063991
Epoch 1293
Loss = 4.6349e-02, PNorm = 281.2654, GNorm = 0.8272, lr_0 = 9.9948e-04
Validation binary_cross_entropy = 0.068999
Epoch 1294
Loss = 4.4969e-02, PNorm = 281.4999, GNorm = 0.2894, lr_0 = 9.9948e-04
Validation binary_cross_entropy = 0.095308
Epoch 1295
Loss = 4.4046e-02, PNorm = 281.6845, GNorm = 1.2475, lr_0 = 9.9948e-04
Validation binary_cross_entropy = 0.060564
Epoch 1296
Loss = 2.6216e-02, PNorm = 281.8302, GNorm = 1.7397, lr_0 = 9.9948e-04
Validation binary_cross_entropy = 0.083484
Epoch 1297
Loss = 2.0795e-03, PNorm = 281.9438, GNorm = 0.0351, lr_0 = 9.9948e-04
Validation binary_cross_entropy = 0.092396
Epoch 1298
Loss = 1.8358e-02, PNorm = 282.0212, GNorm = 2.1307, lr_0 = 9.9948e-04
Validation binary_cross_entropy = 0.077023
Epoch 1299
Loss = 9.5415e-04, PNorm = 282.0861, GNorm = 0.0599, lr_0 = 9.9948e-04
Loss = 4.2002e-03, PNorm = 282.1414, GNorm = 0.1141, lr_0 = 9.9948e-04
Validation binary_cross_entropy = 0.081023
Epoch 1300
Loss = 1.0892e-02, PNorm = 282.1990, GNorm = 1.3173, lr_0 = 9.9948e-04
Validation binary_cross_entropy = 0.102271
Epoch 1301
Loss = 8.4472e-03, PNorm = 282.2578, GNorm = 1.0345, lr_0 = 9.9947e-04
Validation binary_cross_entropy = 0.172339
Epoch 1302
Loss = 4.5693e-02, PNorm = 282.3208, GNorm = 1.2757, lr_0 = 9.9947e-04
Validation binary_cross_entropy = 0.138975
Epoch 1303
Loss = 1.5798e-01, PNorm = 282.4002, GNorm = 3.4423, lr_0 = 9.9947e-04
Validation binary_cross_entropy = 0.061941
Epoch 1304
Loss = 4.7692e-02, PNorm = 282.5835, GNorm = 0.5278, lr_0 = 9.9947e-04
Validation binary_cross_entropy = 0.075647
Epoch 1305
Loss = 3.7042e-02, PNorm = 282.7894, GNorm = 0.5159, lr_0 = 9.9947e-04
Validation binary_cross_entropy = 0.075056
Epoch 1306
Loss = 3.7821e-02, PNorm = 282.9566, GNorm = 0.8292, lr_0 = 9.9947e-04
Validation binary_cross_entropy = 0.042948
Epoch 1307
Loss = 6.5589e-03, PNorm = 283.1096, GNorm = 0.1469, lr_0 = 9.9947e-04
Validation binary_cross_entropy = 0.084080
Epoch 1308
Loss = 3.9367e-03, PNorm = 283.2263, GNorm = 0.0642, lr_0 = 9.9947e-04
Validation binary_cross_entropy = 0.058396
Epoch 1309
Loss = 3.0531e-03, PNorm = 283.3012, GNorm = 0.1477, lr_0 = 9.9947e-04
Loss = 4.7468e-03, PNorm = 283.3733, GNorm = 0.0606, lr_0 = 9.9947e-04
Validation binary_cross_entropy = 0.058780
Epoch 1310
Loss = 3.8100e-02, PNorm = 283.4462, GNorm = 0.0812, lr_0 = 9.9947e-04
Validation binary_cross_entropy = 0.054943
Epoch 1311
Loss = 1.2298e-02, PNorm = 283.5430, GNorm = 0.4683, lr_0 = 9.9947e-04
Validation binary_cross_entropy = 0.056472
Epoch 1312
Loss = 4.3114e-02, PNorm = 283.6336, GNorm = 0.4785, lr_0 = 9.9947e-04
Validation binary_cross_entropy = 0.054288
Epoch 1313
Loss = 2.3399e-02, PNorm = 283.7384, GNorm = 0.0455, lr_0 = 9.9947e-04
Validation binary_cross_entropy = 0.053922
Epoch 1314
Loss = 3.3142e-02, PNorm = 283.8448, GNorm = 1.0322, lr_0 = 9.9947e-04
Validation binary_cross_entropy = 0.043624
Epoch 1315
Loss = 1.2855e-02, PNorm = 283.9544, GNorm = 0.1957, lr_0 = 9.9946e-04
Validation binary_cross_entropy = 0.050073
Epoch 1316
Loss = 1.7008e-02, PNorm = 284.0886, GNorm = 0.1645, lr_0 = 9.9946e-04
Validation binary_cross_entropy = 0.115138
Epoch 1317
Loss = 1.0372e-01, PNorm = 284.2255, GNorm = 0.2020, lr_0 = 9.9946e-04
Validation binary_cross_entropy = 0.053195
Epoch 1318
Loss = 2.2075e-02, PNorm = 284.3697, GNorm = 0.1486, lr_0 = 9.9946e-04
Validation binary_cross_entropy = 0.048668
Epoch 1319
Loss = 2.6600e-02, PNorm = 284.4993, GNorm = 0.7017, lr_0 = 9.9946e-04
Loss = 4.8536e-02, PNorm = 284.6239, GNorm = 1.0051, lr_0 = 9.9946e-04
Validation binary_cross_entropy = 0.083629
Epoch 1320
Loss = 3.9796e-02, PNorm = 284.7605, GNorm = 0.5589, lr_0 = 9.9946e-04
Validation binary_cross_entropy = 0.041483
Epoch 1321
Loss = 3.2528e-02, PNorm = 284.9075, GNorm = 1.6441, lr_0 = 9.9946e-04
Validation binary_cross_entropy = 0.088228
Epoch 1322
Loss = 3.5393e-02, PNorm = 285.0385, GNorm = 0.8395, lr_0 = 9.9946e-04
Validation binary_cross_entropy = 0.062326
Epoch 1323
Loss = 3.1353e-02, PNorm = 285.1943, GNorm = 0.8812, lr_0 = 9.9946e-04
Validation binary_cross_entropy = 0.040618
Epoch 1324
Loss = 2.3309e-02, PNorm = 285.3607, GNorm = 0.3304, lr_0 = 9.9946e-04
Validation binary_cross_entropy = 0.044051
Epoch 1325
Loss = 4.4796e-02, PNorm = 285.5037, GNorm = 0.6877, lr_0 = 9.9946e-04
Validation binary_cross_entropy = 0.041986
Epoch 1326
Loss = 2.2330e-02, PNorm = 285.6621, GNorm = 0.7771, lr_0 = 9.9946e-04
Validation binary_cross_entropy = 0.051158
Epoch 1327
Loss = 8.6857e-03, PNorm = 285.7937, GNorm = 0.0723, lr_0 = 9.9946e-04
Validation binary_cross_entropy = 0.048484
Epoch 1328
Loss = 4.6093e-03, PNorm = 285.8903, GNorm = 0.1226, lr_0 = 9.9945e-04
Validation binary_cross_entropy = 0.048813
Epoch 1329
Loss = 2.3999e-02, PNorm = 285.9901, GNorm = 0.6733, lr_0 = 9.9945e-04
Loss = 2.7257e-02, PNorm = 286.0966, GNorm = 0.0102, lr_0 = 9.9945e-04
Validation binary_cross_entropy = 0.062513
Epoch 1330
Loss = 3.0999e-02, PNorm = 286.1780, GNorm = 0.2503, lr_0 = 9.9945e-04
Validation binary_cross_entropy = 0.054371
Epoch 1331
Loss = 2.2158e-02, PNorm = 286.2532, GNorm = 0.1659, lr_0 = 9.9945e-04
Validation binary_cross_entropy = 0.056564
Epoch 1332
Loss = 9.3456e-03, PNorm = 286.3323, GNorm = 0.3430, lr_0 = 9.9945e-04
Validation binary_cross_entropy = 0.068748
Epoch 1333
Loss = 1.1303e-02, PNorm = 286.4058, GNorm = 0.0781, lr_0 = 9.9945e-04
Validation binary_cross_entropy = 0.061940
Epoch 1334
Loss = 3.3571e-02, PNorm = 286.4697, GNorm = 0.0427, lr_0 = 9.9945e-04
Validation binary_cross_entropy = 0.058176
Epoch 1335
Loss = 7.4146e-03, PNorm = 286.5779, GNorm = 0.1446, lr_0 = 9.9945e-04
Validation binary_cross_entropy = 0.058044
Epoch 1336
Loss = 9.4869e-03, PNorm = 286.7128, GNorm = 0.4131, lr_0 = 9.9945e-04
Validation binary_cross_entropy = 0.057848
Epoch 1337
Loss = 1.0455e-02, PNorm = 286.8680, GNorm = 0.0803, lr_0 = 9.9945e-04
Validation binary_cross_entropy = 0.065479
Epoch 1338
Loss = 6.5848e-02, PNorm = 287.0161, GNorm = 1.7686, lr_0 = 9.9945e-04
Validation binary_cross_entropy = 0.055252
Epoch 1339
Loss = 4.6640e-02, PNorm = 287.1394, GNorm = 1.7904, lr_0 = 9.9945e-04
Loss = 3.4282e-02, PNorm = 287.2867, GNorm = 0.2107, lr_0 = 9.9945e-04
Validation binary_cross_entropy = 0.057345
Epoch 1340
Loss = 2.8878e-02, PNorm = 287.4249, GNorm = 0.5074, lr_0 = 9.9945e-04
Validation binary_cross_entropy = 0.058756
Epoch 1341
Loss = 5.7018e-02, PNorm = 287.5498, GNorm = 0.8704, lr_0 = 9.9944e-04
Validation binary_cross_entropy = 0.056794
Epoch 1342
Loss = 1.6040e-02, PNorm = 287.6824, GNorm = 0.2512, lr_0 = 9.9944e-04
Validation binary_cross_entropy = 0.080154
Epoch 1343
Loss = 2.4422e-02, PNorm = 287.7903, GNorm = 0.2555, lr_0 = 9.9944e-04
Validation binary_cross_entropy = 0.091353
Epoch 1344
Loss = 5.3029e-02, PNorm = 287.9085, GNorm = 0.9617, lr_0 = 9.9944e-04
Validation binary_cross_entropy = 0.082921
Epoch 1345
Loss = 2.1090e-02, PNorm = 288.0060, GNorm = 1.9468, lr_0 = 9.9944e-04
Validation binary_cross_entropy = 0.044111
Epoch 1346
Loss = 4.2946e-02, PNorm = 288.1002, GNorm = 0.7988, lr_0 = 9.9944e-04
Validation binary_cross_entropy = 0.069982
Epoch 1347
Loss = 1.3431e-02, PNorm = 288.2551, GNorm = 0.4328, lr_0 = 9.9944e-04
Validation binary_cross_entropy = 0.061353
Epoch 1348
Loss = 4.7154e-03, PNorm = 288.3907, GNorm = 0.5816, lr_0 = 9.9944e-04
Validation binary_cross_entropy = 0.150624
Epoch 1349
Loss = 1.1192e-03, PNorm = 288.5003, GNorm = 0.2878, lr_0 = 9.9944e-04
Loss = 3.6420e-02, PNorm = 288.5915, GNorm = 0.8747, lr_0 = 9.9944e-04
Validation binary_cross_entropy = 0.067296
Epoch 1350
Loss = 3.9348e-02, PNorm = 288.7108, GNorm = 1.6844, lr_0 = 9.9944e-04
Validation binary_cross_entropy = 0.042850
Epoch 1351
Loss = 5.3444e-02, PNorm = 288.9120, GNorm = 1.2945, lr_0 = 9.9944e-04
Validation binary_cross_entropy = 0.060104
Epoch 1352
Loss = 2.8266e-02, PNorm = 289.1203, GNorm = 0.1835, lr_0 = 9.9944e-04
Validation binary_cross_entropy = 0.041521
Epoch 1353
Loss = 1.8085e-02, PNorm = 289.3031, GNorm = 0.1033, lr_0 = 9.9944e-04
Validation binary_cross_entropy = 0.055830
Epoch 1354
Loss = 3.5967e-02, PNorm = 289.4552, GNorm = 0.7202, lr_0 = 9.9943e-04
Validation binary_cross_entropy = 0.041219
Epoch 1355
Loss = 1.2907e-02, PNorm = 289.6002, GNorm = 0.1175, lr_0 = 9.9943e-04
Validation binary_cross_entropy = 0.063256
Epoch 1356
Loss = 7.3008e-03, PNorm = 289.7129, GNorm = 0.1240, lr_0 = 9.9943e-04
Validation binary_cross_entropy = 0.043072
Epoch 1357
Loss = 8.4336e-03, PNorm = 289.8377, GNorm = 0.1071, lr_0 = 9.9943e-04
Validation binary_cross_entropy = 0.100298
Epoch 1358
Loss = 1.0911e-01, PNorm = 289.9731, GNorm = 0.0102, lr_0 = 9.9943e-04
Validation binary_cross_entropy = 0.042646
Epoch 1359
Loss = 3.3058e-02, PNorm = 290.1307, GNorm = 0.9111, lr_0 = 9.9943e-04
Loss = 2.2953e-02, PNorm = 290.3431, GNorm = 0.0414, lr_0 = 9.9943e-04
Validation binary_cross_entropy = 0.079410
Epoch 1360
Loss = 4.5363e-02, PNorm = 290.4809, GNorm = 0.2105, lr_0 = 9.9943e-04
Validation binary_cross_entropy = 0.042436
Epoch 1361
Loss = 3.1993e-02, PNorm = 290.6219, GNorm = 0.0878, lr_0 = 9.9943e-04
Validation binary_cross_entropy = 0.075244
Epoch 1362
Loss = 3.0152e-02, PNorm = 290.7418, GNorm = 0.0417, lr_0 = 9.9943e-04
Validation binary_cross_entropy = 0.068775
Epoch 1363
Loss = 3.7921e-02, PNorm = 290.8517, GNorm = 1.2609, lr_0 = 9.9943e-04
Validation binary_cross_entropy = 0.047152
Epoch 1364
Loss = 1.3163e-02, PNorm = 290.9846, GNorm = 0.0482, lr_0 = 9.9943e-04
Validation binary_cross_entropy = 0.097023
Epoch 1365
Loss = 6.2458e-02, PNorm = 291.0744, GNorm = 2.0624, lr_0 = 9.9943e-04
Validation binary_cross_entropy = 0.051150
Epoch 1366
Loss = 2.0895e-02, PNorm = 291.1793, GNorm = 1.7477, lr_0 = 9.9943e-04
Validation binary_cross_entropy = 0.091713
Epoch 1367
Loss = 7.0307e-02, PNorm = 291.2981, GNorm = 2.5164, lr_0 = 9.9943e-04
Validation binary_cross_entropy = 0.110321
Epoch 1368
Loss = 1.5759e-02, PNorm = 291.4000, GNorm = 0.8407, lr_0 = 9.9942e-04
Validation binary_cross_entropy = 0.078004
Epoch 1369
Loss = 1.6155e-02, PNorm = 291.5061, GNorm = 0.8435, lr_0 = 9.9942e-04
Loss = 3.7177e-02, PNorm = 291.6012, GNorm = 2.2026, lr_0 = 9.9942e-04
Validation binary_cross_entropy = 0.154034
Epoch 1370
Loss = 1.1359e-01, PNorm = 291.7052, GNorm = 3.3264, lr_0 = 9.9942e-04
Validation binary_cross_entropy = 0.063120
Epoch 1371
Loss = 6.2045e-02, PNorm = 291.8977, GNorm = 1.8916, lr_0 = 9.9942e-04
Validation binary_cross_entropy = 0.069265
Epoch 1372
Loss = 3.4372e-02, PNorm = 292.1121, GNorm = 0.7959, lr_0 = 9.9942e-04
Validation binary_cross_entropy = 0.072799
Epoch 1373
Loss = 2.3731e-02, PNorm = 292.2878, GNorm = 0.7406, lr_0 = 9.9942e-04
Validation binary_cross_entropy = 0.069647
Epoch 1374
Loss = 3.4992e-02, PNorm = 292.4315, GNorm = 1.0383, lr_0 = 9.9942e-04
Validation binary_cross_entropy = 0.089648
Epoch 1375
Loss = 4.9656e-02, PNorm = 292.5843, GNorm = 0.6169, lr_0 = 9.9942e-04
Validation binary_cross_entropy = 0.069250
Epoch 1376
Loss = 2.8969e-02, PNorm = 292.7497, GNorm = 1.3477, lr_0 = 9.9942e-04
Validation binary_cross_entropy = 0.076263
Epoch 1377
Loss = 1.2510e-02, PNorm = 292.9341, GNorm = 0.4665, lr_0 = 9.9942e-04
Validation binary_cross_entropy = 0.079946
Epoch 1378
Loss = 9.5126e-04, PNorm = 293.0864, GNorm = 0.0330, lr_0 = 9.9942e-04
Validation binary_cross_entropy = 0.090730
Epoch 1379
Loss = 1.8127e-02, PNorm = 293.2022, GNorm = 1.7431, lr_0 = 9.9942e-04
Loss = 2.5646e-02, PNorm = 293.3204, GNorm = 2.5579, lr_0 = 9.9942e-04
Validation binary_cross_entropy = 0.133298
Epoch 1380
Loss = 6.5281e-02, PNorm = 293.5125, GNorm = 2.2588, lr_0 = 9.9941e-04
Validation binary_cross_entropy = 0.050323
Epoch 1381
Loss = 5.3516e-02, PNorm = 293.7575, GNorm = 0.5839, lr_0 = 9.9941e-04
Validation binary_cross_entropy = 0.083504
Epoch 1382
Loss = 4.7105e-02, PNorm = 293.9855, GNorm = 0.6767, lr_0 = 9.9941e-04
Validation binary_cross_entropy = 0.068606
Epoch 1383
Loss = 2.4857e-02, PNorm = 294.1662, GNorm = 0.0930, lr_0 = 9.9941e-04
Validation binary_cross_entropy = 0.081620
Epoch 1384
Loss = 3.8145e-02, PNorm = 294.2804, GNorm = 0.2752, lr_0 = 9.9941e-04
Validation binary_cross_entropy = 0.064687
Epoch 1385
Loss = 3.3040e-02, PNorm = 294.4002, GNorm = 0.1130, lr_0 = 9.9941e-04
Validation binary_cross_entropy = 0.076117
Epoch 1386
Loss = 4.5261e-02, PNorm = 294.5220, GNorm = 1.6987, lr_0 = 9.9941e-04
Validation binary_cross_entropy = 0.069101
Epoch 1387
Loss = 5.4268e-02, PNorm = 294.6460, GNorm = 1.1359, lr_0 = 9.9941e-04
Validation binary_cross_entropy = 0.064439
Epoch 1388
Loss = 5.6302e-03, PNorm = 294.7745, GNorm = 0.1187, lr_0 = 9.9941e-04
Validation binary_cross_entropy = 0.093026
Epoch 1389
Loss = 1.6371e-03, PNorm = 294.9277, GNorm = 0.0552, lr_0 = 9.9941e-04
Loss = 3.4759e-02, PNorm = 295.0966, GNorm = 1.6488, lr_0 = 9.9941e-04
Validation binary_cross_entropy = 0.061291
Epoch 1390
Loss = 3.8832e-02, PNorm = 295.2681, GNorm = 0.0448, lr_0 = 9.9941e-04
Validation binary_cross_entropy = 0.094204
Epoch 1391
Loss = 4.3946e-02, PNorm = 295.4266, GNorm = 0.1036, lr_0 = 9.9941e-04
Validation binary_cross_entropy = 0.055591
Epoch 1392
Loss = 4.1877e-02, PNorm = 295.5751, GNorm = 1.0170, lr_0 = 9.9941e-04
Validation binary_cross_entropy = 0.063212
Epoch 1393
Loss = 3.4136e-02, PNorm = 295.7207, GNorm = 0.5024, lr_0 = 9.9941e-04
Validation binary_cross_entropy = 0.062057
Epoch 1394
Loss = 3.5634e-03, PNorm = 295.8370, GNorm = 0.2196, lr_0 = 9.9940e-04
Validation binary_cross_entropy = 0.065209
Epoch 1395
Loss = 7.2886e-03, PNorm = 295.9368, GNorm = 1.0733, lr_0 = 9.9940e-04
Validation binary_cross_entropy = 0.060061
Epoch 1396
Loss = 6.5730e-03, PNorm = 296.0553, GNorm = 0.1021, lr_0 = 9.9940e-04
Validation binary_cross_entropy = 0.059080
Epoch 1397
Loss = 9.2330e-03, PNorm = 296.1778, GNorm = 0.4522, lr_0 = 9.9940e-04
Validation binary_cross_entropy = 0.054267
Epoch 1398
Loss = 1.4323e-02, PNorm = 296.3599, GNorm = 0.3418, lr_0 = 9.9940e-04
Validation binary_cross_entropy = 0.084402
Epoch 1399
Loss = 4.5724e-02, PNorm = 296.5213, GNorm = 3.2900, lr_0 = 9.9940e-04
Loss = 3.9092e-02, PNorm = 296.6908, GNorm = 0.0870, lr_0 = 9.9940e-04
Validation binary_cross_entropy = 0.062322
Epoch 1400
Loss = 3.5742e-02, PNorm = 296.8632, GNorm = 2.2147, lr_0 = 9.9940e-04
Validation binary_cross_entropy = 0.099933
Epoch 1401
Loss = 2.4106e-02, PNorm = 297.0034, GNorm = 0.4547, lr_0 = 9.9940e-04
Validation binary_cross_entropy = 0.062274
Epoch 1402
Loss = 3.4848e-02, PNorm = 297.1477, GNorm = 0.4936, lr_0 = 9.9940e-04
Validation binary_cross_entropy = 0.064330
Epoch 1403
Loss = 1.8212e-02, PNorm = 297.2769, GNorm = 1.5561, lr_0 = 9.9940e-04
Validation binary_cross_entropy = 0.099289
Epoch 1404
Loss = 5.0326e-02, PNorm = 297.4001, GNorm = 0.0503, lr_0 = 9.9940e-04
Validation binary_cross_entropy = 0.059102
Epoch 1405
Loss = 4.4315e-02, PNorm = 297.5342, GNorm = 1.6156, lr_0 = 9.9940e-04
Validation binary_cross_entropy = 0.103735
Epoch 1406
Loss = 2.3975e-02, PNorm = 297.6818, GNorm = 1.1428, lr_0 = 9.9940e-04
Validation binary_cross_entropy = 0.102282
Epoch 1407
Loss = 4.9847e-02, PNorm = 297.8165, GNorm = 0.4235, lr_0 = 9.9939e-04
Validation binary_cross_entropy = 0.074156
Epoch 1408
Loss = 1.4758e-02, PNorm = 297.9667, GNorm = 0.9570, lr_0 = 9.9939e-04
Validation binary_cross_entropy = 0.084033
Epoch 1409
Loss = 4.1624e-03, PNorm = 298.1202, GNorm = 0.1835, lr_0 = 9.9939e-04
Loss = 2.4483e-02, PNorm = 298.2351, GNorm = 1.6208, lr_0 = 9.9939e-04
Validation binary_cross_entropy = 0.082318
Epoch 1410
Loss = 5.6581e-02, PNorm = 298.3354, GNorm = 1.7461, lr_0 = 9.9939e-04
Validation binary_cross_entropy = 0.099114
Epoch 1411
Loss = 3.2901e-02, PNorm = 298.4681, GNorm = 0.5820, lr_0 = 9.9939e-04
Validation binary_cross_entropy = 0.068028
Epoch 1412
Loss = 2.0330e-02, PNorm = 298.6115, GNorm = 1.0895, lr_0 = 9.9939e-04
Validation binary_cross_entropy = 0.057015
Epoch 1413
Loss = 1.8469e-02, PNorm = 298.7347, GNorm = 0.3424, lr_0 = 9.9939e-04
Validation binary_cross_entropy = 0.071247
Epoch 1414
Loss = 6.6608e-02, PNorm = 298.8821, GNorm = 2.2920, lr_0 = 9.9939e-04
Validation binary_cross_entropy = 0.082201
Epoch 1415
Loss = 1.2522e-02, PNorm = 299.0676, GNorm = 1.1396, lr_0 = 9.9939e-04
Validation binary_cross_entropy = 0.086040
Epoch 1416
Loss = 6.3190e-02, PNorm = 299.2182, GNorm = 3.6502, lr_0 = 9.9939e-04
Validation binary_cross_entropy = 0.048058
Epoch 1417
Loss = 4.1622e-02, PNorm = 299.3550, GNorm = 0.3555, lr_0 = 9.9939e-04
Validation binary_cross_entropy = 0.058794
Epoch 1418
Loss = 2.4064e-02, PNorm = 299.4761, GNorm = 0.2417, lr_0 = 9.9939e-04
Validation binary_cross_entropy = 0.060680
Epoch 1419
Loss = 5.3695e-03, PNorm = 299.5896, GNorm = 0.1547, lr_0 = 9.9939e-04
Loss = 2.7595e-02, PNorm = 299.7193, GNorm = 1.9245, lr_0 = 9.9939e-04
Validation binary_cross_entropy = 0.053902
Epoch 1420
Loss = 2.2899e-02, PNorm = 299.8552, GNorm = 0.6736, lr_0 = 9.9938e-04
Validation binary_cross_entropy = 0.063697
Epoch 1421
Loss = 2.5147e-02, PNorm = 299.9702, GNorm = 0.2969, lr_0 = 9.9938e-04
Validation binary_cross_entropy = 0.068642
Epoch 1422
Loss = 6.7970e-02, PNorm = 300.0838, GNorm = 0.2633, lr_0 = 9.9938e-04
Validation binary_cross_entropy = 0.043947
Epoch 1423
Loss = 2.9323e-02, PNorm = 300.2445, GNorm = 1.0473, lr_0 = 9.9938e-04
Validation binary_cross_entropy = 0.070565
Epoch 1424
Loss = 4.4409e-02, PNorm = 300.3681, GNorm = 1.8338, lr_0 = 9.9938e-04
Validation binary_cross_entropy = 0.053462
Epoch 1425
Loss = 3.1123e-02, PNorm = 300.4860, GNorm = 0.2281, lr_0 = 9.9938e-04
Validation binary_cross_entropy = 0.068574
Epoch 1426
Loss = 2.3097e-02, PNorm = 300.6179, GNorm = 0.9905, lr_0 = 9.9938e-04
Validation binary_cross_entropy = 0.068418
Epoch 1427
Loss = 9.3645e-03, PNorm = 300.7137, GNorm = 0.0241, lr_0 = 9.9938e-04
Validation binary_cross_entropy = 0.064568
Epoch 1428
Loss = 1.4491e-02, PNorm = 300.8035, GNorm = 0.1133, lr_0 = 9.9938e-04
Validation binary_cross_entropy = 0.062606
Epoch 1429
Loss = 1.8779e-01, PNorm = 300.9262, GNorm = 2.2990, lr_0 = 9.9938e-04
Loss = 4.2559e-02, PNorm = 301.0583, GNorm = 2.1937, lr_0 = 9.9938e-04
Validation binary_cross_entropy = 0.060698
Epoch 1430
Loss = 8.1307e-03, PNorm = 301.1965, GNorm = 0.0309, lr_0 = 9.9938e-04
Validation binary_cross_entropy = 0.112232
Epoch 1431
Loss = 2.7338e-02, PNorm = 301.3103, GNorm = 1.7286, lr_0 = 9.9938e-04
Validation binary_cross_entropy = 0.044874
Epoch 1432
Loss = 4.2123e-02, PNorm = 301.4405, GNorm = 1.4766, lr_0 = 9.9938e-04
Validation binary_cross_entropy = 0.072888
Epoch 1433
Loss = 1.7064e-02, PNorm = 301.5807, GNorm = 0.0332, lr_0 = 9.9937e-04
Validation binary_cross_entropy = 0.046200
Epoch 1434
Loss = 1.9272e-02, PNorm = 301.7106, GNorm = 0.1000, lr_0 = 9.9937e-04
Validation binary_cross_entropy = 0.048837
Epoch 1435
Loss = 8.7663e-03, PNorm = 301.8341, GNorm = 0.3988, lr_0 = 9.9937e-04
Validation binary_cross_entropy = 0.070053
Epoch 1436
Loss = 4.9553e-02, PNorm = 301.9462, GNorm = 0.0900, lr_0 = 9.9937e-04
Validation binary_cross_entropy = 0.066376
Epoch 1437
Loss = 6.2998e-02, PNorm = 302.0860, GNorm = 1.0967, lr_0 = 9.9937e-04
Validation binary_cross_entropy = 0.046937
Epoch 1438
Loss = 5.2530e-02, PNorm = 302.2476, GNorm = 1.4080, lr_0 = 9.9937e-04
Validation binary_cross_entropy = 0.069686
Epoch 1439
Loss = 1.2624e-02, PNorm = 302.4267, GNorm = 0.3395, lr_0 = 9.9937e-04
Loss = 3.2552e-02, PNorm = 302.5615, GNorm = 1.3044, lr_0 = 9.9937e-04
Validation binary_cross_entropy = 0.055048
Epoch 1440
Loss = 2.8301e-02, PNorm = 302.6974, GNorm = 0.7849, lr_0 = 9.9937e-04
Validation binary_cross_entropy = 0.058953
Epoch 1441
Loss = 3.4014e-02, PNorm = 302.8287, GNorm = 0.7782, lr_0 = 9.9937e-04
Validation binary_cross_entropy = 0.056231
Epoch 1442
Loss = 1.6755e-02, PNorm = 302.9799, GNorm = 1.6004, lr_0 = 9.9937e-04
Validation binary_cross_entropy = 0.155473
Epoch 1443
Loss = 1.8957e-02, PNorm = 303.0998, GNorm = 0.1632, lr_0 = 9.9937e-04
Validation binary_cross_entropy = 0.072096
Epoch 1444
Loss = 5.0501e-02, PNorm = 303.2668, GNorm = 0.0640, lr_0 = 9.9937e-04
Validation binary_cross_entropy = 0.066482
Epoch 1445
Loss = 4.3904e-02, PNorm = 303.4223, GNorm = 3.6999, lr_0 = 9.9937e-04
Validation binary_cross_entropy = 0.059519
Epoch 1446
Loss = 6.3610e-02, PNorm = 303.6124, GNorm = 1.4022, lr_0 = 9.9937e-04
Validation binary_cross_entropy = 0.041705
Epoch 1447
Loss = 3.2868e-02, PNorm = 303.7838, GNorm = 0.3003, lr_0 = 9.9936e-04
Validation binary_cross_entropy = 0.082512
Epoch 1448
Loss = 4.0323e-02, PNorm = 303.9505, GNorm = 0.0025, lr_0 = 9.9936e-04
Validation binary_cross_entropy = 0.057742
Epoch 1449
Loss = 2.6134e-03, PNorm = 304.0588, GNorm = 0.0986, lr_0 = 9.9936e-04
Loss = 2.9322e-02, PNorm = 304.1701, GNorm = 1.2523, lr_0 = 9.9936e-04
Validation binary_cross_entropy = 0.049586
Epoch 1450
Loss = 1.8421e-02, PNorm = 304.2996, GNorm = 2.2822, lr_0 = 9.9936e-04
Validation binary_cross_entropy = 0.090463
Epoch 1451
Loss = 2.3213e-02, PNorm = 304.4057, GNorm = 0.5500, lr_0 = 9.9936e-04
Validation binary_cross_entropy = 0.060088
Epoch 1452
Loss = 1.0551e-02, PNorm = 304.4922, GNorm = 0.0723, lr_0 = 9.9936e-04
Validation binary_cross_entropy = 0.056979
Epoch 1453
Loss = 4.1895e-02, PNorm = 304.5757, GNorm = 0.0552, lr_0 = 9.9936e-04
Validation binary_cross_entropy = 0.069022
Epoch 1454
Loss = 3.3340e-02, PNorm = 304.6778, GNorm = 0.2089, lr_0 = 9.9936e-04
Validation binary_cross_entropy = 0.051845
Epoch 1455
Loss = 3.6144e-02, PNorm = 304.7804, GNorm = 0.4192, lr_0 = 9.9936e-04
Validation binary_cross_entropy = 0.054895
Epoch 1456
Loss = 2.2439e-02, PNorm = 304.8726, GNorm = 1.1876, lr_0 = 9.9936e-04
Validation binary_cross_entropy = 0.055713
Epoch 1457
Loss = 4.0992e-02, PNorm = 304.9428, GNorm = 0.0941, lr_0 = 9.9936e-04
Validation binary_cross_entropy = 0.051265
Epoch 1458
Loss = 2.4767e-03, PNorm = 305.0197, GNorm = 0.1013, lr_0 = 9.9936e-04
Validation binary_cross_entropy = 0.055779
Epoch 1459
Loss = 6.1797e-03, PNorm = 305.1055, GNorm = 0.6168, lr_0 = 9.9936e-04
Loss = 1.2743e-02, PNorm = 305.1947, GNorm = 1.4188, lr_0 = 9.9936e-04
Validation binary_cross_entropy = 0.143328
Epoch 1460
Loss = 5.0063e-02, PNorm = 305.3039, GNorm = 0.3073, lr_0 = 9.9935e-04
Validation binary_cross_entropy = 0.079561
Epoch 1461
Loss = 2.6759e-02, PNorm = 305.4959, GNorm = 1.3541, lr_0 = 9.9935e-04
Validation binary_cross_entropy = 0.130888
Epoch 1462
Loss = 4.1473e-02, PNorm = 305.6407, GNorm = 0.5174, lr_0 = 9.9935e-04
Validation binary_cross_entropy = 0.126311
Epoch 1463
Loss = 3.4075e-02, PNorm = 305.7338, GNorm = 0.9900, lr_0 = 9.9935e-04
Validation binary_cross_entropy = 0.109328
Epoch 1464
Loss = 2.4746e-02, PNorm = 305.8244, GNorm = 0.4497, lr_0 = 9.9935e-04
Validation binary_cross_entropy = 0.064280
Epoch 1465
Loss = 1.9977e-02, PNorm = 305.9261, GNorm = 1.3511, lr_0 = 9.9935e-04
Validation binary_cross_entropy = 0.063053
Epoch 1466
Loss = 4.7176e-02, PNorm = 306.0227, GNorm = 1.9144, lr_0 = 9.9935e-04
Validation binary_cross_entropy = 0.050947
Epoch 1467
Loss = 1.7332e-02, PNorm = 306.1146, GNorm = 0.3489, lr_0 = 9.9935e-04
Validation binary_cross_entropy = 0.058190
Epoch 1468
Loss = 7.4294e-03, PNorm = 306.2015, GNorm = 0.0831, lr_0 = 9.9935e-04
Validation binary_cross_entropy = 0.069064
Epoch 1469
Loss = 1.1799e-02, PNorm = 306.2929, GNorm = 0.6358, lr_0 = 9.9935e-04
Loss = 2.6597e-02, PNorm = 306.3752, GNorm = 0.0262, lr_0 = 9.9935e-04
Validation binary_cross_entropy = 0.068031
Epoch 1470
Loss = 5.0868e-02, PNorm = 306.4307, GNorm = 0.2002, lr_0 = 9.9935e-04
Validation binary_cross_entropy = 0.055943
Epoch 1471
Loss = 2.9405e-02, PNorm = 306.4762, GNorm = 0.9805, lr_0 = 9.9935e-04
Validation binary_cross_entropy = 0.045336
Epoch 1472
Loss = 3.3111e-02, PNorm = 306.5514, GNorm = 0.5780, lr_0 = 9.9935e-04
Validation binary_cross_entropy = 0.052899
Epoch 1473
Loss = 5.7876e-02, PNorm = 306.6352, GNorm = 0.2183, lr_0 = 9.9934e-04
Validation binary_cross_entropy = 0.047395
Epoch 1474
Loss = 3.1797e-02, PNorm = 306.7250, GNorm = 0.1591, lr_0 = 9.9934e-04
Validation binary_cross_entropy = 0.069395
Epoch 1475
Loss = 5.6681e-02, PNorm = 306.8087, GNorm = 0.8762, lr_0 = 9.9934e-04
Validation binary_cross_entropy = 0.058884
Epoch 1476
Loss = 3.6957e-02, PNorm = 306.8862, GNorm = 0.4321, lr_0 = 9.9934e-04
Validation binary_cross_entropy = 0.062318
Epoch 1477
Loss = 2.4235e-02, PNorm = 306.9651, GNorm = 0.0934, lr_0 = 9.9934e-04
Validation binary_cross_entropy = 0.070561
Epoch 1478
Loss = 2.1515e-02, PNorm = 307.0263, GNorm = 0.0465, lr_0 = 9.9934e-04
Validation binary_cross_entropy = 0.052073
Epoch 1479
Loss = 1.0246e-02, PNorm = 307.0909, GNorm = 0.3719, lr_0 = 9.9934e-04
Loss = 2.4971e-02, PNorm = 307.1944, GNorm = 0.7286, lr_0 = 9.9934e-04
Validation binary_cross_entropy = 0.052818
Epoch 1480
Loss = 1.4328e-02, PNorm = 307.2896, GNorm = 0.0563, lr_0 = 9.9934e-04
Validation binary_cross_entropy = 0.057139
Epoch 1481
Loss = 3.7169e-02, PNorm = 307.3789, GNorm = 0.5436, lr_0 = 9.9934e-04
Validation binary_cross_entropy = 0.056737
Epoch 1482
Loss = 2.3748e-02, PNorm = 307.5123, GNorm = 0.0531, lr_0 = 9.9934e-04
Validation binary_cross_entropy = 0.052497
Epoch 1483
Loss = 1.2014e-02, PNorm = 307.6159, GNorm = 0.3366, lr_0 = 9.9934e-04
Validation binary_cross_entropy = 0.049353
Epoch 1484
Loss = 2.7668e-02, PNorm = 307.7268, GNorm = 0.8576, lr_0 = 9.9934e-04
Validation binary_cross_entropy = 0.053273
Epoch 1485
Loss = 2.1906e-02, PNorm = 307.8502, GNorm = 0.2795, lr_0 = 9.9934e-04
Validation binary_cross_entropy = 0.068649
Epoch 1486
Loss = 8.5610e-02, PNorm = 307.9363, GNorm = 1.4099, lr_0 = 9.9934e-04
Validation binary_cross_entropy = 0.046811
Epoch 1487
Loss = 9.7408e-03, PNorm = 308.0074, GNorm = 0.3964, lr_0 = 9.9933e-04
Validation binary_cross_entropy = 0.055418
Epoch 1488
Loss = 2.9683e-03, PNorm = 308.0772, GNorm = 0.0764, lr_0 = 9.9933e-04
Validation binary_cross_entropy = 0.057925
Epoch 1489
Loss = 1.4167e-02, PNorm = 308.1383, GNorm = 1.4407, lr_0 = 9.9933e-04
Loss = 2.1581e-02, PNorm = 308.2042, GNorm = 0.2844, lr_0 = 9.9933e-04
Validation binary_cross_entropy = 0.053884
Epoch 1490
Loss = 3.8599e-02, PNorm = 308.2918, GNorm = 0.5118, lr_0 = 9.9933e-04
Validation binary_cross_entropy = 0.068658
Epoch 1491
Loss = 2.0336e-02, PNorm = 308.3943, GNorm = 0.5102, lr_0 = 9.9933e-04
Validation binary_cross_entropy = 0.053684
Epoch 1492
Loss = 2.8149e-02, PNorm = 308.5039, GNorm = 1.6673, lr_0 = 9.9933e-04
Validation binary_cross_entropy = 0.091589
Epoch 1493
Loss = 5.3146e-02, PNorm = 308.6388, GNorm = 1.5536, lr_0 = 9.9933e-04
Validation binary_cross_entropy = 0.051518
Epoch 1494
Loss = 2.6437e-02, PNorm = 308.7570, GNorm = 0.9238, lr_0 = 9.9933e-04
Validation binary_cross_entropy = 0.067692
Epoch 1495
Loss = 2.0401e-02, PNorm = 308.8610, GNorm = 0.3562, lr_0 = 9.9933e-04
Validation binary_cross_entropy = 0.059472
Epoch 1496
Loss = 5.8995e-02, PNorm = 308.9513, GNorm = 0.1463, lr_0 = 9.9933e-04
Validation binary_cross_entropy = 0.055605
Epoch 1497
Loss = 4.2568e-02, PNorm = 309.0368, GNorm = 1.2408, lr_0 = 9.9933e-04
Validation binary_cross_entropy = 0.064782
Epoch 1498
Loss = 2.8855e-02, PNorm = 309.1093, GNorm = 1.2804, lr_0 = 9.9933e-04
Validation binary_cross_entropy = 0.049703
Epoch 1499
Loss = 1.0572e-02, PNorm = 309.1924, GNorm = 0.4487, lr_0 = 9.9933e-04
Loss = 1.0656e-02, PNorm = 309.2788, GNorm = 0.3135, lr_0 = 9.9932e-04
Validation binary_cross_entropy = 0.071333
Epoch 1500
Loss = 2.4688e-02, PNorm = 309.3532, GNorm = 0.0205, lr_0 = 9.9932e-04
Validation binary_cross_entropy = 0.064260
Epoch 1501
Loss = 2.0922e-02, PNorm = 309.4575, GNorm = 0.9041, lr_0 = 9.9932e-04
Validation binary_cross_entropy = 0.069090
Epoch 1502
Loss = 3.8010e-02, PNorm = 309.5859, GNorm = 0.0267, lr_0 = 9.9932e-04
Validation binary_cross_entropy = 0.062380
Epoch 1503
Loss = 2.0436e-02, PNorm = 309.6863, GNorm = 0.1116, lr_0 = 9.9932e-04
Validation binary_cross_entropy = 0.053669
Epoch 1504
Loss = 2.3525e-02, PNorm = 309.7891, GNorm = 1.3897, lr_0 = 9.9932e-04
Validation binary_cross_entropy = 0.050853
Epoch 1505
Loss = 1.6224e-02, PNorm = 309.8681, GNorm = 0.2923, lr_0 = 9.9932e-04
Validation binary_cross_entropy = 0.061060
Epoch 1506
Loss = 3.3506e-02, PNorm = 309.9611, GNorm = 2.1291, lr_0 = 9.9932e-04
Validation binary_cross_entropy = 0.044448
Epoch 1507
Loss = 1.4198e-02, PNorm = 310.0727, GNorm = 0.6133, lr_0 = 9.9932e-04
Validation binary_cross_entropy = 0.076302
Epoch 1508
Loss = 3.3969e-03, PNorm = 310.2126, GNorm = 0.3338, lr_0 = 9.9932e-04
Validation binary_cross_entropy = 0.090537
Epoch 1509
Loss = 6.2536e-04, PNorm = 310.3493, GNorm = 0.0272, lr_0 = 9.9932e-04
Loss = 8.0260e-02, PNorm = 310.4762, GNorm = 0.9290, lr_0 = 9.9932e-04
Validation binary_cross_entropy = 0.045911
Epoch 1510
Loss = 7.5224e-02, PNorm = 310.6369, GNorm = 0.6490, lr_0 = 9.9932e-04
Validation binary_cross_entropy = 0.072427
Epoch 1511
Loss = 3.3584e-02, PNorm = 310.8234, GNorm = 1.1146, lr_0 = 9.9932e-04
Validation binary_cross_entropy = 0.042875
Epoch 1512
Loss = 2.0456e-02, PNorm = 310.9599, GNorm = 0.0522, lr_0 = 9.9932e-04
Validation binary_cross_entropy = 0.065474
Epoch 1513
Loss = 1.2654e-02, PNorm = 311.0983, GNorm = 0.1002, lr_0 = 9.9931e-04
Validation binary_cross_entropy = 0.053670
Epoch 1514
Loss = 2.9603e-02, PNorm = 311.2184, GNorm = 1.1111, lr_0 = 9.9931e-04
Validation binary_cross_entropy = 0.053469
Epoch 1515
Loss = 8.0060e-03, PNorm = 311.3476, GNorm = 0.2055, lr_0 = 9.9931e-04
Validation binary_cross_entropy = 0.068241
Epoch 1516
Loss = 3.0223e-02, PNorm = 311.4694, GNorm = 1.0326, lr_0 = 9.9931e-04
Validation binary_cross_entropy = 0.050566
Epoch 1517
Loss = 6.1552e-02, PNorm = 311.5628, GNorm = 1.6911, lr_0 = 9.9931e-04
Validation binary_cross_entropy = 0.058863
Epoch 1518
Loss = 7.4095e-02, PNorm = 311.6840, GNorm = 1.1041, lr_0 = 9.9931e-04
Validation binary_cross_entropy = 0.066808
Epoch 1519
Loss = 3.1771e-03, PNorm = 311.8006, GNorm = 0.0941, lr_0 = 9.9931e-04
Loss = 1.9497e-02, PNorm = 311.8950, GNorm = 0.0386, lr_0 = 9.9931e-04
Validation binary_cross_entropy = 0.068425
Epoch 1520
Loss = 3.2443e-02, PNorm = 312.0314, GNorm = 1.3247, lr_0 = 9.9931e-04
Validation binary_cross_entropy = 0.043519
Epoch 1521
Loss = 5.1754e-02, PNorm = 312.2062, GNorm = 0.7196, lr_0 = 9.9931e-04
Validation binary_cross_entropy = 0.138266
Epoch 1522
Loss = 1.0743e-02, PNorm = 312.3701, GNorm = 0.0845, lr_0 = 9.9931e-04
Validation binary_cross_entropy = 0.098102
Epoch 1523
Loss = 2.7244e-02, PNorm = 312.4858, GNorm = 1.3414, lr_0 = 9.9931e-04
Validation binary_cross_entropy = 0.099934
Epoch 1524
Loss = 3.7680e-02, PNorm = 312.5967, GNorm = 1.0526, lr_0 = 9.9931e-04
Validation binary_cross_entropy = 0.035889
Epoch 1525
Loss = 2.3948e-02, PNorm = 312.7360, GNorm = 0.4084, lr_0 = 9.9931e-04
Validation binary_cross_entropy = 0.040252
Epoch 1526
Loss = 3.0827e-02, PNorm = 312.8887, GNorm = 1.2849, lr_0 = 9.9930e-04
Validation binary_cross_entropy = 0.048769
Epoch 1527
Loss = 2.0108e-02, PNorm = 312.9963, GNorm = 0.0543, lr_0 = 9.9930e-04
Validation binary_cross_entropy = 0.041997
Epoch 1528
Loss = 7.0129e-03, PNorm = 313.0760, GNorm = 0.3642, lr_0 = 9.9930e-04
Validation binary_cross_entropy = 0.059761
Epoch 1529
Loss = 3.5403e-03, PNorm = 313.1390, GNorm = 0.1596, lr_0 = 9.9930e-04
Loss = 2.4893e-02, PNorm = 313.2032, GNorm = 0.0163, lr_0 = 9.9930e-04
Validation binary_cross_entropy = 0.045755
Epoch 1530
Loss = 2.6646e-02, PNorm = 313.2904, GNorm = 1.8749, lr_0 = 9.9930e-04
Validation binary_cross_entropy = 0.058487
Epoch 1531
Loss = 9.7705e-03, PNorm = 313.3794, GNorm = 0.0416, lr_0 = 9.9930e-04
Validation binary_cross_entropy = 0.059345
Epoch 1532
Loss = 4.0551e-02, PNorm = 313.4712, GNorm = 0.2802, lr_0 = 9.9930e-04
Validation binary_cross_entropy = 0.040034
Epoch 1533
Loss = 1.8665e-02, PNorm = 313.5880, GNorm = 0.6574, lr_0 = 9.9930e-04
Validation binary_cross_entropy = 0.078234
Epoch 1534
Loss = 8.0278e-03, PNorm = 313.6932, GNorm = 0.4791, lr_0 = 9.9930e-04
Validation binary_cross_entropy = 0.068395
Epoch 1535
Loss = 1.2953e-03, PNorm = 313.7771, GNorm = 0.0135, lr_0 = 9.9930e-04
Validation binary_cross_entropy = 0.074470
Epoch 1536
Loss = 5.2450e-02, PNorm = 313.8455, GNorm = 6.1334, lr_0 = 9.9930e-04
Validation binary_cross_entropy = 0.042139
Epoch 1537
Loss = 2.7439e-02, PNorm = 313.9992, GNorm = 0.7210, lr_0 = 9.9930e-04
Validation binary_cross_entropy = 0.067349
Epoch 1538
Loss = 1.8266e-02, PNorm = 314.2341, GNorm = 0.1183, lr_0 = 9.9930e-04
Validation binary_cross_entropy = 0.040854
Epoch 1539
Loss = 2.1874e-02, PNorm = 314.4130, GNorm = 0.7835, lr_0 = 9.9930e-04
Loss = 3.0800e-02, PNorm = 314.5635, GNorm = 0.3617, lr_0 = 9.9929e-04
Validation binary_cross_entropy = 0.063233
Epoch 1540
Loss = 3.1659e-02, PNorm = 314.7030, GNorm = 0.5758, lr_0 = 9.9929e-04
Validation binary_cross_entropy = 0.045418
Epoch 1541
Loss = 3.3998e-02, PNorm = 314.8362, GNorm = 1.0735, lr_0 = 9.9929e-04
Validation binary_cross_entropy = 0.048563
Epoch 1542
Loss = 3.3827e-02, PNorm = 314.9751, GNorm = 0.7312, lr_0 = 9.9929e-04
Validation binary_cross_entropy = 0.053147
Epoch 1543
Loss = 2.6245e-02, PNorm = 315.1305, GNorm = 1.1188, lr_0 = 9.9929e-04
Validation binary_cross_entropy = 0.044540
Epoch 1544
Loss = 1.6831e-02, PNorm = 315.2871, GNorm = 0.9477, lr_0 = 9.9929e-04
Validation binary_cross_entropy = 0.056715
Epoch 1545
Loss = 3.2066e-03, PNorm = 315.4115, GNorm = 0.2065, lr_0 = 9.9929e-04
Validation binary_cross_entropy = 0.105144
Epoch 1546
Loss = 5.1104e-02, PNorm = 315.4800, GNorm = 3.1399, lr_0 = 9.9929e-04
Validation binary_cross_entropy = 0.039454
Epoch 1547
Loss = 1.0993e-01, PNorm = 315.5197, GNorm = 1.6530, lr_0 = 9.9929e-04
Validation binary_cross_entropy = 0.056357
Epoch 1548
Loss = 1.9292e-02, PNorm = 315.6627, GNorm = 0.5306, lr_0 = 9.9929e-04
Validation binary_cross_entropy = 0.053095
Epoch 1549
Loss = 2.7604e-03, PNorm = 315.8318, GNorm = 0.1071, lr_0 = 9.9929e-04
Loss = 1.5295e-02, PNorm = 315.9878, GNorm = 0.9468, lr_0 = 9.9929e-04
Validation binary_cross_entropy = 0.073444
Epoch 1550
Loss = 2.9335e-02, PNorm = 316.1039, GNorm = 0.0573, lr_0 = 9.9929e-04
Validation binary_cross_entropy = 0.076121
Epoch 1551
Loss = 2.3231e-02, PNorm = 316.2411, GNorm = 0.1667, lr_0 = 9.9929e-04
Validation binary_cross_entropy = 0.097772
Epoch 1552
Loss = 1.8075e-02, PNorm = 316.3843, GNorm = 0.3721, lr_0 = 9.9928e-04
Validation binary_cross_entropy = 0.061269
Epoch 1553
Loss = 6.5661e-02, PNorm = 316.5250, GNorm = 3.0611, lr_0 = 9.9928e-04
Validation binary_cross_entropy = 0.051743
Epoch 1554
Loss = 4.2050e-02, PNorm = 316.6717, GNorm = 1.0355, lr_0 = 9.9928e-04
Validation binary_cross_entropy = 0.040930
Epoch 1555
Loss = 1.9444e-02, PNorm = 316.8107, GNorm = 0.2616, lr_0 = 9.9928e-04
Validation binary_cross_entropy = 0.063591
Epoch 1556
Loss = 2.3262e-02, PNorm = 316.9415, GNorm = 1.0536, lr_0 = 9.9928e-04
Validation binary_cross_entropy = 0.049319
Epoch 1557
Loss = 3.7394e-03, PNorm = 317.0457, GNorm = 0.1335, lr_0 = 9.9928e-04
Validation binary_cross_entropy = 0.063431
Epoch 1558
Loss = 1.1861e-03, PNorm = 317.1369, GNorm = 0.0492, lr_0 = 9.9928e-04
Validation binary_cross_entropy = 0.083805
Epoch 1559
Loss = 1.3695e-04, PNorm = 317.2038, GNorm = 0.0064, lr_0 = 9.9928e-04
Loss = 1.1371e-02, PNorm = 317.2675, GNorm = 0.2970, lr_0 = 9.9928e-04
Validation binary_cross_entropy = 0.065878
Epoch 1560
Loss = 6.3349e-02, PNorm = 317.3744, GNorm = 1.7178, lr_0 = 9.9928e-04
Validation binary_cross_entropy = 0.066188
Epoch 1561
Loss = 1.9109e-02, PNorm = 317.5136, GNorm = 1.5275, lr_0 = 9.9928e-04
Validation binary_cross_entropy = 0.046973
Epoch 1562
Loss = 1.5802e-02, PNorm = 317.6341, GNorm = 0.2480, lr_0 = 9.9928e-04
Validation binary_cross_entropy = 0.078714
Epoch 1563
Loss = 2.5680e-02, PNorm = 317.7358, GNorm = 0.8328, lr_0 = 9.9928e-04
Validation binary_cross_entropy = 0.043527
Epoch 1564
Loss = 1.6460e-02, PNorm = 317.8603, GNorm = 0.0544, lr_0 = 9.9928e-04
Validation binary_cross_entropy = 0.078508
Epoch 1565
Loss = 1.7951e-02, PNorm = 317.9668, GNorm = 0.8484, lr_0 = 9.9928e-04
Validation binary_cross_entropy = 0.060407
Epoch 1566
Loss = 6.0814e-02, PNorm = 318.0353, GNorm = 0.2508, lr_0 = 9.9927e-04
Validation binary_cross_entropy = 0.054370
Epoch 1567
Loss = 3.9685e-02, PNorm = 318.1068, GNorm = 0.3315, lr_0 = 9.9927e-04
Validation binary_cross_entropy = 0.048024
Epoch 1568
Loss = 1.5131e-02, PNorm = 318.2121, GNorm = 0.5949, lr_0 = 9.9927e-04
Validation binary_cross_entropy = 0.054797
Epoch 1569
Loss = 3.9259e-03, PNorm = 318.3372, GNorm = 0.1528, lr_0 = 9.9927e-04
Loss = 1.0698e-02, PNorm = 318.4396, GNorm = 0.1122, lr_0 = 9.9927e-04
Validation binary_cross_entropy = 0.055417
Epoch 1570
Loss = 3.4817e-02, PNorm = 318.5281, GNorm = 0.1189, lr_0 = 9.9927e-04
Validation binary_cross_entropy = 0.068414
Epoch 1571
Loss = 4.1348e-02, PNorm = 318.6170, GNorm = 0.2023, lr_0 = 9.9927e-04
Validation binary_cross_entropy = 0.049515
Epoch 1572
Loss = 1.6533e-02, PNorm = 318.7274, GNorm = 1.1701, lr_0 = 9.9927e-04
Validation binary_cross_entropy = 0.066012
Epoch 1573
Loss = 2.3313e-02, PNorm = 318.8231, GNorm = 0.1336, lr_0 = 9.9927e-04
Validation binary_cross_entropy = 0.070151
Epoch 1574
Loss = 1.7644e-02, PNorm = 318.9206, GNorm = 0.1402, lr_0 = 9.9927e-04
Validation binary_cross_entropy = 0.052620
Epoch 1575
Loss = 1.6187e-02, PNorm = 319.0377, GNorm = 1.9364, lr_0 = 9.9927e-04
Validation binary_cross_entropy = 0.123980
Epoch 1576
Loss = 5.7395e-02, PNorm = 319.1448, GNorm = 0.6918, lr_0 = 9.9927e-04
Validation binary_cross_entropy = 0.045497
Epoch 1577
Loss = 5.4621e-02, PNorm = 319.2285, GNorm = 1.0730, lr_0 = 9.9927e-04
Validation binary_cross_entropy = 0.049296
Epoch 1578
Loss = 1.7491e-02, PNorm = 319.3631, GNorm = 0.5777, lr_0 = 9.9927e-04
Validation binary_cross_entropy = 0.080197
Epoch 1579
Loss = 2.8111e-02, PNorm = 319.4804, GNorm = 1.3742, lr_0 = 9.9926e-04
Loss = 4.2216e-03, PNorm = 319.5751, GNorm = 0.0309, lr_0 = 9.9926e-04
Validation binary_cross_entropy = 0.067133
Epoch 1580
Loss = 2.1334e-02, PNorm = 319.6494, GNorm = 0.0857, lr_0 = 9.9926e-04
Validation binary_cross_entropy = 0.057973
Epoch 1581
Loss = 3.9062e-02, PNorm = 319.7450, GNorm = 0.8949, lr_0 = 9.9926e-04
Validation binary_cross_entropy = 0.151772
Epoch 1582
Loss = 2.8446e-02, PNorm = 319.8531, GNorm = 0.1696, lr_0 = 9.9926e-04
Validation binary_cross_entropy = 0.050546
Epoch 1583
Loss = 2.9287e-02, PNorm = 319.9882, GNorm = 0.6994, lr_0 = 9.9926e-04
Validation binary_cross_entropy = 0.071710
Epoch 1584
Loss = 8.2151e-03, PNorm = 320.1162, GNorm = 0.1478, lr_0 = 9.9926e-04
Validation binary_cross_entropy = 0.062396
Epoch 1585
Loss = 1.2137e-02, PNorm = 320.2154, GNorm = 1.4023, lr_0 = 9.9926e-04
Validation binary_cross_entropy = 0.055885
Epoch 1586
Loss = 1.6129e-02, PNorm = 320.3145, GNorm = 0.0729, lr_0 = 9.9926e-04
Validation binary_cross_entropy = 0.073471
Epoch 1587
Loss = 7.2648e-03, PNorm = 320.3933, GNorm = 0.3846, lr_0 = 9.9926e-04
Validation binary_cross_entropy = 0.076081
Epoch 1588
Loss = 2.1896e-03, PNorm = 320.4677, GNorm = 0.0845, lr_0 = 9.9926e-04
Validation binary_cross_entropy = 0.068715
Epoch 1589
Loss = 1.7017e-03, PNorm = 320.5448, GNorm = 0.1229, lr_0 = 9.9926e-04
Loss = 4.8821e-03, PNorm = 320.6347, GNorm = 0.0813, lr_0 = 9.9926e-04
Validation binary_cross_entropy = 0.078418
Epoch 1590
Loss = 3.3844e-02, PNorm = 320.6991, GNorm = 0.4390, lr_0 = 9.9926e-04
Validation binary_cross_entropy = 0.063307
Epoch 1591
Loss = 1.7571e-02, PNorm = 320.7641, GNorm = 1.5270, lr_0 = 9.9926e-04
Validation binary_cross_entropy = 0.049068
Epoch 1592
Loss = 4.9273e-02, PNorm = 320.8620, GNorm = 0.3808, lr_0 = 9.9925e-04
Validation binary_cross_entropy = 0.086596
Epoch 1593
Loss = 1.8015e-02, PNorm = 321.0009, GNorm = 0.0745, lr_0 = 9.9925e-04
Validation binary_cross_entropy = 0.045765
Epoch 1594
Loss = 3.9588e-02, PNorm = 321.1472, GNorm = 1.0979, lr_0 = 9.9925e-04
Validation binary_cross_entropy = 0.047730
Epoch 1595
Loss = 1.9459e-02, PNorm = 321.2780, GNorm = 0.4745, lr_0 = 9.9925e-04
Validation binary_cross_entropy = 0.058524
Epoch 1596
Loss = 3.6497e-02, PNorm = 321.3834, GNorm = 0.0737, lr_0 = 9.9925e-04
Validation binary_cross_entropy = 0.048481
Epoch 1597
Loss = 1.1107e-02, PNorm = 321.4861, GNorm = 0.1892, lr_0 = 9.9925e-04
Validation binary_cross_entropy = 0.050343
Epoch 1598
Loss = 5.6699e-02, PNorm = 321.6053, GNorm = 1.2821, lr_0 = 9.9925e-04
Validation binary_cross_entropy = 0.042200
Epoch 1599
Loss = 1.0085e-01, PNorm = 321.7228, GNorm = 1.0691, lr_0 = 9.9925e-04
Loss = 3.6556e-03, PNorm = 321.8327, GNorm = 0.0169, lr_0 = 9.9925e-04
Validation binary_cross_entropy = 0.059726
Epoch 1600
Loss = 3.3203e-03, PNorm = 321.9183, GNorm = 0.0052, lr_0 = 9.9925e-04
Validation binary_cross_entropy = 0.063785
Epoch 1601
Loss = 1.9613e-02, PNorm = 322.0096, GNorm = 2.1999, lr_0 = 9.9925e-04
Validation binary_cross_entropy = 0.118130
Epoch 1602
Loss = 5.0229e-02, PNorm = 322.1270, GNorm = 0.0685, lr_0 = 9.9925e-04
Validation binary_cross_entropy = 0.052145
Epoch 1603
Loss = 5.2732e-02, PNorm = 322.2616, GNorm = 0.4571, lr_0 = 9.9925e-04
Validation binary_cross_entropy = 0.051871
Epoch 1604
Loss = 1.6832e-02, PNorm = 322.4032, GNorm = 0.9722, lr_0 = 9.9925e-04
Validation binary_cross_entropy = 0.068019
Epoch 1605
Loss = 2.7705e-02, PNorm = 322.5214, GNorm = 0.0784, lr_0 = 9.9924e-04
Validation binary_cross_entropy = 0.060708
Epoch 1606
Loss = 1.9252e-02, PNorm = 322.6215, GNorm = 2.8162, lr_0 = 9.9924e-04
Validation binary_cross_entropy = 0.050101
Epoch 1607
Loss = 6.9487e-02, PNorm = 322.7827, GNorm = 0.3793, lr_0 = 9.9924e-04
Validation binary_cross_entropy = 0.057626
Epoch 1608
Loss = 4.0121e-02, PNorm = 322.9729, GNorm = 0.7991, lr_0 = 9.9924e-04
Validation binary_cross_entropy = 0.059623
Epoch 1609
Loss = 2.5236e-02, PNorm = 323.1283, GNorm = 1.3367, lr_0 = 9.9924e-04
Loss = 4.1807e-03, PNorm = 323.2389, GNorm = 0.3744, lr_0 = 9.9924e-04
Validation binary_cross_entropy = 0.140899
Epoch 1610
Loss = 1.8854e-02, PNorm = 323.3103, GNorm = 1.3416, lr_0 = 9.9924e-04
Validation binary_cross_entropy = 0.067015
Epoch 1611
Loss = 5.4491e-02, PNorm = 323.4254, GNorm = 1.3200, lr_0 = 9.9924e-04
Validation binary_cross_entropy = 0.064487
Epoch 1612
Loss = 3.8873e-02, PNorm = 323.5741, GNorm = 1.4744, lr_0 = 9.9924e-04
Validation binary_cross_entropy = 0.155618
Epoch 1613
Loss = 4.4830e-02, PNorm = 323.7095, GNorm = 0.4434, lr_0 = 9.9924e-04
Validation binary_cross_entropy = 0.074139
Epoch 1614
Loss = 3.9579e-02, PNorm = 323.8537, GNorm = 1.1688, lr_0 = 9.9924e-04
Validation binary_cross_entropy = 0.130791
Epoch 1615
Loss = 2.9118e-02, PNorm = 323.9954, GNorm = 2.3602, lr_0 = 9.9924e-04
Validation binary_cross_entropy = 0.081642
Epoch 1616
Loss = 6.7943e-02, PNorm = 324.1424, GNorm = 4.7142, lr_0 = 9.9924e-04
Validation binary_cross_entropy = 0.115227
Epoch 1617
Loss = 1.1594e-02, PNorm = 324.3329, GNorm = 0.1407, lr_0 = 9.9924e-04
Validation binary_cross_entropy = 0.142347
Epoch 1618
Loss = 7.7723e-02, PNorm = 324.4798, GNorm = 1.5645, lr_0 = 9.9924e-04
Validation binary_cross_entropy = 0.102517
Epoch 1619
Loss = 9.4854e-03, PNorm = 324.5743, GNorm = 0.5298, lr_0 = 9.9923e-04
Loss = 8.5981e-03, PNorm = 324.6630, GNorm = 0.0385, lr_0 = 9.9923e-04
Validation binary_cross_entropy = 0.125027
Epoch 1620
Loss = 7.7711e-03, PNorm = 324.7323, GNorm = 0.4553, lr_0 = 9.9923e-04
Validation binary_cross_entropy = 0.143577
Epoch 1621
Loss = 4.1911e-02, PNorm = 324.7926, GNorm = 0.0683, lr_0 = 9.9923e-04
Validation binary_cross_entropy = 0.073039
Epoch 1622
Loss = 1.5015e-02, PNorm = 324.8694, GNorm = 1.8695, lr_0 = 9.9923e-04
Validation binary_cross_entropy = 0.106760
Epoch 1623
Loss = 1.1887e-02, PNorm = 324.9550, GNorm = 0.2698, lr_0 = 9.9923e-04
Validation binary_cross_entropy = 0.072116
Epoch 1624
Loss = 6.3700e-02, PNorm = 325.0616, GNorm = 3.8742, lr_0 = 9.9923e-04
Validation binary_cross_entropy = 0.055576
Epoch 1625
Loss = 3.5087e-02, PNorm = 325.2397, GNorm = 0.3110, lr_0 = 9.9923e-04
Validation binary_cross_entropy = 0.091298
Epoch 1626
Loss = 3.6767e-02, PNorm = 325.4205, GNorm = 0.1988, lr_0 = 9.9923e-04
Validation binary_cross_entropy = 0.065750
Epoch 1627
Loss = 4.0568e-03, PNorm = 325.5566, GNorm = 0.2825, lr_0 = 9.9923e-04
Validation binary_cross_entropy = 0.053019
Epoch 1628
Loss = 2.0704e-02, PNorm = 325.6650, GNorm = 1.1446, lr_0 = 9.9923e-04
Validation binary_cross_entropy = 0.100492
Epoch 1629
Loss = 3.8504e-04, PNorm = 325.7749, GNorm = 0.0182, lr_0 = 9.9923e-04
Loss = 2.6466e-02, PNorm = 325.8479, GNorm = 2.1795, lr_0 = 9.9923e-04
Validation binary_cross_entropy = 0.090749
Epoch 1630
Loss = 3.4073e-02, PNorm = 325.9695, GNorm = 0.2437, lr_0 = 9.9923e-04
Validation binary_cross_entropy = 0.136768
Epoch 1631
Loss = 3.4239e-02, PNorm = 326.1181, GNorm = 1.0747, lr_0 = 9.9922e-04
Validation binary_cross_entropy = 0.074178
Epoch 1632
Loss = 4.8133e-02, PNorm = 326.2929, GNorm = 0.7590, lr_0 = 9.9922e-04
Validation binary_cross_entropy = 0.122373
Epoch 1633
Loss = 3.4664e-02, PNorm = 326.4528, GNorm = 0.2482, lr_0 = 9.9922e-04
Validation binary_cross_entropy = 0.096798
Epoch 1634
Loss = 1.2643e-02, PNorm = 326.6203, GNorm = 0.1973, lr_0 = 9.9922e-04
Validation binary_cross_entropy = 0.156855
Epoch 1635
Loss = 1.7614e-02, PNorm = 326.7377, GNorm = 0.1102, lr_0 = 9.9922e-04
Validation binary_cross_entropy = 0.078668
Epoch 1636
Loss = 1.1501e-02, PNorm = 326.8775, GNorm = 0.1631, lr_0 = 9.9922e-04
Validation binary_cross_entropy = 0.171528
Epoch 1637
Loss = 6.2921e-02, PNorm = 327.0428, GNorm = 2.6205, lr_0 = 9.9922e-04
Validation binary_cross_entropy = 0.057450
Epoch 1638
Loss = 7.0597e-02, PNorm = 327.1899, GNorm = 0.4977, lr_0 = 9.9922e-04
Validation binary_cross_entropy = 0.052797
Epoch 1639
Loss = 2.0253e-02, PNorm = 327.3831, GNorm = 0.6178, lr_0 = 9.9922e-04
Loss = 2.1599e-02, PNorm = 327.5347, GNorm = 0.6208, lr_0 = 9.9922e-04
Validation binary_cross_entropy = 0.053966
Epoch 1640
Loss = 2.1803e-02, PNorm = 327.6510, GNorm = 0.2197, lr_0 = 9.9922e-04
Validation binary_cross_entropy = 0.083882
Epoch 1641
Loss = 4.6609e-02, PNorm = 327.7501, GNorm = 0.1845, lr_0 = 9.9922e-04
Validation binary_cross_entropy = 0.070938
Epoch 1642
Loss = 3.0754e-02, PNorm = 327.8832, GNorm = 1.2100, lr_0 = 9.9922e-04
Validation binary_cross_entropy = 0.069591
Epoch 1643
Loss = 6.7267e-02, PNorm = 328.0223, GNorm = 2.0554, lr_0 = 9.9922e-04
Validation binary_cross_entropy = 0.065765
Epoch 1644
Loss = 6.4461e-03, PNorm = 328.1772, GNorm = 0.1810, lr_0 = 9.9922e-04
Validation binary_cross_entropy = 0.063177
Epoch 1645
Loss = 1.7894e-02, PNorm = 328.3101, GNorm = 0.5097, lr_0 = 9.9921e-04
Validation binary_cross_entropy = 0.039936
Epoch 1646
Loss = 4.9408e-02, PNorm = 328.4568, GNorm = 0.6070, lr_0 = 9.9921e-04
Validation binary_cross_entropy = 0.048013
Epoch 1647
Loss = 5.9857e-02, PNorm = 328.6065, GNorm = 0.1889, lr_0 = 9.9921e-04
Validation binary_cross_entropy = 0.047998
Epoch 1648
Loss = 1.3700e-02, PNorm = 328.7325, GNorm = 0.2605, lr_0 = 9.9921e-04
Validation binary_cross_entropy = 0.107548
Epoch 1649
Loss = 6.4875e-03, PNorm = 328.8507, GNorm = 0.3650, lr_0 = 9.9921e-04
Loss = 2.5418e-02, PNorm = 328.9369, GNorm = 0.3579, lr_0 = 9.9921e-04
Validation binary_cross_entropy = 0.050221
Epoch 1650
Loss = 8.9281e-03, PNorm = 329.0585, GNorm = 0.1948, lr_0 = 9.9921e-04
Validation binary_cross_entropy = 0.084270
Epoch 1651
Loss = 2.3458e-02, PNorm = 329.1717, GNorm = 0.0328, lr_0 = 9.9921e-04
Validation binary_cross_entropy = 0.060062
Epoch 1652
Loss = 3.1588e-02, PNorm = 329.2662, GNorm = 2.5855, lr_0 = 9.9921e-04
Validation binary_cross_entropy = 0.070610
Epoch 1653
Loss = 5.9302e-02, PNorm = 329.3610, GNorm = 0.5895, lr_0 = 9.9921e-04
Validation binary_cross_entropy = 0.043335
Epoch 1654
Loss = 1.3035e-02, PNorm = 329.4846, GNorm = 0.5895, lr_0 = 9.9921e-04
Validation binary_cross_entropy = 0.050120
Epoch 1655
Loss = 7.6192e-03, PNorm = 329.5999, GNorm = 0.0298, lr_0 = 9.9921e-04
Validation binary_cross_entropy = 0.057293
Epoch 1656
Loss = 9.3297e-03, PNorm = 329.6922, GNorm = 0.0978, lr_0 = 9.9921e-04
Validation binary_cross_entropy = 0.042960
Epoch 1657
Loss = 3.7065e-02, PNorm = 329.7936, GNorm = 1.1020, lr_0 = 9.9921e-04
Validation binary_cross_entropy = 0.056325
Epoch 1658
Loss = 1.0023e-02, PNorm = 329.9363, GNorm = 0.4251, lr_0 = 9.9921e-04
Validation binary_cross_entropy = 0.053918
Epoch 1659
Loss = 3.2419e-03, PNorm = 330.0433, GNorm = 0.1581, lr_0 = 9.9920e-04
Loss = 1.0823e-02, PNorm = 330.1156, GNorm = 0.2071, lr_0 = 9.9920e-04
Validation binary_cross_entropy = 0.055027
Epoch 1660
Loss = 1.2753e-02, PNorm = 330.2113, GNorm = 0.1256, lr_0 = 9.9920e-04
Validation binary_cross_entropy = 0.065769
Epoch 1661
Loss = 1.8735e-02, PNorm = 330.3051, GNorm = 0.0630, lr_0 = 9.9920e-04
Validation binary_cross_entropy = 0.067779
Epoch 1662
Loss = 4.0755e-02, PNorm = 330.4045, GNorm = 0.0520, lr_0 = 9.9920e-04
Validation binary_cross_entropy = 0.043512
Epoch 1663
Loss = 2.2821e-02, PNorm = 330.5470, GNorm = 0.1636, lr_0 = 9.9920e-04
Validation binary_cross_entropy = 0.092252
Epoch 1664
Loss = 5.3490e-02, PNorm = 330.6696, GNorm = 0.3677, lr_0 = 9.9920e-04
Validation binary_cross_entropy = 0.052392
Epoch 1665
Loss = 3.1757e-02, PNorm = 330.8040, GNorm = 0.2908, lr_0 = 9.9920e-04
Validation binary_cross_entropy = 0.060532
Epoch 1666
Loss = 4.4086e-02, PNorm = 330.9093, GNorm = 0.9949, lr_0 = 9.9920e-04
Validation binary_cross_entropy = 0.069169
Epoch 1667
Loss = 3.7373e-02, PNorm = 331.0107, GNorm = 0.0608, lr_0 = 9.9920e-04
Validation binary_cross_entropy = 0.073114
Epoch 1668
Loss = 8.2861e-03, PNorm = 331.0960, GNorm = 0.2227, lr_0 = 9.9920e-04
Validation binary_cross_entropy = 0.086944
Epoch 1669
Loss = 7.8501e-04, PNorm = 331.1719, GNorm = 0.0372, lr_0 = 9.9920e-04
Loss = 4.5091e-02, PNorm = 331.2448, GNorm = 1.0359, lr_0 = 9.9920e-04
Validation binary_cross_entropy = 0.059584
Epoch 1670
Loss = 5.3560e-02, PNorm = 331.3501, GNorm = 1.5666, lr_0 = 9.9920e-04
Validation binary_cross_entropy = 0.071599
Epoch 1671
Loss = 1.8221e-02, PNorm = 331.4673, GNorm = 2.2092, lr_0 = 9.9919e-04
Validation binary_cross_entropy = 0.088769
Epoch 1672
Loss = 1.4837e-02, PNorm = 331.5598, GNorm = 0.3210, lr_0 = 9.9919e-04
Validation binary_cross_entropy = 0.062697
Epoch 1673
Loss = 3.2478e-02, PNorm = 331.6418, GNorm = 1.0657, lr_0 = 9.9919e-04
Validation binary_cross_entropy = 0.051076
Epoch 1674
Loss = 1.8654e-02, PNorm = 331.7571, GNorm = 0.6265, lr_0 = 9.9919e-04
Validation binary_cross_entropy = 0.062481
Epoch 1675
Loss = 5.7450e-02, PNorm = 331.8677, GNorm = 0.1289, lr_0 = 9.9919e-04
Validation binary_cross_entropy = 0.052817
Epoch 1676
Loss = 6.5730e-02, PNorm = 331.9875, GNorm = 2.5104, lr_0 = 9.9919e-04
Validation binary_cross_entropy = 0.051437
Epoch 1677
Loss = 6.0245e-03, PNorm = 332.1107, GNorm = 0.1140, lr_0 = 9.9919e-04
Validation binary_cross_entropy = 0.064877
Epoch 1678
Loss = 8.4353e-03, PNorm = 332.2248, GNorm = 0.1631, lr_0 = 9.9919e-04
Validation binary_cross_entropy = 0.047347
Epoch 1679
Loss = 7.1846e-03, PNorm = 332.3100, GNorm = 0.2277, lr_0 = 9.9919e-04
Loss = 4.5466e-02, PNorm = 332.4332, GNorm = 0.9956, lr_0 = 9.9919e-04
Validation binary_cross_entropy = 0.043690
Epoch 1680
Loss = 2.9326e-02, PNorm = 332.5903, GNorm = 0.1476, lr_0 = 9.9919e-04
Validation binary_cross_entropy = 0.056665
Epoch 1681
Loss = 2.7523e-02, PNorm = 332.7179, GNorm = 2.4294, lr_0 = 9.9919e-04
Validation binary_cross_entropy = 0.047523
Epoch 1682
Loss = 2.5719e-02, PNorm = 332.8491, GNorm = 1.7084, lr_0 = 9.9919e-04
Validation binary_cross_entropy = 0.055302
Epoch 1683
Loss = 1.8597e-02, PNorm = 332.9789, GNorm = 0.6730, lr_0 = 9.9919e-04
Validation binary_cross_entropy = 0.072813
Epoch 1684
Loss = 2.1785e-02, PNorm = 333.1051, GNorm = 0.4033, lr_0 = 9.9919e-04
Validation binary_cross_entropy = 0.048223
Epoch 1685
Loss = 1.2312e-02, PNorm = 333.2117, GNorm = 0.0480, lr_0 = 9.9918e-04
Validation binary_cross_entropy = 0.062004
Epoch 1686
Loss = 6.0772e-02, PNorm = 333.3027, GNorm = 0.0256, lr_0 = 9.9918e-04
Validation binary_cross_entropy = 0.058163
Epoch 1687
Loss = 1.6374e-02, PNorm = 333.3872, GNorm = 0.1358, lr_0 = 9.9918e-04
Validation binary_cross_entropy = 0.062594
Epoch 1688
Loss = 7.0878e-03, PNorm = 333.4672, GNorm = 0.0916, lr_0 = 9.9918e-04
Validation binary_cross_entropy = 0.059668
Epoch 1689
Loss = 1.1938e-03, PNorm = 333.5462, GNorm = 0.0567, lr_0 = 9.9918e-04
Loss = 5.3143e-03, PNorm = 333.6185, GNorm = 0.3658, lr_0 = 9.9918e-04
Validation binary_cross_entropy = 0.058808
Epoch 1690
Loss = 2.5679e-02, PNorm = 333.7000, GNorm = 0.2153, lr_0 = 9.9918e-04
Validation binary_cross_entropy = 0.066347
Epoch 1691
Loss = 2.9036e-02, PNorm = 333.8240, GNorm = 0.8118, lr_0 = 9.9918e-04
Validation binary_cross_entropy = 0.056744
Epoch 1692
Loss = 5.4854e-02, PNorm = 333.9731, GNorm = 0.4223, lr_0 = 9.9918e-04
Validation binary_cross_entropy = 0.104462
Epoch 1693
Loss = 4.7252e-02, PNorm = 334.1430, GNorm = 0.3512, lr_0 = 9.9918e-04
Validation binary_cross_entropy = 0.054927
Epoch 1694
Loss = 4.5315e-02, PNorm = 334.2875, GNorm = 0.2120, lr_0 = 9.9918e-04
Validation binary_cross_entropy = 0.052535
Epoch 1695
Loss = 2.4024e-02, PNorm = 334.4166, GNorm = 1.3097, lr_0 = 9.9918e-04
Validation binary_cross_entropy = 0.126340
Epoch 1696
Loss = 4.8699e-03, PNorm = 334.5289, GNorm = 0.0466, lr_0 = 9.9918e-04
Validation binary_cross_entropy = 0.057889
Epoch 1697
Loss = 1.5690e-02, PNorm = 334.6316, GNorm = 0.4727, lr_0 = 9.9918e-04
Validation binary_cross_entropy = 0.069515
Epoch 1698
Loss = 3.6578e-03, PNorm = 334.7594, GNorm = 0.4663, lr_0 = 9.9917e-04
Validation binary_cross_entropy = 0.073424
Epoch 1699
Loss = 1.2237e-02, PNorm = 334.8480, GNorm = 0.5099, lr_0 = 9.9917e-04
Loss = 1.2328e-02, PNorm = 334.9252, GNorm = 0.0553, lr_0 = 9.9917e-04
Validation binary_cross_entropy = 0.064981
Epoch 1700
Loss = 1.1357e-02, PNorm = 335.0151, GNorm = 0.0024, lr_0 = 9.9917e-04
Validation binary_cross_entropy = 0.094109
Epoch 1701
Loss = 4.6992e-02, PNorm = 335.0688, GNorm = 0.0633, lr_0 = 9.9917e-04
Validation binary_cross_entropy = 0.062285
Epoch 1702
Loss = 3.0773e-02, PNorm = 335.1292, GNorm = 0.9858, lr_0 = 9.9917e-04
Validation binary_cross_entropy = 0.059023
Epoch 1703
Loss = 2.2496e-02, PNorm = 335.2300, GNorm = 0.9040, lr_0 = 9.9917e-04
Validation binary_cross_entropy = 0.055232
Epoch 1704
Loss = 2.9691e-02, PNorm = 335.3646, GNorm = 0.4516, lr_0 = 9.9917e-04
Validation binary_cross_entropy = 0.050523
Epoch 1705
Loss = 9.1689e-03, PNorm = 335.4916, GNorm = 0.8747, lr_0 = 9.9917e-04
Validation binary_cross_entropy = 0.076418
Epoch 1706
Loss = 1.7588e-02, PNorm = 335.5986, GNorm = 0.4187, lr_0 = 9.9917e-04
Validation binary_cross_entropy = 0.099415
Epoch 1707
Loss = 2.2089e-04, PNorm = 335.6854, GNorm = 0.0091, lr_0 = 9.9917e-04
Validation binary_cross_entropy = 0.051167
Epoch 1708
Loss = 4.2341e-02, PNorm = 335.7936, GNorm = 3.5604, lr_0 = 9.9917e-04
Validation binary_cross_entropy = 0.063702
Epoch 1709
Loss = 6.4594e-03, PNorm = 335.9778, GNorm = 0.2638, lr_0 = 9.9917e-04
Loss = 5.0209e-02, PNorm = 336.1768, GNorm = 0.9476, lr_0 = 9.9917e-04
Validation binary_cross_entropy = 0.052903
Epoch 1710
Loss = 2.5465e-02, PNorm = 336.4089, GNorm = 0.8458, lr_0 = 9.9917e-04
Validation binary_cross_entropy = 0.088178
Epoch 1711
Loss = 3.3479e-02, PNorm = 336.6054, GNorm = 0.0187, lr_0 = 9.9916e-04
Validation binary_cross_entropy = 0.064354
Epoch 1712
Loss = 5.6897e-02, PNorm = 336.8376, GNorm = 1.6931, lr_0 = 9.9916e-04
Validation binary_cross_entropy = 0.080273
Epoch 1713
Loss = 7.1883e-02, PNorm = 337.0648, GNorm = 0.9721, lr_0 = 9.9916e-04
Validation binary_cross_entropy = 0.058988
Epoch 1714
Loss = 4.5236e-02, PNorm = 337.3003, GNorm = 0.7352, lr_0 = 9.9916e-04
Validation binary_cross_entropy = 0.056209
Epoch 1715
Loss = 3.8018e-02, PNorm = 337.4851, GNorm = 0.1979, lr_0 = 9.9916e-04
Validation binary_cross_entropy = 0.174855
Epoch 1716
Loss = 2.0473e-02, PNorm = 337.6436, GNorm = 0.1682, lr_0 = 9.9916e-04
Validation binary_cross_entropy = 0.074420
Epoch 1717
Loss = 4.2105e-03, PNorm = 337.7728, GNorm = 0.0608, lr_0 = 9.9916e-04
Validation binary_cross_entropy = 0.054409
Epoch 1718
Loss = 1.4574e-01, PNorm = 337.8923, GNorm = 0.6638, lr_0 = 9.9916e-04
Validation binary_cross_entropy = 0.057030
Epoch 1719
Loss = 3.0309e-02, PNorm = 338.0230, GNorm = 0.4712, lr_0 = 9.9916e-04
Loss = 1.1196e-02, PNorm = 338.1422, GNorm = 0.2691, lr_0 = 9.9916e-04
Validation binary_cross_entropy = 0.054317
Epoch 1720
Loss = 3.1669e-02, PNorm = 338.2399, GNorm = 1.0139, lr_0 = 9.9916e-04
Validation binary_cross_entropy = 0.050686
Epoch 1721
Loss = 3.4511e-02, PNorm = 338.3312, GNorm = 0.2779, lr_0 = 9.9916e-04
Validation binary_cross_entropy = 0.064415
Epoch 1722
Loss = 6.2630e-03, PNorm = 338.4121, GNorm = 2.1755, lr_0 = 9.9916e-04
Validation binary_cross_entropy = 0.080230
Epoch 1723
Loss = 6.7847e-02, PNorm = 338.4919, GNorm = 0.2434, lr_0 = 9.9916e-04
Validation binary_cross_entropy = 0.085001
Epoch 1724
Loss = 1.2254e-02, PNorm = 338.5668, GNorm = 0.3425, lr_0 = 9.9915e-04
Validation binary_cross_entropy = 0.050084
Epoch 1725
Loss = 6.4553e-02, PNorm = 338.6540, GNorm = 2.4589, lr_0 = 9.9915e-04
Validation binary_cross_entropy = 0.106243
Epoch 1726
Loss = 1.0471e-02, PNorm = 338.7800, GNorm = 0.0752, lr_0 = 9.9915e-04
Validation binary_cross_entropy = 0.058444
Epoch 1727
Loss = 6.6437e-03, PNorm = 338.9011, GNorm = 0.1158, lr_0 = 9.9915e-04
Validation binary_cross_entropy = 0.071001
Epoch 1728
Loss = 7.5739e-03, PNorm = 339.0178, GNorm = 0.0139, lr_0 = 9.9915e-04
Validation binary_cross_entropy = 0.060434
Epoch 1729
Loss = 6.1529e-03, PNorm = 339.1539, GNorm = 0.3853, lr_0 = 9.9915e-04
Loss = 1.2177e-02, PNorm = 339.2906, GNorm = 0.4788, lr_0 = 9.9915e-04
Validation binary_cross_entropy = 0.082006
Epoch 1730
Loss = 3.4858e-02, PNorm = 339.4119, GNorm = 4.0363, lr_0 = 9.9915e-04
Validation binary_cross_entropy = 0.045406
Epoch 1731
Loss = 5.5738e-02, PNorm = 339.5763, GNorm = 0.1094, lr_0 = 9.9915e-04
Validation binary_cross_entropy = 0.078173
Epoch 1732
Loss = 2.0878e-02, PNorm = 339.7305, GNorm = 0.4268, lr_0 = 9.9915e-04
Validation binary_cross_entropy = 0.075832
Epoch 1733
Loss = 2.5551e-02, PNorm = 339.8630, GNorm = 2.5650, lr_0 = 9.9915e-04
Validation binary_cross_entropy = 0.049885
Epoch 1734
Loss = 2.8491e-02, PNorm = 339.9954, GNorm = 1.6447, lr_0 = 9.9915e-04
Validation binary_cross_entropy = 0.057275
Epoch 1735
Loss = 6.5679e-03, PNorm = 340.0739, GNorm = 0.1127, lr_0 = 9.9915e-04
Validation binary_cross_entropy = 0.045544
Epoch 1736
Loss = 4.7098e-02, PNorm = 340.1713, GNorm = 0.4275, lr_0 = 9.9915e-04
Validation binary_cross_entropy = 0.047698
Epoch 1737
Loss = 7.3732e-03, PNorm = 340.2809, GNorm = 0.0963, lr_0 = 9.9915e-04
Validation binary_cross_entropy = 0.059573
Epoch 1738
Loss = 2.6933e-02, PNorm = 340.3600, GNorm = 1.1007, lr_0 = 9.9914e-04
Validation binary_cross_entropy = 0.051002
Epoch 1739
Loss = 6.3671e-03, PNorm = 340.4260, GNorm = 0.2738, lr_0 = 9.9914e-04
Loss = 1.1298e-01, PNorm = 340.5186, GNorm = 6.6447, lr_0 = 9.9914e-04
Validation binary_cross_entropy = 0.043731
Epoch 1740
Loss = 3.7661e-02, PNorm = 340.6307, GNorm = 1.5610, lr_0 = 9.9914e-04
Validation binary_cross_entropy = 0.074541
Epoch 1741
Loss = 1.2102e-02, PNorm = 340.7498, GNorm = 0.1533, lr_0 = 9.9914e-04
Validation binary_cross_entropy = 0.044158
Epoch 1742
Loss = 9.2585e-03, PNorm = 340.8436, GNorm = 0.7723, lr_0 = 9.9914e-04
Validation binary_cross_entropy = 0.081103
Epoch 1743
Loss = 1.5696e-03, PNorm = 340.9135, GNorm = 0.0047, lr_0 = 9.9914e-04
Validation binary_cross_entropy = 0.138826
Epoch 1744
Loss = 9.7809e-03, PNorm = 340.9661, GNorm = 2.2748, lr_0 = 9.9914e-04
Validation binary_cross_entropy = 0.103652
Epoch 1745
Loss = 2.1012e-02, PNorm = 341.0731, GNorm = 1.4986, lr_0 = 9.9914e-04
Validation binary_cross_entropy = 0.072487
Epoch 1746
Loss = 8.3686e-02, PNorm = 341.2338, GNorm = 0.3955, lr_0 = 9.9914e-04
Validation binary_cross_entropy = 0.046058
Epoch 1747
Loss = 9.9360e-03, PNorm = 341.4105, GNorm = 0.3644, lr_0 = 9.9914e-04
Validation binary_cross_entropy = 0.077991
Epoch 1748
Loss = 1.0241e-01, PNorm = 341.5403, GNorm = 1.6458, lr_0 = 9.9914e-04
Validation binary_cross_entropy = 0.043480
Epoch 1749
Loss = 3.6221e-02, PNorm = 341.6296, GNorm = 1.0135, lr_0 = 9.9914e-04
Loss = 2.9991e-02, PNorm = 341.7223, GNorm = 0.8951, lr_0 = 9.9914e-04
Validation binary_cross_entropy = 0.058245
Epoch 1750
Loss = 3.7279e-02, PNorm = 341.8196, GNorm = 0.5961, lr_0 = 9.9913e-04
Validation binary_cross_entropy = 0.046708
Epoch 1751
Loss = 1.8849e-02, PNorm = 341.9345, GNorm = 0.2298, lr_0 = 9.9913e-04
Validation binary_cross_entropy = 0.063921
Epoch 1752
Loss = 2.6292e-02, PNorm = 342.0329, GNorm = 0.2217, lr_0 = 9.9913e-04
Validation binary_cross_entropy = 0.072890
Epoch 1753
Loss = 2.2636e-02, PNorm = 342.1047, GNorm = 0.0400, lr_0 = 9.9913e-04
Validation binary_cross_entropy = 0.057431
Epoch 1754
Loss = 4.3444e-02, PNorm = 342.1732, GNorm = 1.4262, lr_0 = 9.9913e-04
Validation binary_cross_entropy = 0.048053
Epoch 1755
Loss = 1.9596e-02, PNorm = 342.2694, GNorm = 1.4584, lr_0 = 9.9913e-04
Validation binary_cross_entropy = 0.045289
Epoch 1756
Loss = 2.7204e-02, PNorm = 342.3568, GNorm = 0.6605, lr_0 = 9.9913e-04
Validation binary_cross_entropy = 0.045788
Epoch 1757
Loss = 1.9332e-02, PNorm = 342.4651, GNorm = 0.0682, lr_0 = 9.9913e-04
Validation binary_cross_entropy = 0.047597
Epoch 1758
Loss = 2.1372e-03, PNorm = 342.5578, GNorm = 0.0981, lr_0 = 9.9913e-04
Validation binary_cross_entropy = 0.062043
Epoch 1759
Loss = 1.0362e-03, PNorm = 342.6455, GNorm = 0.0447, lr_0 = 9.9913e-04
Loss = 6.2640e-03, PNorm = 342.7232, GNorm = 0.0151, lr_0 = 9.9913e-04
Validation binary_cross_entropy = 0.158855
Epoch 1760
Loss = 6.5575e-03, PNorm = 342.7787, GNorm = 1.5839, lr_0 = 9.9913e-04
Validation binary_cross_entropy = 0.326782
Epoch 1761
Loss = 4.0678e-02, PNorm = 342.8572, GNorm = 0.2059, lr_0 = 9.9913e-04
Validation binary_cross_entropy = 0.053021
Epoch 1762
Loss = 3.0103e-02, PNorm = 342.9657, GNorm = 2.7802, lr_0 = 9.9913e-04
Validation binary_cross_entropy = 0.064116
Epoch 1763
Loss = 7.5709e-03, PNorm = 343.0540, GNorm = 0.0504, lr_0 = 9.9913e-04
Validation binary_cross_entropy = 0.062984
Epoch 1764
Loss = 7.2357e-03, PNorm = 343.1448, GNorm = 0.1761, lr_0 = 9.9912e-04
Validation binary_cross_entropy = 0.071529
Epoch 1765
Loss = 1.8802e-03, PNorm = 343.2338, GNorm = 0.0479, lr_0 = 9.9912e-04
Validation binary_cross_entropy = 0.050473
Epoch 1766
Loss = 4.9142e-02, PNorm = 343.3296, GNorm = 0.1408, lr_0 = 9.9912e-04
Validation binary_cross_entropy = 0.056166
Epoch 1767
Loss = 5.7012e-03, PNorm = 343.4263, GNorm = 0.0440, lr_0 = 9.9912e-04
Validation binary_cross_entropy = 0.067198
Epoch 1768
Loss = 9.6466e-03, PNorm = 343.4969, GNorm = 0.1582, lr_0 = 9.9912e-04
Validation binary_cross_entropy = 0.041764
Epoch 1769
Loss = 2.4072e-02, PNorm = 343.5720, GNorm = 0.5462, lr_0 = 9.9912e-04
Loss = 2.3903e-02, PNorm = 343.6825, GNorm = 1.0618, lr_0 = 9.9912e-04
Validation binary_cross_entropy = 0.045236
Epoch 1770
Loss = 1.7156e-02, PNorm = 343.7972, GNorm = 1.4276, lr_0 = 9.9912e-04
Validation binary_cross_entropy = 0.100679
Epoch 1771
Loss = 1.6024e-02, PNorm = 343.8736, GNorm = 0.1095, lr_0 = 9.9912e-04
Validation binary_cross_entropy = 0.060186
Epoch 1772
Loss = 1.1606e-02, PNorm = 343.9381, GNorm = 0.2084, lr_0 = 9.9912e-04
Validation binary_cross_entropy = 0.055987
Epoch 1773
Loss = 3.1107e-02, PNorm = 344.0077, GNorm = 3.0834, lr_0 = 9.9912e-04
Validation binary_cross_entropy = 0.054761
Epoch 1774
Loss = 2.5964e-02, PNorm = 344.1020, GNorm = 0.1728, lr_0 = 9.9912e-04
Validation binary_cross_entropy = 0.052833
Epoch 1775
Loss = 9.4900e-03, PNorm = 344.1954, GNorm = 0.1110, lr_0 = 9.9912e-04
Validation binary_cross_entropy = 0.055582
Epoch 1776
Loss = 2.3643e-02, PNorm = 344.2856, GNorm = 0.0637, lr_0 = 9.9912e-04
Validation binary_cross_entropy = 0.042489
Epoch 1777
Loss = 3.3390e-02, PNorm = 344.3672, GNorm = 1.8613, lr_0 = 9.9911e-04
Validation binary_cross_entropy = 0.093880
Epoch 1778
Loss = 1.7707e-01, PNorm = 344.4810, GNorm = 0.6074, lr_0 = 9.9911e-04
Validation binary_cross_entropy = 0.048787
Epoch 1779
Loss = 1.8569e-02, PNorm = 344.5879, GNorm = 0.5249, lr_0 = 9.9911e-04
Loss = 1.5975e-02, PNorm = 344.7062, GNorm = 0.1886, lr_0 = 9.9911e-04
Validation binary_cross_entropy = 0.056493
Epoch 1780
Loss = 1.1198e-02, PNorm = 344.7941, GNorm = 0.0286, lr_0 = 9.9911e-04
Validation binary_cross_entropy = 0.073001
Epoch 1781
Loss = 6.7612e-03, PNorm = 344.8526, GNorm = 0.0093, lr_0 = 9.9911e-04
Validation binary_cross_entropy = 0.085445
Epoch 1782
Loss = 2.2042e-02, PNorm = 344.8984, GNorm = 0.5856, lr_0 = 9.9911e-04
Validation binary_cross_entropy = 0.058231
Epoch 1783
Loss = 1.8447e-02, PNorm = 344.9743, GNorm = 0.0682, lr_0 = 9.9911e-04
Validation binary_cross_entropy = 0.072214
Epoch 1784
Loss = 1.8875e-02, PNorm = 345.0615, GNorm = 0.8223, lr_0 = 9.9911e-04
Validation binary_cross_entropy = 0.186891
Epoch 1785
Loss = 8.2715e-02, PNorm = 345.1308, GNorm = 1.5971, lr_0 = 9.9911e-04
Validation binary_cross_entropy = 0.055498
Epoch 1786
Loss = 2.4419e-02, PNorm = 345.2095, GNorm = 0.2766, lr_0 = 9.9911e-04
Validation binary_cross_entropy = 0.065752
Epoch 1787
Loss = 4.0900e-03, PNorm = 345.3321, GNorm = 0.0222, lr_0 = 9.9911e-04
Validation binary_cross_entropy = 0.101438
Epoch 1788
Loss = 6.4307e-02, PNorm = 345.4306, GNorm = 1.9412, lr_0 = 9.9911e-04
Validation binary_cross_entropy = 0.097876
Epoch 1789
Loss = 2.8780e-01, PNorm = 345.5101, GNorm = 5.2910, lr_0 = 9.9911e-04
Loss = 5.0268e-02, PNorm = 345.6366, GNorm = 0.7969, lr_0 = 9.9911e-04
Validation binary_cross_entropy = 0.053742
Epoch 1790
Loss = 8.4319e-02, PNorm = 345.7951, GNorm = 0.6124, lr_0 = 9.9910e-04
Validation binary_cross_entropy = 0.116822
Epoch 1791
Loss = 8.1585e-02, PNorm = 345.9903, GNorm = 0.3785, lr_0 = 9.9910e-04
Validation binary_cross_entropy = 0.036943
Epoch 1792
Loss = 3.4106e-02, PNorm = 346.2162, GNorm = 0.1406, lr_0 = 9.9910e-04
Validation binary_cross_entropy = 0.078470
Epoch 1793
Loss = 3.9338e-02, PNorm = 346.3704, GNorm = 1.1598, lr_0 = 9.9910e-04
Validation binary_cross_entropy = 0.065948
Epoch 1794
Loss = 5.1909e-02, PNorm = 346.4903, GNorm = 2.6101, lr_0 = 9.9910e-04
Validation binary_cross_entropy = 0.074511
Epoch 1795
Loss = 6.6875e-03, PNorm = 346.6144, GNorm = 0.1369, lr_0 = 9.9910e-04
Validation binary_cross_entropy = 0.078125
Epoch 1796
Loss = 3.3837e-02, PNorm = 346.7380, GNorm = 2.2354, lr_0 = 9.9910e-04
Validation binary_cross_entropy = 0.082843
Epoch 1797
Loss = 5.7130e-02, PNorm = 346.8788, GNorm = 1.7043, lr_0 = 9.9910e-04
Validation binary_cross_entropy = 0.086897
Epoch 1798
Loss = 4.7577e-02, PNorm = 347.0064, GNorm = 0.5173, lr_0 = 9.9910e-04
Validation binary_cross_entropy = 0.184438
Epoch 1799
Loss = 4.4207e-02, PNorm = 347.1421, GNorm = 1.6356, lr_0 = 9.9910e-04
Loss = 3.0099e-02, PNorm = 347.2658, GNorm = 0.1530, lr_0 = 9.9910e-04
Validation binary_cross_entropy = 0.088842
Epoch 1800
Loss = 9.8417e-02, PNorm = 347.4303, GNorm = 1.9087, lr_0 = 9.9910e-04
Validation binary_cross_entropy = 0.099421
Epoch 1801
Loss = 2.4147e-02, PNorm = 347.6612, GNorm = 0.1967, lr_0 = 9.9910e-04
Validation binary_cross_entropy = 0.105819
Epoch 1802
Loss = 6.7653e-02, PNorm = 347.8654, GNorm = 0.9849, lr_0 = 9.9910e-04
Validation binary_cross_entropy = 0.079143
Epoch 1803
Loss = 4.2783e-02, PNorm = 348.0556, GNorm = 0.1477, lr_0 = 9.9909e-04
Validation binary_cross_entropy = 0.069435
Epoch 1804
Loss = 4.6661e-02, PNorm = 348.2197, GNorm = 0.5082, lr_0 = 9.9909e-04
Validation binary_cross_entropy = 0.046960
Epoch 1805
Loss = 2.3849e-02, PNorm = 348.3875, GNorm = 0.4822, lr_0 = 9.9909e-04
Validation binary_cross_entropy = 0.062681
Epoch 1806
Loss = 4.3646e-02, PNorm = 348.5296, GNorm = 1.1437, lr_0 = 9.9909e-04
Validation binary_cross_entropy = 0.065365
Epoch 1807
Loss = 2.3599e-02, PNorm = 348.6700, GNorm = 1.9009, lr_0 = 9.9909e-04
Validation binary_cross_entropy = 0.067142
Epoch 1808
Loss = 7.2463e-02, PNorm = 348.7938, GNorm = 1.0004, lr_0 = 9.9909e-04
Validation binary_cross_entropy = 0.050970
Epoch 1809
Loss = 1.6769e-02, PNorm = 348.9172, GNorm = 0.8085, lr_0 = 9.9909e-04
Loss = 3.1723e-02, PNorm = 349.0638, GNorm = 0.8001, lr_0 = 9.9909e-04
Validation binary_cross_entropy = 0.071722
Epoch 1810
Loss = 3.3337e-02, PNorm = 349.2103, GNorm = 2.0723, lr_0 = 9.9909e-04
Validation binary_cross_entropy = 0.096576
Epoch 1811
Loss = 1.1993e-01, PNorm = 349.3430, GNorm = 1.3111, lr_0 = 9.9909e-04
Validation binary_cross_entropy = 0.047119
Epoch 1812
Loss = 5.9994e-02, PNorm = 349.5133, GNorm = 1.1548, lr_0 = 9.9909e-04
Validation binary_cross_entropy = 0.115791
Epoch 1813
Loss = 3.1435e-02, PNorm = 349.7546, GNorm = 1.7296, lr_0 = 9.9909e-04
Validation binary_cross_entropy = 0.045409
Epoch 1814
Loss = 7.3373e-02, PNorm = 349.9447, GNorm = 1.4596, lr_0 = 9.9909e-04
Validation binary_cross_entropy = 0.058471
Epoch 1815
Loss = 2.9489e-02, PNorm = 350.0813, GNorm = 0.8923, lr_0 = 9.9909e-04
Validation binary_cross_entropy = 0.073688
Epoch 1816
Loss = 3.9425e-02, PNorm = 350.1797, GNorm = 0.0747, lr_0 = 9.9909e-04
Validation binary_cross_entropy = 0.074899
Epoch 1817
Loss = 4.6618e-02, PNorm = 350.2433, GNorm = 0.3252, lr_0 = 9.9908e-04
Validation binary_cross_entropy = 0.068325
Epoch 1818
Loss = 3.4355e-03, PNorm = 350.3042, GNorm = 0.0607, lr_0 = 9.9908e-04
Validation binary_cross_entropy = 0.111294
Epoch 1819
Loss = 5.5313e-03, PNorm = 350.3873, GNorm = 0.1953, lr_0 = 9.9908e-04
Loss = 2.7047e-02, PNorm = 350.4898, GNorm = 0.2580, lr_0 = 9.9908e-04
Validation binary_cross_entropy = 0.062563
Epoch 1820
Loss = 3.2792e-02, PNorm = 350.5964, GNorm = 0.2642, lr_0 = 9.9908e-04
Validation binary_cross_entropy = 0.053721
Epoch 1821
Loss = 3.8802e-02, PNorm = 350.7004, GNorm = 2.0866, lr_0 = 9.9908e-04
Validation binary_cross_entropy = 0.049027
Epoch 1822
Loss = 5.0655e-02, PNorm = 350.8212, GNorm = 0.7376, lr_0 = 9.9908e-04
Validation binary_cross_entropy = 0.067163
Epoch 1823
Loss = 2.5994e-02, PNorm = 350.9538, GNorm = 0.1399, lr_0 = 9.9908e-04
Validation binary_cross_entropy = 0.061612
Epoch 1824
Loss = 7.3705e-02, PNorm = 351.0625, GNorm = 1.5312, lr_0 = 9.9908e-04
Validation binary_cross_entropy = 0.052416
Epoch 1825
Loss = 3.3987e-02, PNorm = 351.1999, GNorm = 0.1719, lr_0 = 9.9908e-04
Validation binary_cross_entropy = 0.079589
Epoch 1826
Loss = 1.0487e-01, PNorm = 351.3371, GNorm = 0.8053, lr_0 = 9.9908e-04
Validation binary_cross_entropy = 0.061776
Epoch 1827
Loss = 1.7422e-02, PNorm = 351.4628, GNorm = 0.5062, lr_0 = 9.9908e-04
Validation binary_cross_entropy = 0.056535
Epoch 1828
Loss = 1.4861e-02, PNorm = 351.5919, GNorm = 1.6911, lr_0 = 9.9908e-04
Validation binary_cross_entropy = 0.071903
Epoch 1829
Loss = 1.3859e-02, PNorm = 351.7313, GNorm = 0.4101, lr_0 = 9.9908e-04
Loss = 4.7977e-02, PNorm = 351.8921, GNorm = 0.3266, lr_0 = 9.9908e-04
Validation binary_cross_entropy = 0.053753
Epoch 1830
Loss = 3.2111e-02, PNorm = 352.0374, GNorm = 0.1251, lr_0 = 9.9907e-04
Validation binary_cross_entropy = 0.050852
Epoch 1831
Loss = 3.0990e-02, PNorm = 352.1669, GNorm = 1.4889, lr_0 = 9.9907e-04
Validation binary_cross_entropy = 0.056564
Epoch 1832
Loss = 4.4870e-02, PNorm = 352.2763, GNorm = 4.0552, lr_0 = 9.9907e-04
Validation binary_cross_entropy = 0.068076
Epoch 1833
Loss = 4.1961e-02, PNorm = 352.3678, GNorm = 0.2197, lr_0 = 9.9907e-04
Validation binary_cross_entropy = 0.036990
Epoch 1834
Loss = 3.2834e-02, PNorm = 352.5120, GNorm = 0.2489, lr_0 = 9.9907e-04
Validation binary_cross_entropy = 0.058179
Epoch 1835
Loss = 1.2543e-02, PNorm = 352.6664, GNorm = 0.1391, lr_0 = 9.9907e-04
Validation binary_cross_entropy = 0.067082
Epoch 1836
Loss = 1.5111e-02, PNorm = 352.7929, GNorm = 0.0381, lr_0 = 9.9907e-04
Validation binary_cross_entropy = 0.074428
Epoch 1837
Loss = 1.5041e-02, PNorm = 352.8997, GNorm = 0.5964, lr_0 = 9.9907e-04
Validation binary_cross_entropy = 0.104268
Epoch 1838
Loss = 1.2773e-03, PNorm = 352.9916, GNorm = 0.0613, lr_0 = 9.9907e-04
Validation binary_cross_entropy = 0.059850
Epoch 1839
Loss = 2.8098e-02, PNorm = 353.0933, GNorm = 1.2033, lr_0 = 9.9907e-04
Loss = 2.9786e-02, PNorm = 353.2524, GNorm = 0.3429, lr_0 = 9.9907e-04
Validation binary_cross_entropy = 0.119206
Epoch 1840
Loss = 5.1293e-02, PNorm = 353.3935, GNorm = 0.1642, lr_0 = 9.9907e-04
Validation binary_cross_entropy = 0.100772
Epoch 1841
Loss = 6.6634e-02, PNorm = 353.5373, GNorm = 1.0955, lr_0 = 9.9907e-04
Validation binary_cross_entropy = 0.039600
Epoch 1842
Loss = 3.9109e-02, PNorm = 353.7359, GNorm = 0.1957, lr_0 = 9.9907e-04
Validation binary_cross_entropy = 0.103264
Epoch 1843
Loss = 4.8029e-02, PNorm = 353.8729, GNorm = 2.6852, lr_0 = 9.9906e-04
Validation binary_cross_entropy = 0.097882
Epoch 1844
Loss = 2.0831e-02, PNorm = 353.9819, GNorm = 0.5414, lr_0 = 9.9906e-04
Validation binary_cross_entropy = 0.100124
Epoch 1845
Loss = 1.7484e-02, PNorm = 354.0863, GNorm = 0.0532, lr_0 = 9.9906e-04
Validation binary_cross_entropy = 0.149610
Epoch 1846
Loss = 3.5716e-02, PNorm = 354.1662, GNorm = 0.0648, lr_0 = 9.9906e-04
Validation binary_cross_entropy = 0.130781
Epoch 1847
Loss = 4.6656e-02, PNorm = 354.2219, GNorm = 1.1169, lr_0 = 9.9906e-04
Validation binary_cross_entropy = 0.061649
Epoch 1848
Loss = 3.5059e-02, PNorm = 354.3285, GNorm = 1.0898, lr_0 = 9.9906e-04
Validation binary_cross_entropy = 0.075152
Epoch 1849
Loss = 2.3420e-03, PNorm = 354.4428, GNorm = 0.0787, lr_0 = 9.9906e-04
Loss = 2.0869e-02, PNorm = 354.5495, GNorm = 0.9236, lr_0 = 9.9906e-04
Validation binary_cross_entropy = 0.068190
Epoch 1850
Loss = 3.3467e-02, PNorm = 354.6460, GNorm = 1.9866, lr_0 = 9.9906e-04
Validation binary_cross_entropy = 0.068364
Epoch 1851
Loss = 2.7285e-02, PNorm = 354.7481, GNorm = 0.0891, lr_0 = 9.9906e-04
Validation binary_cross_entropy = 0.062070
Epoch 1852
Loss = 4.0517e-02, PNorm = 354.8422, GNorm = 0.2435, lr_0 = 9.9906e-04
Validation binary_cross_entropy = 0.045423
Epoch 1853
Loss = 1.3821e-02, PNorm = 354.9563, GNorm = 0.9659, lr_0 = 9.9906e-04
Validation binary_cross_entropy = 0.056708
Epoch 1854
Loss = 2.3280e-02, PNorm = 355.0493, GNorm = 1.4624, lr_0 = 9.9906e-04
Validation binary_cross_entropy = 0.064844
Epoch 1855
Loss = 6.3672e-03, PNorm = 355.1173, GNorm = 0.6684, lr_0 = 9.9906e-04
Validation binary_cross_entropy = 0.070882
Epoch 1856
Loss = 1.8685e-03, PNorm = 355.1887, GNorm = 0.1452, lr_0 = 9.9906e-04
Validation binary_cross_entropy = 0.068123
Epoch 1857
Loss = 1.6120e-02, PNorm = 355.2678, GNorm = 0.0206, lr_0 = 9.9905e-04
Validation binary_cross_entropy = 0.059157
Epoch 1858
Loss = 4.7099e-03, PNorm = 355.3482, GNorm = 0.0585, lr_0 = 9.9905e-04
Validation binary_cross_entropy = 0.060632
Epoch 1859
Loss = 4.6998e-03, PNorm = 355.4260, GNorm = 0.1378, lr_0 = 9.9905e-04
Loss = 2.4425e-02, PNorm = 355.5861, GNorm = 0.6993, lr_0 = 9.9905e-04
Validation binary_cross_entropy = 0.063085
Epoch 1860
Loss = 7.5210e-03, PNorm = 355.7208, GNorm = 0.9147, lr_0 = 9.9905e-04
Validation binary_cross_entropy = 0.075768
Epoch 1861
Loss = 2.5879e-02, PNorm = 355.8195, GNorm = 1.1128, lr_0 = 9.9905e-04
Validation binary_cross_entropy = 0.068167
Epoch 1862
Loss = 5.8584e-02, PNorm = 355.9239, GNorm = 0.7724, lr_0 = 9.9905e-04
Validation binary_cross_entropy = 0.067698
Epoch 1863
Loss = 5.1232e-02, PNorm = 356.0515, GNorm = 0.8611, lr_0 = 9.9905e-04
Validation binary_cross_entropy = 0.054774
Epoch 1864
Loss = 5.6759e-02, PNorm = 356.1633, GNorm = 0.2744, lr_0 = 9.9905e-04
Validation binary_cross_entropy = 0.047574
Epoch 1865
Loss = 3.5687e-02, PNorm = 356.2735, GNorm = 0.6612, lr_0 = 9.9905e-04
Validation binary_cross_entropy = 0.047886
Epoch 1866
Loss = 1.0955e-02, PNorm = 356.3878, GNorm = 0.2698, lr_0 = 9.9905e-04
Validation binary_cross_entropy = 0.063466
Epoch 1867
Loss = 5.3180e-03, PNorm = 356.4912, GNorm = 0.0824, lr_0 = 9.9905e-04
Validation binary_cross_entropy = 0.081883
Epoch 1868
Loss = 3.2484e-02, PNorm = 356.5697, GNorm = 0.0694, lr_0 = 9.9905e-04
Validation binary_cross_entropy = 0.089858
Epoch 1869
Loss = 1.7980e-02, PNorm = 356.6921, GNorm = 0.6161, lr_0 = 9.9905e-04
Loss = 4.4760e-02, PNorm = 356.8144, GNorm = 1.8816, lr_0 = 9.9904e-04
Validation binary_cross_entropy = 0.112958
Epoch 1870
Loss = 5.2211e-02, PNorm = 356.9876, GNorm = 0.6405, lr_0 = 9.9904e-04
Validation binary_cross_entropy = 0.074856
Epoch 1871
Loss = 3.1368e-02, PNorm = 357.1506, GNorm = 0.1200, lr_0 = 9.9904e-04
Validation binary_cross_entropy = 0.088942
Epoch 1872
Loss = 4.0043e-02, PNorm = 357.2562, GNorm = 2.2046, lr_0 = 9.9904e-04
Validation binary_cross_entropy = 0.068869
Epoch 1873
Loss = 9.2593e-02, PNorm = 357.3417, GNorm = 0.1412, lr_0 = 9.9904e-04
Validation binary_cross_entropy = 0.064513
Epoch 1874
Loss = 3.6084e-02, PNorm = 357.4544, GNorm = 2.0337, lr_0 = 9.9904e-04
Validation binary_cross_entropy = 0.050481
Epoch 1875
Loss = 2.4780e-02, PNorm = 357.5599, GNorm = 0.3935, lr_0 = 9.9904e-04
Validation binary_cross_entropy = 0.058145
Epoch 1876
Loss = 5.5274e-02, PNorm = 357.6669, GNorm = 0.9166, lr_0 = 9.9904e-04
Validation binary_cross_entropy = 0.057927
Epoch 1877
Loss = 9.5899e-03, PNorm = 357.7648, GNorm = 0.2166, lr_0 = 9.9904e-04
Validation binary_cross_entropy = 0.054826
Epoch 1878
Loss = 4.9711e-03, PNorm = 357.8611, GNorm = 0.1041, lr_0 = 9.9904e-04
Validation binary_cross_entropy = 0.065008
Epoch 1879
Loss = 4.7136e-03, PNorm = 357.9512, GNorm = 0.3694, lr_0 = 9.9904e-04
Loss = 6.6650e-02, PNorm = 358.0337, GNorm = 3.3939, lr_0 = 9.9904e-04
Validation binary_cross_entropy = 0.101537
Epoch 1880
Loss = 3.1973e-02, PNorm = 358.1829, GNorm = 0.2169, lr_0 = 9.9904e-04
Validation binary_cross_entropy = 0.068504
Epoch 1881
Loss = 1.3749e-02, PNorm = 358.3415, GNorm = 0.1230, lr_0 = 9.9904e-04
Validation binary_cross_entropy = 0.132108
Epoch 1882
Loss = 2.3497e-02, PNorm = 358.4593, GNorm = 0.8664, lr_0 = 9.9904e-04
Validation binary_cross_entropy = 0.101563
Epoch 1883
Loss = 1.2658e-02, PNorm = 358.5661, GNorm = 0.0581, lr_0 = 9.9903e-04
Validation binary_cross_entropy = 0.050570
Epoch 1884
Loss = 6.4638e-02, PNorm = 358.7364, GNorm = 1.3154, lr_0 = 9.9903e-04
Validation binary_cross_entropy = 0.100096
Epoch 1885
Loss = 4.8449e-02, PNorm = 358.9614, GNorm = 1.0982, lr_0 = 9.9903e-04
Validation binary_cross_entropy = 0.067636
Epoch 1886
Loss = 2.8809e-02, PNorm = 359.1425, GNorm = 0.2486, lr_0 = 9.9903e-04
Validation binary_cross_entropy = 0.141359
Epoch 1887
Loss = 5.1979e-02, PNorm = 359.3126, GNorm = 0.7359, lr_0 = 9.9903e-04
Validation binary_cross_entropy = 0.063200
Epoch 1888
Loss = 2.3507e-02, PNorm = 359.4657, GNorm = 0.8231, lr_0 = 9.9903e-04
Validation binary_cross_entropy = 0.079078
Epoch 1889
Loss = 6.1007e-02, PNorm = 359.5964, GNorm = 0.8123, lr_0 = 9.9903e-04
Loss = 3.4377e-02, PNorm = 359.7088, GNorm = 0.0809, lr_0 = 9.9903e-04
Validation binary_cross_entropy = 0.067783
Epoch 1890
Loss = 3.4643e-02, PNorm = 359.8103, GNorm = 0.3241, lr_0 = 9.9903e-04
Validation binary_cross_entropy = 0.054159
Epoch 1891
Loss = 3.3413e-02, PNorm = 359.9069, GNorm = 0.4980, lr_0 = 9.9903e-04
Validation binary_cross_entropy = 0.119561
Epoch 1892
Loss = 2.4865e-02, PNorm = 360.0009, GNorm = 0.4919, lr_0 = 9.9903e-04
Validation binary_cross_entropy = 0.074772
Epoch 1893
Loss = 1.2884e-02, PNorm = 360.1009, GNorm = 0.2465, lr_0 = 9.9903e-04
Validation binary_cross_entropy = 0.050435
Epoch 1894
Loss = 1.8300e-02, PNorm = 360.2080, GNorm = 0.0567, lr_0 = 9.9903e-04
Validation binary_cross_entropy = 0.055334
Epoch 1895
Loss = 5.1672e-02, PNorm = 360.2939, GNorm = 0.9726, lr_0 = 9.9903e-04
Validation binary_cross_entropy = 0.044750
Epoch 1896
Loss = 8.6326e-03, PNorm = 360.4028, GNorm = 0.2605, lr_0 = 9.9902e-04
Validation binary_cross_entropy = 0.073187
Epoch 1897
Loss = 2.9793e-03, PNorm = 360.4945, GNorm = 0.1457, lr_0 = 9.9902e-04
Validation binary_cross_entropy = 0.073383
Epoch 1898
Loss = 1.2166e-01, PNorm = 360.5504, GNorm = 1.0293, lr_0 = 9.9902e-04
Validation binary_cross_entropy = 0.078793
Epoch 1899
Loss = 4.1743e-03, PNorm = 360.6306, GNorm = 0.1863, lr_0 = 9.9902e-04
Loss = 1.3807e-02, PNorm = 360.7112, GNorm = 0.0238, lr_0 = 9.9902e-04
Validation binary_cross_entropy = 0.087673
Epoch 1900
Loss = 5.8329e-02, PNorm = 360.7719, GNorm = 0.6857, lr_0 = 9.9902e-04
Validation binary_cross_entropy = 0.049075
Epoch 1901
Loss = 2.4601e-02, PNorm = 360.8577, GNorm = 1.4547, lr_0 = 9.9902e-04
Validation binary_cross_entropy = 0.060925
Epoch 1902
Loss = 3.7550e-02, PNorm = 360.9369, GNorm = 0.9240, lr_0 = 9.9902e-04
Validation binary_cross_entropy = 0.049315
Epoch 1903
Loss = 2.8571e-02, PNorm = 361.0187, GNorm = 1.4287, lr_0 = 9.9902e-04
Validation binary_cross_entropy = 0.049473
Epoch 1904
Loss = 6.5982e-03, PNorm = 361.1144, GNorm = 0.1336, lr_0 = 9.9902e-04
Validation binary_cross_entropy = 0.064399
Epoch 1905
Loss = 1.8490e-02, PNorm = 361.1937, GNorm = 1.2045, lr_0 = 9.9902e-04
Validation binary_cross_entropy = 0.061184
Epoch 1906
Loss = 1.1112e-02, PNorm = 361.2501, GNorm = 0.0225, lr_0 = 9.9902e-04
Validation binary_cross_entropy = 0.055006
Epoch 1907
Loss = 4.7941e-03, PNorm = 361.3237, GNorm = 0.2568, lr_0 = 9.9902e-04
Validation binary_cross_entropy = 0.078987
Epoch 1908
Loss = 3.5932e-02, PNorm = 361.4090, GNorm = 0.5570, lr_0 = 9.9902e-04
Validation binary_cross_entropy = 0.054476
Epoch 1909
Loss = 3.1383e-03, PNorm = 361.4817, GNorm = 0.1348, lr_0 = 9.9902e-04
Loss = 1.1704e-02, PNorm = 361.5490, GNorm = 1.3092, lr_0 = 9.9901e-04
Validation binary_cross_entropy = 0.059136
Epoch 1910
Loss = 2.3116e-02, PNorm = 361.6220, GNorm = 1.5110, lr_0 = 9.9901e-04
Validation binary_cross_entropy = 0.086978
Epoch 1911
Loss = 2.9742e-02, PNorm = 361.6902, GNorm = 1.9665, lr_0 = 9.9901e-04
Validation binary_cross_entropy = 0.069351
Epoch 1912
Loss = 8.9253e-03, PNorm = 361.7674, GNorm = 0.2852, lr_0 = 9.9901e-04
Validation binary_cross_entropy = 0.071379
Epoch 1913
Loss = 2.8166e-02, PNorm = 361.8455, GNorm = 3.1107, lr_0 = 9.9901e-04
Validation binary_cross_entropy = 0.063801
Epoch 1914
Loss = 7.7961e-02, PNorm = 361.9361, GNorm = 2.1018, lr_0 = 9.9901e-04
Validation binary_cross_entropy = 0.060561
Epoch 1915
Loss = 2.4480e-02, PNorm = 362.0495, GNorm = 0.7152, lr_0 = 9.9901e-04
Validation binary_cross_entropy = 0.087811
Epoch 1916
Loss = 4.2142e-02, PNorm = 362.1564, GNorm = 0.1634, lr_0 = 9.9901e-04
Validation binary_cross_entropy = 0.065338
Epoch 1917
Loss = 1.7878e-02, PNorm = 362.2493, GNorm = 0.8423, lr_0 = 9.9901e-04
Validation binary_cross_entropy = 0.067216
Epoch 1918
Loss = 4.2473e-02, PNorm = 362.3429, GNorm = 0.0645, lr_0 = 9.9901e-04
Validation binary_cross_entropy = 0.074424
Epoch 1919
Loss = 4.1041e-02, PNorm = 362.4668, GNorm = 1.1623, lr_0 = 9.9901e-04
Loss = 2.4406e-02, PNorm = 362.5832, GNorm = 0.3509, lr_0 = 9.9901e-04
Validation binary_cross_entropy = 0.067234
Epoch 1920
Loss = 4.2058e-02, PNorm = 362.6985, GNorm = 0.5253, lr_0 = 9.9901e-04
Validation binary_cross_entropy = 0.070216
Epoch 1921
Loss = 9.8701e-03, PNorm = 362.8176, GNorm = 0.1792, lr_0 = 9.9901e-04
Validation binary_cross_entropy = 0.081013
Epoch 1922
Loss = 1.4738e-02, PNorm = 362.9020, GNorm = 0.0252, lr_0 = 9.9900e-04
Validation binary_cross_entropy = 0.085200
Epoch 1923
Loss = 4.3215e-02, PNorm = 362.9928, GNorm = 1.7524, lr_0 = 9.9900e-04
Validation binary_cross_entropy = 0.179340
Epoch 1924
Loss = 1.5604e-02, PNorm = 363.1250, GNorm = 1.7595, lr_0 = 9.9900e-04
Validation binary_cross_entropy = 0.065696
Epoch 1925
Loss = 4.7514e-02, PNorm = 363.2503, GNorm = 1.9412, lr_0 = 9.9900e-04
Validation binary_cross_entropy = 0.079705
Epoch 1926
Loss = 1.1480e-02, PNorm = 363.3852, GNorm = 0.1465, lr_0 = 9.9900e-04
Validation binary_cross_entropy = 0.088297
Epoch 1927
Loss = 9.8962e-03, PNorm = 363.4948, GNorm = 0.9142, lr_0 = 9.9900e-04
Validation binary_cross_entropy = 0.080691
Epoch 1928
Loss = 2.4610e-03, PNorm = 363.5867, GNorm = 0.0457, lr_0 = 9.9900e-04
Validation binary_cross_entropy = 0.125790
Epoch 1929
Loss = 9.1405e-04, PNorm = 363.6583, GNorm = 0.0615, lr_0 = 9.9900e-04
Loss = 7.8428e-03, PNorm = 363.7046, GNorm = 0.5135, lr_0 = 9.9900e-04
Validation binary_cross_entropy = 0.123008
Epoch 1930
Loss = 3.4645e-02, PNorm = 363.7768, GNorm = 0.0420, lr_0 = 9.9900e-04
Validation binary_cross_entropy = 0.169916
Epoch 1931
Loss = 2.2675e-02, PNorm = 363.8658, GNorm = 3.0810, lr_0 = 9.9900e-04
Validation binary_cross_entropy = 0.196518
Epoch 1932
Loss = 4.1190e-02, PNorm = 363.9361, GNorm = 1.6610, lr_0 = 9.9900e-04
Validation binary_cross_entropy = 0.095544
Epoch 1933
Loss = 4.8532e-02, PNorm = 364.0237, GNorm = 0.8152, lr_0 = 9.9900e-04
Validation binary_cross_entropy = 0.064010
Epoch 1934
Loss = 2.1709e-02, PNorm = 364.1805, GNorm = 0.2350, lr_0 = 9.9900e-04
Validation binary_cross_entropy = 0.100045
Epoch 1935
Loss = 5.2161e-02, PNorm = 364.2963, GNorm = 0.3227, lr_0 = 9.9900e-04
Validation binary_cross_entropy = 0.083736
Epoch 1936
Loss = 9.7076e-03, PNorm = 364.3888, GNorm = 0.1494, lr_0 = 9.9899e-04
Validation binary_cross_entropy = 0.082985
Epoch 1937
Loss = 7.7804e-02, PNorm = 364.4811, GNorm = 2.5136, lr_0 = 9.9899e-04
Validation binary_cross_entropy = 0.079413
Epoch 1938
Loss = 3.9475e-03, PNorm = 364.5804, GNorm = 0.0923, lr_0 = 9.9899e-04
Validation binary_cross_entropy = 0.111150
Epoch 1939
Loss = 4.6295e-02, PNorm = 364.6742, GNorm = 1.3884, lr_0 = 9.9899e-04
Loss = 5.8394e-02, PNorm = 364.7775, GNorm = 0.1240, lr_0 = 9.9899e-04
Validation binary_cross_entropy = 0.089119
Epoch 1940
Loss = 3.2757e-02, PNorm = 364.9103, GNorm = 2.2365, lr_0 = 9.9899e-04
Validation binary_cross_entropy = 0.077072
Epoch 1941
Loss = 2.1680e-02, PNorm = 365.0318, GNorm = 0.2125, lr_0 = 9.9899e-04
Validation binary_cross_entropy = 0.099156
Epoch 1942
Loss = 1.9032e-02, PNorm = 365.1222, GNorm = 0.0353, lr_0 = 9.9899e-04
Validation binary_cross_entropy = 0.068108
Epoch 1943
Loss = 1.3398e-02, PNorm = 365.2038, GNorm = 0.0132, lr_0 = 9.9899e-04
Validation binary_cross_entropy = 0.088009
Epoch 1944
Loss = 5.3373e-03, PNorm = 365.2793, GNorm = 0.3810, lr_0 = 9.9899e-04
Validation binary_cross_entropy = 0.120081
Epoch 1945
Loss = 4.5342e-03, PNorm = 365.3378, GNorm = 0.0015, lr_0 = 9.9899e-04
Validation binary_cross_entropy = 0.095322
Epoch 1946
Loss = 1.4161e-02, PNorm = 365.4029, GNorm = 0.3916, lr_0 = 9.9899e-04
Validation binary_cross_entropy = 0.067239
Epoch 1947
Loss = 2.9294e-02, PNorm = 365.5016, GNorm = 1.3987, lr_0 = 9.9899e-04
Validation binary_cross_entropy = 0.117678
Epoch 1948
Loss = 7.3763e-02, PNorm = 365.6223, GNorm = 1.3828, lr_0 = 9.9899e-04
Validation binary_cross_entropy = 0.066641
Epoch 1949
Loss = 7.5840e-03, PNorm = 365.7606, GNorm = 0.1776, lr_0 = 9.9898e-04
Loss = 3.6490e-02, PNorm = 365.8880, GNorm = 3.5538, lr_0 = 9.9898e-04
Validation binary_cross_entropy = 0.072638
Epoch 1950
Loss = 3.8480e-02, PNorm = 365.9967, GNorm = 0.0887, lr_0 = 9.9898e-04
Validation binary_cross_entropy = 0.062667
Epoch 1951
Loss = 8.0144e-02, PNorm = 366.1135, GNorm = 3.1734, lr_0 = 9.9898e-04
Validation binary_cross_entropy = 0.108402
Epoch 1952
Loss = 3.1669e-02, PNorm = 366.2278, GNorm = 0.1736, lr_0 = 9.9898e-04
Validation binary_cross_entropy = 0.065250
Epoch 1953
Loss = 3.6331e-02, PNorm = 366.3475, GNorm = 1.0643, lr_0 = 9.9898e-04
Validation binary_cross_entropy = 0.077437
Epoch 1954
Loss = 2.8600e-02, PNorm = 366.4400, GNorm = 0.1109, lr_0 = 9.9898e-04
Validation binary_cross_entropy = 0.081593
Epoch 1955
Loss = 3.6122e-03, PNorm = 366.5189, GNorm = 0.5900, lr_0 = 9.9898e-04
Validation binary_cross_entropy = 0.073691
Epoch 1956
Loss = 3.0061e-03, PNorm = 366.5737, GNorm = 0.1606, lr_0 = 9.9898e-04
Validation binary_cross_entropy = 0.074460
Epoch 1957
Loss = 3.5156e-02, PNorm = 366.6467, GNorm = 0.0791, lr_0 = 9.9898e-04
Validation binary_cross_entropy = 0.109905
Epoch 1958
Loss = 3.5891e-02, PNorm = 366.7207, GNorm = 1.1051, lr_0 = 9.9898e-04
Validation binary_cross_entropy = 0.061498
Epoch 1959
Loss = 1.8708e-02, PNorm = 366.7857, GNorm = 0.5882, lr_0 = 9.9898e-04
Loss = 3.6251e-02, PNorm = 366.8836, GNorm = 0.5954, lr_0 = 9.9898e-04
Validation binary_cross_entropy = 0.064479
Epoch 1960
Loss = 5.1528e-02, PNorm = 366.9829, GNorm = 0.2264, lr_0 = 9.9898e-04
Validation binary_cross_entropy = 0.083387
Epoch 1961
Loss = 9.0371e-03, PNorm = 367.0916, GNorm = 0.3630, lr_0 = 9.9898e-04
Validation binary_cross_entropy = 0.079146
Epoch 1962
Loss = 2.5206e-02, PNorm = 367.1878, GNorm = 0.3486, lr_0 = 9.9897e-04
Validation binary_cross_entropy = 0.061726
Epoch 1963
Loss = 1.6743e-02, PNorm = 367.3067, GNorm = 1.1635, lr_0 = 9.9897e-04
Validation binary_cross_entropy = 0.110571
Epoch 1964
Loss = 5.9733e-02, PNorm = 367.4075, GNorm = 0.0904, lr_0 = 9.9897e-04
Validation binary_cross_entropy = 0.066387
Epoch 1965
Loss = 6.0838e-03, PNorm = 367.5131, GNorm = 0.1017, lr_0 = 9.9897e-04
Validation binary_cross_entropy = 0.095563
Epoch 1966
Loss = 3.3357e-02, PNorm = 367.5917, GNorm = 0.3938, lr_0 = 9.9897e-04
Validation binary_cross_entropy = 0.077431
Epoch 1967
Loss = 2.1870e-03, PNorm = 367.6581, GNorm = 0.0529, lr_0 = 9.9897e-04
Validation binary_cross_entropy = 0.068617
Epoch 1968
Loss = 2.6758e-02, PNorm = 367.7266, GNorm = 0.6743, lr_0 = 9.9897e-04
Validation binary_cross_entropy = 0.081375
Epoch 1969
Loss = 6.6192e-03, PNorm = 367.7999, GNorm = 0.4775, lr_0 = 9.9897e-04
Loss = 1.6530e-02, PNorm = 367.8605, GNorm = 0.0279, lr_0 = 9.9897e-04
Validation binary_cross_entropy = 0.077730
Epoch 1970
Loss = 2.7444e-02, PNorm = 367.9347, GNorm = 1.6437, lr_0 = 9.9897e-04
Validation binary_cross_entropy = 0.086081
Epoch 1971
Loss = 3.6190e-02, PNorm = 368.0260, GNorm = 0.0693, lr_0 = 9.9897e-04
Validation binary_cross_entropy = 0.072417
Epoch 1972
Loss = 1.0110e-02, PNorm = 368.1109, GNorm = 0.0433, lr_0 = 9.9897e-04
Validation binary_cross_entropy = 0.088895
Epoch 1973
Loss = 2.1622e-02, PNorm = 368.1869, GNorm = 0.2041, lr_0 = 9.9897e-04
Validation binary_cross_entropy = 0.086303
Epoch 1974
Loss = 6.0735e-02, PNorm = 368.2684, GNorm = 1.5493, lr_0 = 9.9897e-04
Validation binary_cross_entropy = 0.080125
Epoch 1975
Loss = 1.6768e-02, PNorm = 368.3933, GNorm = 0.1935, lr_0 = 9.9896e-04
Validation binary_cross_entropy = 0.066911
Epoch 1976
Loss = 6.1363e-02, PNorm = 368.5338, GNorm = 0.9725, lr_0 = 9.9896e-04
Validation binary_cross_entropy = 0.051044
Epoch 1977
Loss = 1.1625e-02, PNorm = 368.6901, GNorm = 1.0780, lr_0 = 9.9896e-04
Validation binary_cross_entropy = 0.073771
Epoch 1978
Loss = 1.9003e-02, PNorm = 368.8829, GNorm = 0.5896, lr_0 = 9.9896e-04
Validation binary_cross_entropy = 0.346766
Epoch 1979
Loss = 8.8860e-04, PNorm = 369.0498, GNorm = 0.0450, lr_0 = 9.9896e-04
Loss = 5.7555e-02, PNorm = 369.1868, GNorm = 0.0942, lr_0 = 9.9896e-04
Validation binary_cross_entropy = 0.263216
Epoch 1980
Loss = 7.5258e-02, PNorm = 369.3552, GNorm = 2.9086, lr_0 = 9.9896e-04
Validation binary_cross_entropy = 0.081139
Epoch 1981
Loss = 2.2875e-02, PNorm = 369.5246, GNorm = 0.3031, lr_0 = 9.9896e-04
Validation binary_cross_entropy = 0.080571
Epoch 1982
Loss = 2.7346e-02, PNorm = 369.6836, GNorm = 0.7285, lr_0 = 9.9896e-04
Validation binary_cross_entropy = 0.082098
Epoch 1983
Loss = 4.3405e-02, PNorm = 369.8223, GNorm = 0.8173, lr_0 = 9.9896e-04
Validation binary_cross_entropy = 0.086685
Epoch 1984
Loss = 3.9442e-02, PNorm = 369.9157, GNorm = 0.2937, lr_0 = 9.9896e-04
Validation binary_cross_entropy = 0.047817
Epoch 1985
Loss = 2.9274e-02, PNorm = 370.0179, GNorm = 0.4665, lr_0 = 9.9896e-04
Validation binary_cross_entropy = 0.066474
Epoch 1986
Loss = 8.8345e-03, PNorm = 370.1199, GNorm = 1.5496, lr_0 = 9.9896e-04
Validation binary_cross_entropy = 0.069124
Epoch 1987
Loss = 6.3207e-03, PNorm = 370.1947, GNorm = 0.0476, lr_0 = 9.9896e-04
Validation binary_cross_entropy = 0.060304
Epoch 1988
Loss = 4.9828e-03, PNorm = 370.2818, GNorm = 0.0629, lr_0 = 9.9896e-04
Validation binary_cross_entropy = 0.072407
Epoch 1989
Loss = 3.1077e-03, PNorm = 370.3686, GNorm = 0.1038, lr_0 = 9.9895e-04
Loss = 3.0181e-02, PNorm = 370.4379, GNorm = 2.9928, lr_0 = 9.9895e-04
Validation binary_cross_entropy = 0.062451
Epoch 1990
Loss = 3.3872e-02, PNorm = 370.5125, GNorm = 0.5531, lr_0 = 9.9895e-04
Validation binary_cross_entropy = 0.053821
Epoch 1991
Loss = 2.1061e-02, PNorm = 370.5842, GNorm = 0.1749, lr_0 = 9.9895e-04
Validation binary_cross_entropy = 0.053119
Epoch 1992
Loss = 3.0300e-02, PNorm = 370.6492, GNorm = 0.0827, lr_0 = 9.9895e-04
Validation binary_cross_entropy = 0.054563
Epoch 1993
Loss = 2.1984e-02, PNorm = 370.7159, GNorm = 2.4206, lr_0 = 9.9895e-04
Validation binary_cross_entropy = 0.052077
Epoch 1994
Loss = 4.1970e-02, PNorm = 370.7900, GNorm = 1.5417, lr_0 = 9.9895e-04
Validation binary_cross_entropy = 0.054847
Epoch 1995
Loss = 5.8098e-03, PNorm = 370.8632, GNorm = 0.3239, lr_0 = 9.9895e-04
Validation binary_cross_entropy = 0.062044
Epoch 1996
Loss = 1.8657e-02, PNorm = 370.9290, GNorm = 0.2650, lr_0 = 9.9895e-04
Validation binary_cross_entropy = 0.051529
Epoch 1997
Loss = 7.4756e-03, PNorm = 371.0030, GNorm = 0.6764, lr_0 = 9.9895e-04
Validation binary_cross_entropy = 0.075284
Epoch 1998
Loss = 5.2852e-02, PNorm = 371.0730, GNorm = 1.3975, lr_0 = 9.9895e-04
Validation binary_cross_entropy = 0.060742
Epoch 1999
Loss = 7.1429e-03, PNorm = 371.1154, GNorm = 0.2871, lr_0 = 9.9895e-04
Loss = 2.2690e-02, PNorm = 371.1838, GNorm = 0.0808, lr_0 = 9.9895e-04
Validation binary_cross_entropy = 0.065982
Epoch 2000
Loss = 3.2788e-02, PNorm = 371.2644, GNorm = 0.1434, lr_0 = 9.9895e-04
Validation binary_cross_entropy = 0.050440
Epoch 2001
Loss = 4.2173e-02, PNorm = 371.3461, GNorm = 0.7467, lr_0 = 9.9895e-04
Validation binary_cross_entropy = 0.053486
Epoch 2002
Loss = 3.7756e-02, PNorm = 371.4355, GNorm = 0.2491, lr_0 = 9.9894e-04
Validation binary_cross_entropy = 0.046580
Epoch 2003
Loss = 1.6186e-02, PNorm = 371.5271, GNorm = 0.0671, lr_0 = 9.9894e-04
Validation binary_cross_entropy = 0.075339
Epoch 2004
Loss = 2.6660e-02, PNorm = 371.5939, GNorm = 0.1720, lr_0 = 9.9894e-04
Validation binary_cross_entropy = 0.047937
Epoch 2005
Loss = 1.2542e-02, PNorm = 371.6580, GNorm = 0.2306, lr_0 = 9.9894e-04
Validation binary_cross_entropy = 0.058795
Epoch 2006
Loss = 2.3531e-03, PNorm = 371.7362, GNorm = 0.0736, lr_0 = 9.9894e-04
Validation binary_cross_entropy = 0.073260
Epoch 2007
Loss = 7.2959e-04, PNorm = 371.7912, GNorm = 0.0208, lr_0 = 9.9894e-04
Validation binary_cross_entropy = 0.084091
Epoch 2008
Loss = 3.1531e-02, PNorm = 371.8383, GNorm = 4.3447, lr_0 = 9.9894e-04
Validation binary_cross_entropy = 0.085321
Epoch 2009
Loss = 4.1402e-02, PNorm = 371.9177, GNorm = 1.5485, lr_0 = 9.9894e-04
Loss = 3.3393e-02, PNorm = 372.0246, GNorm = 1.8117, lr_0 = 9.9894e-04
Validation binary_cross_entropy = 0.084904
Epoch 2010
Loss = 1.6698e-02, PNorm = 372.1298, GNorm = 0.0398, lr_0 = 9.9894e-04
Validation binary_cross_entropy = 0.072384
Epoch 2011
Loss = 1.1112e-02, PNorm = 372.2246, GNorm = 0.7860, lr_0 = 9.9894e-04
Validation binary_cross_entropy = 0.080910
Epoch 2012
Loss = 4.2912e-03, PNorm = 372.3031, GNorm = 0.0831, lr_0 = 9.9894e-04
Validation binary_cross_entropy = 0.064108
Epoch 2013
Loss = 3.0605e-02, PNorm = 372.3861, GNorm = 0.0539, lr_0 = 9.9894e-04
Validation binary_cross_entropy = 0.063654
Epoch 2014
Loss = 1.8287e-02, PNorm = 372.4811, GNorm = 0.7733, lr_0 = 9.9894e-04
Validation binary_cross_entropy = 0.055982
Epoch 2015
Loss = 5.1269e-02, PNorm = 372.5759, GNorm = 0.3552, lr_0 = 9.9893e-04
Validation binary_cross_entropy = 0.053041
Epoch 2016
Loss = 1.7131e-02, PNorm = 372.6777, GNorm = 0.9605, lr_0 = 9.9893e-04
Validation binary_cross_entropy = 0.086930
Epoch 2017
Loss = 1.2673e-02, PNorm = 372.7720, GNorm = 0.1402, lr_0 = 9.9893e-04
Validation binary_cross_entropy = 0.072943
Epoch 2018
Loss = 1.2101e-02, PNorm = 372.8504, GNorm = 0.0971, lr_0 = 9.9893e-04
Validation binary_cross_entropy = 0.062724
Epoch 2019
Loss = 1.9130e-03, PNorm = 372.9107, GNorm = 0.0810, lr_0 = 9.9893e-04
Loss = 5.3798e-02, PNorm = 373.0016, GNorm = 0.3579, lr_0 = 9.9893e-04
Validation binary_cross_entropy = 0.077057
Epoch 2020
Loss = 2.3495e-02, PNorm = 373.1116, GNorm = 1.1385, lr_0 = 9.9893e-04
Validation binary_cross_entropy = 0.070405
Epoch 2021
Loss = 1.8825e-02, PNorm = 373.2150, GNorm = 0.5425, lr_0 = 9.9893e-04
Validation binary_cross_entropy = 0.068863
Epoch 2022
Loss = 9.4896e-03, PNorm = 373.3075, GNorm = 0.1678, lr_0 = 9.9893e-04
Validation binary_cross_entropy = 0.095605
Epoch 2023
Loss = 1.3131e-02, PNorm = 373.3909, GNorm = 1.9078, lr_0 = 9.9893e-04
Validation binary_cross_entropy = 0.115993
Epoch 2024
Loss = 9.0016e-02, PNorm = 373.4496, GNorm = 7.1142, lr_0 = 9.9893e-04
Validation binary_cross_entropy = 0.073184
Epoch 2025
Loss = 6.6750e-02, PNorm = 373.5992, GNorm = 0.6645, lr_0 = 9.9893e-04
Validation binary_cross_entropy = 0.062165
Epoch 2026
Loss = 2.5747e-02, PNorm = 373.7783, GNorm = 0.8742, lr_0 = 9.9893e-04
Validation binary_cross_entropy = 0.062728
Epoch 2027
Loss = 4.3575e-03, PNorm = 373.9319, GNorm = 0.0211, lr_0 = 9.9893e-04
Validation binary_cross_entropy = 0.119895
Epoch 2028
Loss = 2.2758e-02, PNorm = 374.0318, GNorm = 1.7380, lr_0 = 9.9893e-04
Validation binary_cross_entropy = 0.099641
Epoch 2029
Loss = 4.1432e-02, PNorm = 374.1387, GNorm = 3.8006, lr_0 = 9.9892e-04
Loss = 5.5610e-02, PNorm = 374.2928, GNorm = 0.1430, lr_0 = 9.9892e-04
Validation binary_cross_entropy = 0.147725
Epoch 2030
Loss = 4.8566e-02, PNorm = 374.4477, GNorm = 1.7241, lr_0 = 9.9892e-04
Validation binary_cross_entropy = 0.058097
Epoch 2031
Loss = 3.1132e-02, PNorm = 374.6296, GNorm = 1.1705, lr_0 = 9.9892e-04
Validation binary_cross_entropy = 0.111236
Epoch 2032
Loss = 4.5549e-02, PNorm = 374.7701, GNorm = 1.6222, lr_0 = 9.9892e-04
Validation binary_cross_entropy = 0.057433
Epoch 2033
Loss = 5.0243e-02, PNorm = 374.9135, GNorm = 4.2855, lr_0 = 9.9892e-04
Validation binary_cross_entropy = 0.073597
Epoch 2034
Loss = 2.0384e-02, PNorm = 375.0613, GNorm = 0.6058, lr_0 = 9.9892e-04
Validation binary_cross_entropy = 0.078328
Epoch 2035
Loss = 3.1597e-02, PNorm = 375.1839, GNorm = 0.7657, lr_0 = 9.9892e-04
Validation binary_cross_entropy = 0.060157
Epoch 2036
Loss = 1.9119e-02, PNorm = 375.3221, GNorm = 0.4326, lr_0 = 9.9892e-04
Validation binary_cross_entropy = 0.069917
Epoch 2037
Loss = 1.3226e-02, PNorm = 375.4495, GNorm = 0.4075, lr_0 = 9.9892e-04
Validation binary_cross_entropy = 0.071157
Epoch 2038
Loss = 1.4891e-02, PNorm = 375.5755, GNorm = 0.2599, lr_0 = 9.9892e-04
Validation binary_cross_entropy = 0.091791
Epoch 2039
Loss = 1.2146e-03, PNorm = 375.6721, GNorm = 0.0553, lr_0 = 9.9892e-04
Loss = 5.6281e-02, PNorm = 375.7899, GNorm = 0.5280, lr_0 = 9.9892e-04
Validation binary_cross_entropy = 0.106215
Epoch 2040
Loss = 3.4771e-02, PNorm = 375.9351, GNorm = 0.5388, lr_0 = 9.9892e-04
Validation binary_cross_entropy = 0.065177
Epoch 2041
Loss = 3.6590e-02, PNorm = 376.0739, GNorm = 0.6946, lr_0 = 9.9891e-04
Validation binary_cross_entropy = 0.067097
Epoch 2042
Loss = 1.2775e-02, PNorm = 376.1729, GNorm = 0.1664, lr_0 = 9.9891e-04
Validation binary_cross_entropy = 0.064610
Epoch 2043
Loss = 5.0176e-02, PNorm = 376.2441, GNorm = 2.1539, lr_0 = 9.9891e-04
Validation binary_cross_entropy = 0.073668
Epoch 2044
Loss = 2.1231e-02, PNorm = 376.3381, GNorm = 0.5508, lr_0 = 9.9891e-04
Validation binary_cross_entropy = 0.060032
Epoch 2045
Loss = 5.5951e-03, PNorm = 376.4200, GNorm = 0.1353, lr_0 = 9.9891e-04
Validation binary_cross_entropy = 0.047927
Epoch 2046
Loss = 1.0827e-01, PNorm = 376.4855, GNorm = 0.6017, lr_0 = 9.9891e-04
Validation binary_cross_entropy = 0.046481
Epoch 2047
Loss = 2.4407e-02, PNorm = 376.5754, GNorm = 0.1909, lr_0 = 9.9891e-04
Validation binary_cross_entropy = 0.062343
Epoch 2048
Loss = 1.6470e-03, PNorm = 376.6699, GNorm = 0.0959, lr_0 = 9.9891e-04
Validation binary_cross_entropy = 0.070463
Epoch 2049
Loss = 1.4520e-04, PNorm = 376.7334, GNorm = 0.0079, lr_0 = 9.9891e-04
Loss = 4.8731e-02, PNorm = 376.8072, GNorm = 0.3639, lr_0 = 9.9891e-04
Validation binary_cross_entropy = 0.052122
Epoch 2050
Loss = 1.5997e-02, PNorm = 376.8997, GNorm = 1.4816, lr_0 = 9.9891e-04
Validation binary_cross_entropy = 0.074180
Epoch 2051
Loss = 2.3585e-02, PNorm = 376.9799, GNorm = 0.5714, lr_0 = 9.9891e-04
Validation binary_cross_entropy = 0.058788
Epoch 2052
Loss = 2.7635e-02, PNorm = 377.0785, GNorm = 1.7748, lr_0 = 9.9891e-04
Validation binary_cross_entropy = 0.063094
Epoch 2053
Loss = 2.5101e-03, PNorm = 377.1734, GNorm = 0.0092, lr_0 = 9.9891e-04
Validation binary_cross_entropy = 0.087030
Epoch 2054
Loss = 2.9974e-02, PNorm = 377.2375, GNorm = 0.6002, lr_0 = 9.9891e-04
Validation binary_cross_entropy = 0.075389
Epoch 2055
Loss = 7.7759e-03, PNorm = 377.3076, GNorm = 1.9875, lr_0 = 9.9890e-04
Validation binary_cross_entropy = 0.072969
Epoch 2056
Loss = 1.0225e-02, PNorm = 377.3715, GNorm = 1.4978, lr_0 = 9.9890e-04
Validation binary_cross_entropy = 0.062947
Epoch 2057
Loss = 2.2943e-02, PNorm = 377.4434, GNorm = 2.5338, lr_0 = 9.9890e-04
Validation binary_cross_entropy = 0.079967
Epoch 2058
Loss = 1.4255e-03, PNorm = 377.5415, GNorm = 0.0834, lr_0 = 9.9890e-04
Validation binary_cross_entropy = 0.072366
Epoch 2059
Loss = 3.8528e-03, PNorm = 377.6130, GNorm = 0.1238, lr_0 = 9.9890e-04
Loss = 4.9164e-02, PNorm = 377.6877, GNorm = 0.4575, lr_0 = 9.9890e-04
Validation binary_cross_entropy = 0.056095
Epoch 2060
Loss = 2.2573e-02, PNorm = 377.7960, GNorm = 0.6131, lr_0 = 9.9890e-04
Validation binary_cross_entropy = 0.069139
Epoch 2061
Loss = 4.0023e-02, PNorm = 377.8784, GNorm = 0.0627, lr_0 = 9.9890e-04
Validation binary_cross_entropy = 0.073070
Epoch 2062
Loss = 9.6889e-03, PNorm = 377.9592, GNorm = 0.7208, lr_0 = 9.9890e-04
Validation binary_cross_entropy = 0.068368
Epoch 2063
Loss = 4.2329e-02, PNorm = 378.0410, GNorm = 0.1030, lr_0 = 9.9890e-04
Validation binary_cross_entropy = 0.077729
Epoch 2064
Loss = 2.7012e-02, PNorm = 378.1349, GNorm = 3.0820, lr_0 = 9.9890e-04
Validation binary_cross_entropy = 0.069963
Epoch 2065
Loss = 2.4881e-02, PNorm = 378.2319, GNorm = 0.0343, lr_0 = 9.9890e-04
Validation binary_cross_entropy = 0.065789
Epoch 2066
Loss = 6.4634e-03, PNorm = 378.3237, GNorm = 0.3951, lr_0 = 9.9890e-04
Validation binary_cross_entropy = 0.075590
Epoch 2067
Loss = 3.5134e-02, PNorm = 378.4029, GNorm = 0.0140, lr_0 = 9.9890e-04
Validation binary_cross_entropy = 0.073234
Epoch 2068
Loss = 2.7899e-02, PNorm = 378.4667, GNorm = 0.7336, lr_0 = 9.9889e-04
Validation binary_cross_entropy = 0.057106
Epoch 2069
Loss = 2.2166e-02, PNorm = 378.5461, GNorm = 1.3670, lr_0 = 9.9889e-04
Loss = 5.0562e-02, PNorm = 378.6550, GNorm = 0.0254, lr_0 = 9.9889e-04
Validation binary_cross_entropy = 0.095287
Epoch 2070
Loss = 3.0853e-02, PNorm = 378.7698, GNorm = 1.4170, lr_0 = 9.9889e-04
Validation binary_cross_entropy = 0.064970
Epoch 2071
Loss = 3.4055e-02, PNorm = 378.8750, GNorm = 1.2641, lr_0 = 9.9889e-04
Validation binary_cross_entropy = 0.080600
Epoch 2072
Loss = 3.2242e-02, PNorm = 378.9796, GNorm = 0.6054, lr_0 = 9.9889e-04
Validation binary_cross_entropy = 0.062581
Epoch 2073
Loss = 1.6726e-02, PNorm = 379.0644, GNorm = 1.2373, lr_0 = 9.9889e-04
Validation binary_cross_entropy = 0.062700
Epoch 2074
Loss = 6.0199e-03, PNorm = 379.1325, GNorm = 0.1880, lr_0 = 9.9889e-04
Validation binary_cross_entropy = 0.074136
Epoch 2075
Loss = 4.3940e-03, PNorm = 379.1815, GNorm = 0.0187, lr_0 = 9.9889e-04
Validation binary_cross_entropy = 0.068935
Epoch 2076
Loss = 3.0600e-03, PNorm = 379.2477, GNorm = 0.0329, lr_0 = 9.9889e-04
Validation binary_cross_entropy = 0.079508
Epoch 2077
Loss = 8.4023e-04, PNorm = 379.3109, GNorm = 0.0273, lr_0 = 9.9889e-04
Validation binary_cross_entropy = 0.087802
Epoch 2078
Loss = 2.1116e-02, PNorm = 379.3559, GNorm = 2.0237, lr_0 = 9.9889e-04
Validation binary_cross_entropy = 0.094933
Epoch 2079
Loss = 4.2593e-04, PNorm = 379.3917, GNorm = 0.0218, lr_0 = 9.9889e-04
Loss = 3.2672e-02, PNorm = 379.4392, GNorm = 2.6390, lr_0 = 9.9889e-04
Validation binary_cross_entropy = 0.077945
Epoch 2080
Loss = 4.1960e-02, PNorm = 379.5066, GNorm = 0.1948, lr_0 = 9.9889e-04
Validation binary_cross_entropy = 0.076683
Epoch 2081
Loss = 4.1168e-02, PNorm = 379.6071, GNorm = 0.8747, lr_0 = 9.9888e-04
Validation binary_cross_entropy = 0.065113
Epoch 2082
Loss = 2.5087e-02, PNorm = 379.7076, GNorm = 0.7073, lr_0 = 9.9888e-04
Validation binary_cross_entropy = 0.053622
Epoch 2083
Loss = 1.3271e-02, PNorm = 379.8094, GNorm = 0.4030, lr_0 = 9.9888e-04
Validation binary_cross_entropy = 0.069601
Epoch 2084
Loss = 2.6838e-02, PNorm = 379.8826, GNorm = 1.8018, lr_0 = 9.9888e-04
Validation binary_cross_entropy = 0.084626
Epoch 2085
Loss = 3.4399e-03, PNorm = 379.9367, GNorm = 0.0112, lr_0 = 9.9888e-04
Validation binary_cross_entropy = 0.071111
Epoch 2086
Loss = 2.0866e-02, PNorm = 379.9870, GNorm = 0.2719, lr_0 = 9.9888e-04
Validation binary_cross_entropy = 0.073990
Epoch 2087
Loss = 4.0655e-03, PNorm = 380.0775, GNorm = 0.0441, lr_0 = 9.9888e-04
Validation binary_cross_entropy = 0.092053
Epoch 2088
Loss = 4.4772e-03, PNorm = 380.1539, GNorm = 0.6746, lr_0 = 9.9888e-04
Validation binary_cross_entropy = 0.093246
Epoch 2089
Loss = 3.4190e-04, PNorm = 380.2054, GNorm = 0.0158, lr_0 = 9.9888e-04
Loss = 4.8103e-02, PNorm = 380.2537, GNorm = 0.6897, lr_0 = 9.9888e-04
Validation binary_cross_entropy = 0.069409
Epoch 2090
Loss = 2.3117e-02, PNorm = 380.3351, GNorm = 0.1398, lr_0 = 9.9888e-04
Validation binary_cross_entropy = 0.080386
Epoch 2091
Loss = 2.7154e-02, PNorm = 380.4070, GNorm = 0.3861, lr_0 = 9.9888e-04
Validation binary_cross_entropy = 0.070560
Epoch 2092
Loss = 1.1725e-02, PNorm = 380.4690, GNorm = 0.4886, lr_0 = 9.9888e-04
Validation binary_cross_entropy = 0.070739
Epoch 2093
Loss = 1.2411e-02, PNorm = 380.5344, GNorm = 0.0542, lr_0 = 9.9888e-04
Validation binary_cross_entropy = 0.091805
Epoch 2094
Loss = 7.4693e-02, PNorm = 380.5946, GNorm = 4.2373, lr_0 = 9.9887e-04
Validation binary_cross_entropy = 0.072163
Epoch 2095
Loss = 3.6854e-02, PNorm = 380.6670, GNorm = 0.1494, lr_0 = 9.9887e-04
Validation binary_cross_entropy = 0.063659
Epoch 2096
Loss = 3.6169e-01, PNorm = 380.7523, GNorm = 0.8799, lr_0 = 9.9887e-04
Validation binary_cross_entropy = 0.089067
Epoch 2097
Loss = 1.3343e-02, PNorm = 381.0178, GNorm = 1.0832, lr_0 = 9.9887e-04
Validation binary_cross_entropy = 0.110457
Epoch 2098
Loss = 5.9379e-03, PNorm = 381.2085, GNorm = 0.0042, lr_0 = 9.9887e-04
Validation binary_cross_entropy = 0.139795
Epoch 2099
Loss = 1.3150e-02, PNorm = 381.3108, GNorm = 0.7588, lr_0 = 9.9887e-04
Loss = 1.6647e-01, PNorm = 381.4004, GNorm = 2.7442, lr_0 = 9.9887e-04
Validation binary_cross_entropy = 0.080537
Epoch 2100
Loss = 5.3277e-02, PNorm = 381.5178, GNorm = 0.9205, lr_0 = 9.9887e-04
Validation binary_cross_entropy = 0.060948
Epoch 2101
Loss = 6.1141e-02, PNorm = 381.6413, GNorm = 0.9286, lr_0 = 9.9887e-04
Validation binary_cross_entropy = 0.063086
Epoch 2102
Loss = 3.0079e-02, PNorm = 381.7546, GNorm = 0.2416, lr_0 = 9.9887e-04
Validation binary_cross_entropy = 0.071066
Epoch 2103
Loss = 7.1843e-02, PNorm = 381.8361, GNorm = 2.1333, lr_0 = 9.9887e-04
Validation binary_cross_entropy = 0.063332
Epoch 2104
Loss = 1.5295e-02, PNorm = 381.9285, GNorm = 0.6862, lr_0 = 9.9887e-04
Validation binary_cross_entropy = 0.072428
Epoch 2105
Loss = 6.0702e-02, PNorm = 382.0066, GNorm = 0.1577, lr_0 = 9.9887e-04
Validation binary_cross_entropy = 0.067829
Epoch 2106
Loss = 5.5013e-02, PNorm = 382.0708, GNorm = 0.9384, lr_0 = 9.9887e-04
Validation binary_cross_entropy = 0.064115
Epoch 2107
Loss = 3.5095e-02, PNorm = 382.1413, GNorm = 1.4143, lr_0 = 9.9887e-04
Validation binary_cross_entropy = 0.070591
Epoch 2108
Loss = 3.5116e-03, PNorm = 382.2099, GNorm = 0.1281, lr_0 = 9.9886e-04
Validation binary_cross_entropy = 0.115289
Epoch 2109
Loss = 1.2281e-04, PNorm = 382.2636, GNorm = 0.0077, lr_0 = 9.9886e-04
Loss = 1.6024e-02, PNorm = 382.2942, GNorm = 0.0119, lr_0 = 9.9886e-04
Validation binary_cross_entropy = 0.115782
Epoch 2110
Loss = 2.3175e-02, PNorm = 382.3384, GNorm = 0.2816, lr_0 = 9.9886e-04
Validation binary_cross_entropy = 0.098931
Epoch 2111
Loss = 1.1277e-02, PNorm = 382.3900, GNorm = 1.4590, lr_0 = 9.9886e-04
Validation binary_cross_entropy = 0.090582
Epoch 2112
Loss = 1.7265e-02, PNorm = 382.4558, GNorm = 1.1436, lr_0 = 9.9886e-04
Validation binary_cross_entropy = 0.116720
Epoch 2113
Loss = 2.9854e-02, PNorm = 382.5575, GNorm = 2.8967, lr_0 = 9.9886e-04
Validation binary_cross_entropy = 0.258894
Epoch 2114
Loss = 5.1367e-02, PNorm = 382.6522, GNorm = 0.9381, lr_0 = 9.9886e-04
Validation binary_cross_entropy = 0.110544
Epoch 2115
Loss = 3.8315e-02, PNorm = 382.7717, GNorm = 1.9960, lr_0 = 9.9886e-04
Validation binary_cross_entropy = 0.078444
Epoch 2116
Loss = 5.5881e-02, PNorm = 382.9260, GNorm = 0.3435, lr_0 = 9.9886e-04
Validation binary_cross_entropy = 0.098993
Epoch 2117
Loss = 1.9472e-01, PNorm = 383.0737, GNorm = 4.4526, lr_0 = 9.9886e-04
Validation binary_cross_entropy = 0.069132
Epoch 2118
Loss = 2.8457e-02, PNorm = 383.2059, GNorm = 0.5329, lr_0 = 9.9886e-04
Validation binary_cross_entropy = 0.094469
Epoch 2119
Loss = 3.0948e-02, PNorm = 383.3098, GNorm = 1.1207, lr_0 = 9.9886e-04
Loss = 2.7892e-02, PNorm = 383.3915, GNorm = 0.6125, lr_0 = 9.9886e-04
Validation binary_cross_entropy = 0.105885
Epoch 2120
Loss = 4.4906e-02, PNorm = 383.5031, GNorm = 1.6232, lr_0 = 9.9885e-04
Validation binary_cross_entropy = 0.097880
Epoch 2121
Loss = 8.4269e-02, PNorm = 383.6269, GNorm = 0.4851, lr_0 = 9.9885e-04
Validation binary_cross_entropy = 0.084366
Epoch 2122
Loss = 6.8866e-02, PNorm = 383.7486, GNorm = 3.1461, lr_0 = 9.9885e-04
Validation binary_cross_entropy = 0.059597
Epoch 2123
Loss = 3.6130e-02, PNorm = 383.8781, GNorm = 0.6259, lr_0 = 9.9885e-04
Validation binary_cross_entropy = 0.080186
Epoch 2124
Loss = 5.0757e-02, PNorm = 383.9920, GNorm = 0.4586, lr_0 = 9.9885e-04
Validation binary_cross_entropy = 0.072022
Epoch 2125
Loss = 5.1413e-02, PNorm = 384.1001, GNorm = 0.0790, lr_0 = 9.9885e-04
Validation binary_cross_entropy = 0.072472
Epoch 2126
Loss = 5.4751e-03, PNorm = 384.2027, GNorm = 0.3847, lr_0 = 9.9885e-04
Validation binary_cross_entropy = 0.116180
Epoch 2127
Loss = 2.2277e-02, PNorm = 384.2992, GNorm = 1.4126, lr_0 = 9.9885e-04
Validation binary_cross_entropy = 0.073102
Epoch 2128
Loss = 9.6177e-02, PNorm = 384.4217, GNorm = 0.1607, lr_0 = 9.9885e-04
Validation binary_cross_entropy = 0.071795
Epoch 2129
Loss = 1.9692e-02, PNorm = 384.5825, GNorm = 0.5541, lr_0 = 9.9885e-04
Loss = 2.3509e-02, PNorm = 384.7263, GNorm = 1.0153, lr_0 = 9.9885e-04
Validation binary_cross_entropy = 0.084154
Epoch 2130
Loss = 1.3813e-02, PNorm = 384.8291, GNorm = 0.7386, lr_0 = 9.9885e-04
Validation binary_cross_entropy = 0.087152
Epoch 2131
Loss = 1.5812e-02, PNorm = 384.9052, GNorm = 1.1436, lr_0 = 9.9885e-04
Validation binary_cross_entropy = 0.083247
Epoch 2132
Loss = 4.0135e-02, PNorm = 384.9746, GNorm = 2.5014, lr_0 = 9.9885e-04
Validation binary_cross_entropy = 0.083235
Epoch 2133
Loss = 1.5128e-02, PNorm = 385.0653, GNorm = 0.7181, lr_0 = 9.9885e-04
Validation binary_cross_entropy = 0.119460
Epoch 2134
Loss = 4.0839e-02, PNorm = 385.1497, GNorm = 2.0540, lr_0 = 9.9884e-04
Validation binary_cross_entropy = 0.069834
Epoch 2135
Loss = 4.8204e-02, PNorm = 385.2569, GNorm = 4.5620, lr_0 = 9.9884e-04
Validation binary_cross_entropy = 0.068385
Epoch 2136
Loss = 5.5338e-02, PNorm = 385.4001, GNorm = 1.0797, lr_0 = 9.9884e-04
Validation binary_cross_entropy = 0.080842
Epoch 2137
Loss = 7.1319e-03, PNorm = 385.5235, GNorm = 0.3928, lr_0 = 9.9884e-04
Validation binary_cross_entropy = 0.063979
Epoch 2138
Loss = 6.8748e-02, PNorm = 385.6181, GNorm = 0.8350, lr_0 = 9.9884e-04
Validation binary_cross_entropy = 0.059296
Epoch 2139
Loss = 7.8553e-03, PNorm = 385.7056, GNorm = 0.3092, lr_0 = 9.9884e-04
Loss = 1.2050e-02, PNorm = 385.7891, GNorm = 0.7419, lr_0 = 9.9884e-04
Validation binary_cross_entropy = 0.085346
Epoch 2140
Loss = 2.1260e-02, PNorm = 385.8784, GNorm = 0.0787, lr_0 = 9.9884e-04
Validation binary_cross_entropy = 0.083854
Epoch 2141
Loss = 1.3800e-02, PNorm = 385.9472, GNorm = 0.0332, lr_0 = 9.9884e-04
Validation binary_cross_entropy = 0.089105
Epoch 2142
Loss = 4.1020e-03, PNorm = 386.0114, GNorm = 0.2349, lr_0 = 9.9884e-04
Validation binary_cross_entropy = 0.101704
Epoch 2143
Loss = 5.8951e-02, PNorm = 386.0773, GNorm = 0.1255, lr_0 = 9.9884e-04
Validation binary_cross_entropy = 0.087860
Epoch 2144
Loss = 1.0283e-01, PNorm = 386.1990, GNorm = 2.7594, lr_0 = 9.9884e-04
Validation binary_cross_entropy = 0.049782
Epoch 2145
Loss = 1.5368e-01, PNorm = 386.3818, GNorm = 0.9036, lr_0 = 9.9884e-04
Validation binary_cross_entropy = 0.077566
Epoch 2146
Loss = 5.2792e-02, PNorm = 386.5906, GNorm = 1.6245, lr_0 = 9.9884e-04
Validation binary_cross_entropy = 0.106553
Epoch 2147
Loss = 2.4939e-02, PNorm = 386.7424, GNorm = 1.4937, lr_0 = 9.9883e-04
Validation binary_cross_entropy = 0.096800
Epoch 2148
Loss = 2.6892e-02, PNorm = 386.8496, GNorm = 4.8346, lr_0 = 9.9883e-04
Validation binary_cross_entropy = 0.109371
Epoch 2149
Loss = 2.5312e-01, PNorm = 386.9703, GNorm = 4.9942, lr_0 = 9.9883e-04
Loss = 2.3101e-02, PNorm = 387.0679, GNorm = 0.1938, lr_0 = 9.9883e-04
Validation binary_cross_entropy = 0.076384
Epoch 2150
Loss = 3.5307e-02, PNorm = 387.1787, GNorm = 0.2274, lr_0 = 9.9883e-04
Validation binary_cross_entropy = 0.079050
Epoch 2151
Loss = 3.1043e-02, PNorm = 387.2850, GNorm = 0.1600, lr_0 = 9.9883e-04
Validation binary_cross_entropy = 0.083990
Epoch 2152
Loss = 8.5296e-03, PNorm = 387.3686, GNorm = 1.4005, lr_0 = 9.9883e-04
Validation binary_cross_entropy = 0.102796
Epoch 2153
Loss = 4.7892e-02, PNorm = 387.4495, GNorm = 2.9215, lr_0 = 9.9883e-04
Validation binary_cross_entropy = 0.087305
Epoch 2154
Loss = 7.0000e-02, PNorm = 387.5783, GNorm = 0.0975, lr_0 = 9.9883e-04
Validation binary_cross_entropy = 0.134894
Epoch 2155
Loss = 4.4940e-02, PNorm = 387.7194, GNorm = 1.6001, lr_0 = 9.9883e-04
Validation binary_cross_entropy = 0.068836
Epoch 2156
Loss = 1.2725e-02, PNorm = 387.8412, GNorm = 0.6267, lr_0 = 9.9883e-04
Validation binary_cross_entropy = 0.091727
Epoch 2157
Loss = 1.7243e-02, PNorm = 387.9415, GNorm = 0.6851, lr_0 = 9.9883e-04
Validation binary_cross_entropy = 0.142159
Epoch 2158
Loss = 7.2132e-03, PNorm = 388.0274, GNorm = 0.2133, lr_0 = 9.9883e-04
Validation binary_cross_entropy = 0.080436
Epoch 2159
Loss = 9.5938e-03, PNorm = 388.1003, GNorm = 0.3803, lr_0 = 9.9883e-04
Loss = 8.3132e-02, PNorm = 388.1887, GNorm = 3.8738, lr_0 = 9.9883e-04
Validation binary_cross_entropy = 0.076756
Epoch 2160
Loss = 6.1449e-02, PNorm = 388.3054, GNorm = 0.3568, lr_0 = 9.9882e-04
Validation binary_cross_entropy = 0.061560
Epoch 2161
Loss = 4.0366e-02, PNorm = 388.4211, GNorm = 0.1028, lr_0 = 9.9882e-04
Validation binary_cross_entropy = 0.068428
Epoch 2162
Loss = 4.4584e-02, PNorm = 388.5147, GNorm = 0.0208, lr_0 = 9.9882e-04
Validation binary_cross_entropy = 0.067152
Epoch 2163
Loss = 5.4543e-02, PNorm = 388.5944, GNorm = 0.5462, lr_0 = 9.9882e-04
Validation binary_cross_entropy = 0.058083
Epoch 2164
Loss = 1.8269e-02, PNorm = 388.7035, GNorm = 0.1374, lr_0 = 9.9882e-04
Validation binary_cross_entropy = 0.080160
Epoch 2165
Loss = 9.8015e-03, PNorm = 388.7965, GNorm = 0.0217, lr_0 = 9.9882e-04
Validation binary_cross_entropy = 0.071938
Epoch 2166
Loss = 4.0807e-02, PNorm = 388.8577, GNorm = 0.9815, lr_0 = 9.9882e-04
Validation binary_cross_entropy = 0.064778
Epoch 2167
Loss = 4.7631e-03, PNorm = 388.9313, GNorm = 0.6402, lr_0 = 9.9882e-04
Validation binary_cross_entropy = 0.092331
Epoch 2168
Loss = 1.9809e-03, PNorm = 388.9958, GNorm = 0.0340, lr_0 = 9.9882e-04
Validation binary_cross_entropy = 0.065090
Epoch 2169
Loss = 6.6216e-04, PNorm = 389.0675, GNorm = 0.0280, lr_0 = 9.9882e-04
Loss = 3.2881e-02, PNorm = 389.1544, GNorm = 0.5426, lr_0 = 9.9882e-04
Validation binary_cross_entropy = 0.067854
Epoch 2170
Loss = 1.6524e-02, PNorm = 389.2362, GNorm = 0.0696, lr_0 = 9.9882e-04
Validation binary_cross_entropy = 0.069860
Epoch 2171
Loss = 5.9254e-02, PNorm = 389.3139, GNorm = 0.3649, lr_0 = 9.9882e-04
Validation binary_cross_entropy = 0.054830
Epoch 2172
Loss = 3.9835e-02, PNorm = 389.4297, GNorm = 0.2754, lr_0 = 9.9882e-04
Validation binary_cross_entropy = 0.063075
Epoch 2173
Loss = 2.7996e-02, PNorm = 389.5461, GNorm = 1.7076, lr_0 = 9.9882e-04
Validation binary_cross_entropy = 0.068516
Epoch 2174
Loss = 2.6174e-02, PNorm = 389.6408, GNorm = 3.2154, lr_0 = 9.9881e-04
Validation binary_cross_entropy = 0.092185
Epoch 2175
Loss = 2.6899e-02, PNorm = 389.7369, GNorm = 0.2483, lr_0 = 9.9881e-04
Validation binary_cross_entropy = 0.086026
Epoch 2176
Loss = 1.0197e-02, PNorm = 389.8106, GNorm = 0.7296, lr_0 = 9.9881e-04
Validation binary_cross_entropy = 0.078852
Epoch 2177
Loss = 1.9600e-02, PNorm = 389.8828, GNorm = 0.1266, lr_0 = 9.9881e-04
Validation binary_cross_entropy = 0.088067
Epoch 2178
Loss = 2.4969e-02, PNorm = 389.9559, GNorm = 1.3964, lr_0 = 9.9881e-04
Validation binary_cross_entropy = 0.083765
Epoch 2179
Loss = 2.6655e-02, PNorm = 390.0547, GNorm = 1.0783, lr_0 = 9.9881e-04
Loss = 2.5519e-02, PNorm = 390.1422, GNorm = 0.4720, lr_0 = 9.9881e-04
Validation binary_cross_entropy = 0.069963
Epoch 2180
Loss = 4.8142e-02, PNorm = 390.2310, GNorm = 2.9021, lr_0 = 9.9881e-04
Validation binary_cross_entropy = 0.079687
Epoch 2181
Loss = 2.4902e-02, PNorm = 390.3290, GNorm = 0.2213, lr_0 = 9.9881e-04
Validation binary_cross_entropy = 0.071661
Epoch 2182
Loss = 2.4074e-02, PNorm = 390.4116, GNorm = 0.3243, lr_0 = 9.9881e-04
Validation binary_cross_entropy = 0.071108
Epoch 2183
Loss = 9.7303e-03, PNorm = 390.4828, GNorm = 0.7406, lr_0 = 9.9881e-04
Validation binary_cross_entropy = 0.092473
Epoch 2184
Loss = 3.7174e-03, PNorm = 390.5313, GNorm = 0.0185, lr_0 = 9.9881e-04
Validation binary_cross_entropy = 0.081422
Epoch 2185
Loss = 3.5698e-03, PNorm = 390.5766, GNorm = 0.4802, lr_0 = 9.9881e-04
Validation binary_cross_entropy = 0.084404
Epoch 2186
Loss = 3.7690e-02, PNorm = 390.6177, GNorm = 5.1597, lr_0 = 9.9881e-04
Validation binary_cross_entropy = 0.124439
Epoch 2187
Loss = 3.7625e-02, PNorm = 390.6680, GNorm = 1.3644, lr_0 = 9.9880e-04
Validation binary_cross_entropy = 0.093321
Epoch 2188
Loss = 1.5297e-02, PNorm = 390.7469, GNorm = 0.1403, lr_0 = 9.9880e-04
Validation binary_cross_entropy = 0.072225
Epoch 2189
Loss = 1.6581e-01, PNorm = 390.8823, GNorm = 3.1053, lr_0 = 9.9880e-04
Loss = 4.7136e-02, PNorm = 391.0483, GNorm = 0.6596, lr_0 = 9.9880e-04
Validation binary_cross_entropy = 0.038162
Epoch 2190
Loss = 3.6748e-02, PNorm = 391.2031, GNorm = 1.6019, lr_0 = 9.9880e-04
Validation binary_cross_entropy = 0.083473
Epoch 2191
Loss = 1.8940e-02, PNorm = 391.3084, GNorm = 0.3304, lr_0 = 9.9880e-04
Validation binary_cross_entropy = 0.092377
Epoch 2192
Loss = 1.8535e-02, PNorm = 391.3769, GNorm = 0.0948, lr_0 = 9.9880e-04
Validation binary_cross_entropy = 0.081157
Epoch 2193
Loss = 1.6539e-02, PNorm = 391.4292, GNorm = 0.0642, lr_0 = 9.9880e-04
Validation binary_cross_entropy = 0.099333
Epoch 2194
Loss = 1.0968e-03, PNorm = 391.4810, GNorm = 0.0248, lr_0 = 9.9880e-04
Validation binary_cross_entropy = 0.125834
Epoch 2195
Loss = 8.3369e-03, PNorm = 391.5113, GNorm = 0.0450, lr_0 = 9.9880e-04
Validation binary_cross_entropy = 0.086700
Epoch 2196
Loss = 6.7989e-03, PNorm = 391.5465, GNorm = 0.2196, lr_0 = 9.9880e-04
Validation binary_cross_entropy = 0.075234
Epoch 2197
Loss = 3.1166e-02, PNorm = 391.6125, GNorm = 0.0312, lr_0 = 9.9880e-04
Validation binary_cross_entropy = 0.101905
Epoch 2198
Loss = 6.2101e-03, PNorm = 391.7053, GNorm = 1.1351, lr_0 = 9.9880e-04
Validation binary_cross_entropy = 0.148932
Epoch 2199
Loss = 5.9608e-02, PNorm = 391.7852, GNorm = 2.1861, lr_0 = 9.9880e-04
Loss = 1.3593e-02, PNorm = 391.8510, GNorm = 2.3157, lr_0 = 9.9880e-04
Validation binary_cross_entropy = 0.076665
Epoch 2200
Loss = 4.1916e-02, PNorm = 391.9187, GNorm = 0.0321, lr_0 = 9.9879e-04
Validation binary_cross_entropy = 0.078303
Epoch 2201
Loss = 1.8568e-02, PNorm = 391.9764, GNorm = 0.0373, lr_0 = 9.9879e-04
Validation binary_cross_entropy = 0.079140
Epoch 2202
Loss = 8.0821e-03, PNorm = 392.0221, GNorm = 0.1939, lr_0 = 9.9879e-04
Validation binary_cross_entropy = 0.080973
Epoch 2203
Loss = 6.9113e-03, PNorm = 392.0642, GNorm = 0.3151, lr_0 = 9.9879e-04
Validation binary_cross_entropy = 0.098872
Epoch 2204
Loss = 7.6479e-03, PNorm = 392.1007, GNorm = 0.1087, lr_0 = 9.9879e-04
Validation binary_cross_entropy = 0.095804
Epoch 2205
Loss = 7.8029e-03, PNorm = 392.1344, GNorm = 0.0081, lr_0 = 9.9879e-04
Validation binary_cross_entropy = 0.112192
Epoch 2206
Loss = 7.3182e-03, PNorm = 392.1778, GNorm = 0.0362, lr_0 = 9.9879e-04
Validation binary_cross_entropy = 0.092122
Epoch 2207
Loss = 6.4005e-03, PNorm = 392.2200, GNorm = 0.0021, lr_0 = 9.9879e-04
Validation binary_cross_entropy = 0.154901
Epoch 2208
Loss = 8.9149e-04, PNorm = 392.2793, GNorm = 0.0137, lr_0 = 9.9879e-04
Validation binary_cross_entropy = 0.054126
Epoch 2209
Loss = 1.1104e-01, PNorm = 392.4189, GNorm = 1.8849, lr_0 = 9.9879e-04
Loss = 9.0954e-02, PNorm = 392.6563, GNorm = 0.3029, lr_0 = 9.9879e-04
Validation binary_cross_entropy = 0.086213
Epoch 2210
Loss = 6.3709e-02, PNorm = 392.8513, GNorm = 1.2325, lr_0 = 9.9879e-04
Validation binary_cross_entropy = 0.057749
Epoch 2211
Loss = 6.7577e-02, PNorm = 393.0164, GNorm = 7.9393, lr_0 = 9.9879e-04
Validation binary_cross_entropy = 0.095696
Epoch 2212
Loss = 5.5610e-02, PNorm = 393.1774, GNorm = 2.0280, lr_0 = 9.9879e-04
Validation binary_cross_entropy = 0.069681
Epoch 2213
Loss = 5.0576e-02, PNorm = 393.3258, GNorm = 0.2059, lr_0 = 9.9878e-04
Validation binary_cross_entropy = 0.078423
Epoch 2214
Loss = 1.2856e-02, PNorm = 393.4473, GNorm = 0.1876, lr_0 = 9.9878e-04
Validation binary_cross_entropy = 0.073681
Epoch 2215
Loss = 6.3256e-02, PNorm = 393.5572, GNorm = 2.2327, lr_0 = 9.9878e-04
Validation binary_cross_entropy = 0.085763
Epoch 2216
Loss = 2.5802e-02, PNorm = 393.6724, GNorm = 0.3883, lr_0 = 9.9878e-04
Validation binary_cross_entropy = 0.099291
Epoch 2217
Loss = 1.3163e-02, PNorm = 393.7958, GNorm = 1.6800, lr_0 = 9.9878e-04
Validation binary_cross_entropy = 0.087973
Epoch 2218
Loss = 6.3178e-02, PNorm = 393.8906, GNorm = 0.5772, lr_0 = 9.9878e-04
Validation binary_cross_entropy = 0.087274
Epoch 2219
Loss = 3.2082e-03, PNorm = 393.9868, GNorm = 0.0888, lr_0 = 9.9878e-04
Loss = 1.2460e-02, PNorm = 394.0698, GNorm = 0.1469, lr_0 = 9.9878e-04
Validation binary_cross_entropy = 0.101604
Epoch 2220
Loss = 1.7838e-02, PNorm = 394.1400, GNorm = 0.0092, lr_0 = 9.9878e-04
Validation binary_cross_entropy = 0.109067
Epoch 2221
Loss = 1.0937e-02, PNorm = 394.2163, GNorm = 2.2724, lr_0 = 9.9878e-04
Validation binary_cross_entropy = 0.202108
Epoch 2222
Loss = 8.7408e-03, PNorm = 394.2851, GNorm = 1.9256, lr_0 = 9.9878e-04
Validation binary_cross_entropy = 0.126372
Epoch 2223
Loss = 8.1738e-02, PNorm = 394.3883, GNorm = 0.0343, lr_0 = 9.9878e-04
Validation binary_cross_entropy = 0.236530
Epoch 2224
Loss = 2.0624e-02, PNorm = 394.4974, GNorm = 1.9543, lr_0 = 9.9878e-04
Validation binary_cross_entropy = 0.085418
Epoch 2225
Loss = 8.0607e-02, PNorm = 394.6308, GNorm = 3.7512, lr_0 = 9.9878e-04
Validation binary_cross_entropy = 0.090761
Epoch 2226
Loss = 4.7780e-02, PNorm = 394.8230, GNorm = 0.6166, lr_0 = 9.9878e-04
Validation binary_cross_entropy = 0.081378
Epoch 2227
Loss = 4.2211e-02, PNorm = 394.9692, GNorm = 1.1787, lr_0 = 9.9877e-04
Validation binary_cross_entropy = 0.079880
Epoch 2228
Loss = 4.3730e-02, PNorm = 395.0649, GNorm = 1.5907, lr_0 = 9.9877e-04
Validation binary_cross_entropy = 0.092609
Epoch 2229
Loss = 4.7050e-03, PNorm = 395.1549, GNorm = 0.1978, lr_0 = 9.9877e-04
Loss = 1.9635e-02, PNorm = 395.2369, GNorm = 0.0241, lr_0 = 9.9877e-04
Validation binary_cross_entropy = 0.096397
Epoch 2230
Loss = 7.4483e-02, PNorm = 395.3309, GNorm = 0.3121, lr_0 = 9.9877e-04
Validation binary_cross_entropy = 0.056361
Epoch 2231
Loss = 1.5631e-02, PNorm = 395.4617, GNorm = 0.3667, lr_0 = 9.9877e-04
Validation binary_cross_entropy = 0.118489
Epoch 2232
Loss = 1.4036e-02, PNorm = 395.5502, GNorm = 0.0490, lr_0 = 9.9877e-04
Validation binary_cross_entropy = 0.099866
Epoch 2233
Loss = 6.0147e-03, PNorm = 395.6113, GNorm = 0.2782, lr_0 = 9.9877e-04
Validation binary_cross_entropy = 0.089995
Epoch 2234
Loss = 3.4292e-02, PNorm = 395.6781, GNorm = 0.8403, lr_0 = 9.9877e-04
Validation binary_cross_entropy = 0.075465
Epoch 2235
Loss = 8.0146e-02, PNorm = 395.7709, GNorm = 1.0256, lr_0 = 9.9877e-04
Validation binary_cross_entropy = 0.074998
Epoch 2236
Loss = 9.8402e-03, PNorm = 395.8762, GNorm = 0.1564, lr_0 = 9.9877e-04
Validation binary_cross_entropy = 0.096256
Epoch 2237
Loss = 1.9445e-02, PNorm = 395.9399, GNorm = 0.1696, lr_0 = 9.9877e-04
Validation binary_cross_entropy = 0.066292
Epoch 2238
Loss = 3.9162e-02, PNorm = 396.0193, GNorm = 2.6045, lr_0 = 9.9877e-04
Validation binary_cross_entropy = 0.081869
Epoch 2239
Loss = 1.0129e-02, PNorm = 396.1065, GNorm = 0.4748, lr_0 = 9.9877e-04
Loss = 2.3589e-02, PNorm = 396.1905, GNorm = 0.3261, lr_0 = 9.9876e-04
Validation binary_cross_entropy = 0.073220
Epoch 2240
Loss = 5.7054e-02, PNorm = 396.2716, GNorm = 1.5100, lr_0 = 9.9876e-04
Validation binary_cross_entropy = 0.055264
Epoch 2241
Loss = 3.6800e-02, PNorm = 396.3826, GNorm = 0.2692, lr_0 = 9.9876e-04
Validation binary_cross_entropy = 0.055585
Epoch 2242
Loss = 1.9303e-02, PNorm = 396.4976, GNorm = 1.4280, lr_0 = 9.9876e-04
Validation binary_cross_entropy = 0.074331
Epoch 2243
Loss = 4.9533e-02, PNorm = 396.5983, GNorm = 0.1488, lr_0 = 9.9876e-04
Validation binary_cross_entropy = 0.055476
Epoch 2244
Loss = 4.0608e-02, PNorm = 396.6926, GNorm = 0.2012, lr_0 = 9.9876e-04
Validation binary_cross_entropy = 0.067779
Epoch 2245
Loss = 5.1439e-03, PNorm = 396.7864, GNorm = 0.0782, lr_0 = 9.9876e-04
Validation binary_cross_entropy = 0.077428
Epoch 2246
Loss = 2.5379e-02, PNorm = 396.8703, GNorm = 1.2862, lr_0 = 9.9876e-04
Validation binary_cross_entropy = 0.090521
Epoch 2247
Loss = 2.8880e-03, PNorm = 396.9475, GNorm = 0.5146, lr_0 = 9.9876e-04
Validation binary_cross_entropy = 0.088313
Epoch 2248
Loss = 1.2646e-01, PNorm = 397.0073, GNorm = 4.1631, lr_0 = 9.9876e-04
Validation binary_cross_entropy = 0.098765
Epoch 2249
Loss = 1.1137e-02, PNorm = 397.1150, GNorm = 0.9801, lr_0 = 9.9876e-04
Loss = 1.3125e-02, PNorm = 397.2236, GNorm = 0.2443, lr_0 = 9.9876e-04
Validation binary_cross_entropy = 0.102904
Epoch 2250
Loss = 6.4536e-03, PNorm = 397.2988, GNorm = 0.0016, lr_0 = 9.9876e-04
Validation binary_cross_entropy = 0.123992
Epoch 2251
Loss = 4.7401e-02, PNorm = 397.3674, GNorm = 0.0825, lr_0 = 9.9876e-04
Validation binary_cross_entropy = 0.146454
Epoch 2252
Loss = 1.6570e-02, PNorm = 397.4397, GNorm = 0.0162, lr_0 = 9.9876e-04
Validation binary_cross_entropy = 0.190592
Epoch 2253
Loss = 3.7146e-02, PNorm = 397.5092, GNorm = 0.0236, lr_0 = 9.9875e-04
Validation binary_cross_entropy = 0.094866
Epoch 2254
Loss = 7.0697e-02, PNorm = 397.6027, GNorm = 0.6855, lr_0 = 9.9875e-04
Validation binary_cross_entropy = 0.086446
Epoch 2255
Loss = 5.4959e-02, PNorm = 397.7326, GNorm = 0.8995, lr_0 = 9.9875e-04
Validation binary_cross_entropy = 0.074335
Epoch 2256
Loss = 1.0531e-02, PNorm = 397.8643, GNorm = 0.7492, lr_0 = 9.9875e-04
Validation binary_cross_entropy = 0.210045
Epoch 2257
Loss = 7.0476e-02, PNorm = 397.9995, GNorm = 2.5025, lr_0 = 9.9875e-04
Validation binary_cross_entropy = 0.153505
Epoch 2258
Loss = 5.8133e-02, PNorm = 398.1796, GNorm = 4.2294, lr_0 = 9.9875e-04
Validation binary_cross_entropy = 0.157752
Epoch 2259
Loss = 1.3494e-02, PNorm = 398.3938, GNorm = 0.6163, lr_0 = 9.9875e-04
Loss = 5.1456e-02, PNorm = 398.5863, GNorm = 0.6339, lr_0 = 9.9875e-04
Validation binary_cross_entropy = 0.087741
Epoch 2260
Loss = 5.1089e-02, PNorm = 398.7460, GNorm = 0.4427, lr_0 = 9.9875e-04
Validation binary_cross_entropy = 0.092995
Epoch 2261
Loss = 3.7782e-02, PNorm = 398.8565, GNorm = 1.3497, lr_0 = 9.9875e-04
Validation binary_cross_entropy = 0.094463
Epoch 2262
Loss = 5.8705e-02, PNorm = 398.9350, GNorm = 6.3812, lr_0 = 9.9875e-04
Validation binary_cross_entropy = 0.091481
Epoch 2263
Loss = 3.8675e-02, PNorm = 399.0329, GNorm = 2.1532, lr_0 = 9.9875e-04
Validation binary_cross_entropy = 0.092868
Epoch 2264
Loss = 3.0214e-02, PNorm = 399.1249, GNorm = 0.6337, lr_0 = 9.9875e-04
Validation binary_cross_entropy = 0.118134
Epoch 2265
Loss = 3.8156e-02, PNorm = 399.2054, GNorm = 1.3282, lr_0 = 9.9875e-04
Validation binary_cross_entropy = 0.081289
Epoch 2266
Loss = 5.5612e-02, PNorm = 399.2820, GNorm = 0.9518, lr_0 = 9.9874e-04
Validation binary_cross_entropy = 0.098654
Epoch 2267
Loss = 1.0157e-01, PNorm = 399.3611, GNorm = 0.0696, lr_0 = 9.9874e-04
Validation binary_cross_entropy = 0.095722
Epoch 2268
Loss = 1.5290e-02, PNorm = 399.4411, GNorm = 0.7682, lr_0 = 9.9874e-04
Validation binary_cross_entropy = 0.087777
Epoch 2269
Loss = 5.9272e-02, PNorm = 399.5055, GNorm = 0.7312, lr_0 = 9.9874e-04
Loss = 3.6643e-02, PNorm = 399.5975, GNorm = 0.5688, lr_0 = 9.9874e-04
Validation binary_cross_entropy = 0.097072
Epoch 2270
Loss = 2.5058e-02, PNorm = 399.6966, GNorm = 0.4839, lr_0 = 9.9874e-04
Validation binary_cross_entropy = 0.107985
Epoch 2271
Loss = 7.5892e-02, PNorm = 399.7739, GNorm = 8.5542, lr_0 = 9.9874e-04
Validation binary_cross_entropy = 0.101425
Epoch 2272
Loss = 1.5172e-02, PNorm = 399.8610, GNorm = 0.1216, lr_0 = 9.9874e-04
Validation binary_cross_entropy = 0.118206
Epoch 2273
Loss = 3.8953e-02, PNorm = 399.9355, GNorm = 1.4650, lr_0 = 9.9874e-04
Validation binary_cross_entropy = 0.104973
Epoch 2274
Loss = 2.6320e-02, PNorm = 400.0129, GNorm = 0.9175, lr_0 = 9.9874e-04
Validation binary_cross_entropy = 0.086511
Epoch 2275
Loss = 2.6796e-02, PNorm = 400.0985, GNorm = 0.4326, lr_0 = 9.9874e-04
Validation binary_cross_entropy = 0.089486
Epoch 2276
Loss = 1.0051e-01, PNorm = 400.1936, GNorm = 2.5151, lr_0 = 9.9874e-04
Validation binary_cross_entropy = 0.061272
Epoch 2277
Loss = 3.7020e-02, PNorm = 400.2747, GNorm = 0.4059, lr_0 = 9.9874e-04
Validation binary_cross_entropy = 0.064749
Epoch 2278
Loss = 5.7602e-02, PNorm = 400.3614, GNorm = 0.8149, lr_0 = 9.9874e-04
Validation binary_cross_entropy = 0.101852
Epoch 2279
Loss = 1.5675e-02, PNorm = 400.4584, GNorm = 0.4034, lr_0 = 9.9874e-04
Loss = 2.4621e-02, PNorm = 400.5280, GNorm = 0.1915, lr_0 = 9.9873e-04
Validation binary_cross_entropy = 0.083739
Epoch 2280
Loss = 1.3957e-02, PNorm = 400.5942, GNorm = 0.4830, lr_0 = 9.9873e-04
Validation binary_cross_entropy = 0.104695
Epoch 2281
Loss = 2.5891e-02, PNorm = 400.6522, GNorm = 1.6243, lr_0 = 9.9873e-04
Validation binary_cross_entropy = 0.094397
Epoch 2282
Loss = 3.0948e-02, PNorm = 400.7290, GNorm = 0.3804, lr_0 = 9.9873e-04
Validation binary_cross_entropy = 0.080409
Epoch 2283
Loss = 1.8433e-02, PNorm = 400.8394, GNorm = 0.1620, lr_0 = 9.9873e-04
Validation binary_cross_entropy = 0.104737
Epoch 2284
Loss = 2.9363e-03, PNorm = 400.9518, GNorm = 0.0847, lr_0 = 9.9873e-04
Validation binary_cross_entropy = 0.090348
Epoch 2285
Loss = 1.5265e-02, PNorm = 401.0287, GNorm = 3.3101, lr_0 = 9.9873e-04
Validation binary_cross_entropy = 0.063325
Epoch 2286
Loss = 2.7763e-02, PNorm = 401.1239, GNorm = 0.3810, lr_0 = 9.9873e-04
Validation binary_cross_entropy = 0.087678
Epoch 2287
Loss = 5.5831e-02, PNorm = 401.2395, GNorm = 0.8904, lr_0 = 9.9873e-04
Validation binary_cross_entropy = 0.069424
Epoch 2288
Loss = 1.7080e-02, PNorm = 401.3566, GNorm = 0.1881, lr_0 = 9.9873e-04
Validation binary_cross_entropy = 0.057638
Epoch 2289
Loss = 3.9938e-02, PNorm = 401.4869, GNorm = 0.6783, lr_0 = 9.9873e-04
Loss = 2.9778e-02, PNorm = 401.5967, GNorm = 0.0774, lr_0 = 9.9873e-04
Validation binary_cross_entropy = 0.069524
Epoch 2290
Loss = 4.3776e-02, PNorm = 401.6778, GNorm = 0.1127, lr_0 = 9.9873e-04
Validation binary_cross_entropy = 0.051615
Epoch 2291
Loss = 5.0262e-02, PNorm = 401.7869, GNorm = 1.9635, lr_0 = 9.9873e-04
Validation binary_cross_entropy = 0.104176
Epoch 2292
Loss = 3.9492e-02, PNorm = 401.9144, GNorm = 1.3787, lr_0 = 9.9872e-04
Validation binary_cross_entropy = 0.078772
Epoch 2293
Loss = 3.0945e-02, PNorm = 402.0129, GNorm = 0.5209, lr_0 = 9.9872e-04
Validation binary_cross_entropy = 0.060849
Epoch 2294
Loss = 5.7000e-02, PNorm = 402.1193, GNorm = 0.4141, lr_0 = 9.9872e-04
Validation binary_cross_entropy = 0.059892
Epoch 2295
Loss = 1.5971e-02, PNorm = 402.2484, GNorm = 0.0857, lr_0 = 9.9872e-04
Validation binary_cross_entropy = 0.076921
Epoch 2296
Loss = 4.4535e-02, PNorm = 402.3480, GNorm = 1.4104, lr_0 = 9.9872e-04
Validation binary_cross_entropy = 0.071349
Epoch 2297
Loss = 8.3857e-03, PNorm = 402.4458, GNorm = 0.2534, lr_0 = 9.9872e-04
Validation binary_cross_entropy = 0.083006
Epoch 2298
Loss = 3.4821e-02, PNorm = 402.5275, GNorm = 0.2089, lr_0 = 9.9872e-04
Validation binary_cross_entropy = 0.069341
Epoch 2299
Loss = 4.9608e-02, PNorm = 402.5924, GNorm = 1.7107, lr_0 = 9.9872e-04
Loss = 3.6909e-02, PNorm = 402.6524, GNorm = 0.0472, lr_0 = 9.9872e-04
Validation binary_cross_entropy = 0.062120
Epoch 2300
Loss = 1.6086e-02, PNorm = 402.7248, GNorm = 0.0780, lr_0 = 9.9872e-04
Validation binary_cross_entropy = 0.080510
Epoch 2301
Loss = 4.1092e-02, PNorm = 402.7942, GNorm = 1.3041, lr_0 = 9.9872e-04
Validation binary_cross_entropy = 0.066169
Epoch 2302
Loss = 3.2947e-02, PNorm = 402.8694, GNorm = 1.8962, lr_0 = 9.9872e-04
Validation binary_cross_entropy = 0.061472
Epoch 2303
Loss = 1.7058e-02, PNorm = 402.9470, GNorm = 0.4167, lr_0 = 9.9872e-04
Validation binary_cross_entropy = 0.054630
Epoch 2304
Loss = 1.8634e-02, PNorm = 403.0320, GNorm = 0.1400, lr_0 = 9.9872e-04
Validation binary_cross_entropy = 0.078626
Epoch 2305
Loss = 2.5420e-02, PNorm = 403.1069, GNorm = 0.3062, lr_0 = 9.9872e-04
Validation binary_cross_entropy = 0.069728
Epoch 2306
Loss = 3.1769e-03, PNorm = 403.1742, GNorm = 0.0755, lr_0 = 9.9871e-04
Validation binary_cross_entropy = 0.060776
Epoch 2307
Loss = 5.9897e-02, PNorm = 403.2579, GNorm = 2.5722, lr_0 = 9.9871e-04
Validation binary_cross_entropy = 0.059028
Epoch 2308
Loss = 1.0497e-02, PNorm = 403.3596, GNorm = 0.5366, lr_0 = 9.9871e-04
Validation binary_cross_entropy = 0.056683
Epoch 2309
Loss = 2.4660e-03, PNorm = 403.4425, GNorm = 0.0825, lr_0 = 9.9871e-04
Loss = 4.6245e-02, PNorm = 403.5197, GNorm = 1.8574, lr_0 = 9.9871e-04
Validation binary_cross_entropy = 0.042556
Epoch 2310
Loss = 1.7188e-02, PNorm = 403.6149, GNorm = 0.2441, lr_0 = 9.9871e-04
Validation binary_cross_entropy = 0.079260
Epoch 2311
Loss = 9.5278e-03, PNorm = 403.6807, GNorm = 0.3449, lr_0 = 9.9871e-04
Validation binary_cross_entropy = 0.082026
Epoch 2312
Loss = 2.7997e-02, PNorm = 403.7306, GNorm = 1.3749, lr_0 = 9.9871e-04
Validation binary_cross_entropy = 0.061989
Epoch 2313
Loss = 3.1148e-02, PNorm = 403.8019, GNorm = 1.7094, lr_0 = 9.9871e-04
Validation binary_cross_entropy = 0.060413
Epoch 2314
Loss = 7.0377e-02, PNorm = 403.8633, GNorm = 0.7430, lr_0 = 9.9871e-04
Validation binary_cross_entropy = 0.041150
Epoch 2315
Loss = 2.5088e-02, PNorm = 403.9309, GNorm = 0.1568, lr_0 = 9.9871e-04
Validation binary_cross_entropy = 0.046889
Epoch 2316
Loss = 3.3880e-02, PNorm = 404.0169, GNorm = 0.1523, lr_0 = 9.9871e-04
Validation binary_cross_entropy = 0.066980
Epoch 2317
Loss = 3.2076e-02, PNorm = 404.0780, GNorm = 3.5900, lr_0 = 9.9871e-04
Validation binary_cross_entropy = 0.062218
Epoch 2318
Loss = 6.6682e-03, PNorm = 404.1340, GNorm = 0.0285, lr_0 = 9.9871e-04
Validation binary_cross_entropy = 0.065221
Epoch 2319
Loss = 4.9105e-03, PNorm = 404.1993, GNorm = 0.2393, lr_0 = 9.9871e-04
Loss = 3.4908e-02, PNorm = 404.2569, GNorm = 0.1021, lr_0 = 9.9870e-04
Validation binary_cross_entropy = 0.061085
Epoch 2320
Loss = 7.8661e-03, PNorm = 404.3189, GNorm = 0.2616, lr_0 = 9.9870e-04
Validation binary_cross_entropy = 0.063754
Epoch 2321
Loss = 1.3322e-02, PNorm = 404.3785, GNorm = 0.1544, lr_0 = 9.9870e-04
Validation binary_cross_entropy = 0.066695
Epoch 2322
Loss = 2.8114e-02, PNorm = 404.4236, GNorm = 1.7495, lr_0 = 9.9870e-04
Validation binary_cross_entropy = 0.058944
Epoch 2323
Loss = 3.2404e-02, PNorm = 404.4871, GNorm = 0.1101, lr_0 = 9.9870e-04
Validation binary_cross_entropy = 0.068448
Epoch 2324
Loss = 4.4742e-03, PNorm = 404.5553, GNorm = 0.0261, lr_0 = 9.9870e-04
Validation binary_cross_entropy = 0.075199
Epoch 2325
Loss = 1.5294e-03, PNorm = 404.6093, GNorm = 0.0998, lr_0 = 9.9870e-04
Validation binary_cross_entropy = 0.061042
Epoch 2326
Loss = 8.4392e-03, PNorm = 404.6622, GNorm = 0.5666, lr_0 = 9.9870e-04
Validation binary_cross_entropy = 0.071095
Epoch 2327
Loss = 1.4912e-03, PNorm = 404.7302, GNorm = 0.0690, lr_0 = 9.9870e-04
Validation binary_cross_entropy = 0.071458
Epoch 2328
Loss = 1.6363e-02, PNorm = 404.7940, GNorm = 0.5389, lr_0 = 9.9870e-04
Validation binary_cross_entropy = 0.080300
Epoch 2329
Loss = 2.9977e-02, PNorm = 404.8717, GNorm = 1.8611, lr_0 = 9.9870e-04
Loss = 5.5054e-03, PNorm = 404.9348, GNorm = 0.0263, lr_0 = 9.9870e-04
Validation binary_cross_entropy = 0.082559
Epoch 2330
Loss = 3.5017e-02, PNorm = 404.9891, GNorm = 0.3062, lr_0 = 9.9870e-04
Validation binary_cross_entropy = 0.058716
Epoch 2331
Loss = 2.8338e-02, PNorm = 405.0636, GNorm = 0.3179, lr_0 = 9.9870e-04
Validation binary_cross_entropy = 0.063116
Epoch 2332
Loss = 9.2332e-03, PNorm = 405.1302, GNorm = 0.1902, lr_0 = 9.9869e-04
Validation binary_cross_entropy = 0.087775
Epoch 2333
Loss = 3.5796e-02, PNorm = 405.1880, GNorm = 0.3213, lr_0 = 9.9869e-04
Validation binary_cross_entropy = 0.065781
Epoch 2334
Loss = 4.1998e-02, PNorm = 405.2542, GNorm = 0.1210, lr_0 = 9.9869e-04
Validation binary_cross_entropy = 0.058029
Epoch 2335
Loss = 1.9363e-02, PNorm = 405.3190, GNorm = 0.2247, lr_0 = 9.9869e-04
Validation binary_cross_entropy = 0.077288
Epoch 2336
Loss = 1.2131e-02, PNorm = 405.3839, GNorm = 0.1254, lr_0 = 9.9869e-04
Validation binary_cross_entropy = 0.085374
Epoch 2337
Loss = 3.8514e-02, PNorm = 405.4314, GNorm = 1.2142, lr_0 = 9.9869e-04
Validation binary_cross_entropy = 0.068685
Epoch 2338
Loss = 1.3745e-03, PNorm = 405.4866, GNorm = 0.0797, lr_0 = 9.9869e-04
Validation binary_cross_entropy = 0.054003
Epoch 2339
Loss = 3.7533e-03, PNorm = 405.5597, GNorm = 0.1089, lr_0 = 9.9869e-04
Loss = 1.5651e-02, PNorm = 405.6761, GNorm = 0.0673, lr_0 = 9.9869e-04
Validation binary_cross_entropy = 0.092038
Epoch 2340
Loss = 1.4823e-02, PNorm = 405.7558, GNorm = 1.0576, lr_0 = 9.9869e-04
Validation binary_cross_entropy = 0.148730
Epoch 2341
Loss = 1.2196e-02, PNorm = 405.8063, GNorm = 0.0491, lr_0 = 9.9869e-04
Validation binary_cross_entropy = 0.073068
Epoch 2342
Loss = 2.7850e-02, PNorm = 405.8703, GNorm = 3.9343, lr_0 = 9.9869e-04
Validation binary_cross_entropy = 0.081369
Epoch 2343
Loss = 3.4238e-02, PNorm = 405.9572, GNorm = 0.9887, lr_0 = 9.9869e-04
Validation binary_cross_entropy = 0.055625
Epoch 2344
Loss = 3.1624e-02, PNorm = 406.0442, GNorm = 1.0008, lr_0 = 9.9869e-04
Validation binary_cross_entropy = 0.053026
Epoch 2345
Loss = 1.2656e-02, PNorm = 406.1720, GNorm = 0.1918, lr_0 = 9.9869e-04
Validation binary_cross_entropy = 0.105688
Epoch 2346
Loss = 3.5727e-02, PNorm = 406.2713, GNorm = 2.3088, lr_0 = 9.9868e-04
Validation binary_cross_entropy = 0.078170
Epoch 2347
Loss = 2.6718e-03, PNorm = 406.3341, GNorm = 0.2246, lr_0 = 9.9868e-04
Validation binary_cross_entropy = 0.068546
Epoch 2348
Loss = 2.1336e-02, PNorm = 406.3841, GNorm = 0.2460, lr_0 = 9.9868e-04
Validation binary_cross_entropy = 0.060181
Epoch 2349
Loss = 1.4620e-01, PNorm = 406.4656, GNorm = 1.5841, lr_0 = 9.9868e-04
Loss = 2.2597e-02, PNorm = 406.5634, GNorm = 0.0956, lr_0 = 9.9868e-04
Validation binary_cross_entropy = 0.061679
Epoch 2350
Loss = 1.3609e-02, PNorm = 406.6413, GNorm = 0.7612, lr_0 = 9.9868e-04
Validation binary_cross_entropy = 0.068739
Epoch 2351
Loss = 1.3912e-02, PNorm = 406.7006, GNorm = 0.0927, lr_0 = 9.9868e-04
Validation binary_cross_entropy = 0.069665
Epoch 2352
Loss = 2.6341e-03, PNorm = 406.7527, GNorm = 1.0599, lr_0 = 9.9868e-04
Validation binary_cross_entropy = 0.077342
Epoch 2353
Loss = 1.5617e-02, PNorm = 406.7880, GNorm = 0.0879, lr_0 = 9.9868e-04
Validation binary_cross_entropy = 0.070025
Epoch 2354
Loss = 2.6540e-02, PNorm = 406.8379, GNorm = 0.1494, lr_0 = 9.9868e-04
Validation binary_cross_entropy = 0.071835
Epoch 2355
Loss = 8.4361e-03, PNorm = 406.9016, GNorm = 2.2613, lr_0 = 9.9868e-04
Validation binary_cross_entropy = 0.080567
Epoch 2356
Loss = 3.2779e-02, PNorm = 406.9472, GNorm = 3.2903, lr_0 = 9.9868e-04
Validation binary_cross_entropy = 0.065004
Epoch 2357
Loss = 3.5460e-03, PNorm = 407.0015, GNorm = 0.1025, lr_0 = 9.9868e-04
Validation binary_cross_entropy = 0.067907
Epoch 2358
Loss = 6.5782e-02, PNorm = 407.0842, GNorm = 1.4630, lr_0 = 9.9868e-04
Validation binary_cross_entropy = 0.075609
Epoch 2359
Loss = 1.5254e-03, PNorm = 407.2340, GNorm = 0.0491, lr_0 = 9.9867e-04
Loss = 3.2773e-02, PNorm = 407.3859, GNorm = 0.2950, lr_0 = 9.9867e-04
Validation binary_cross_entropy = 0.061630
Epoch 2360
Loss = 4.3554e-02, PNorm = 407.5160, GNorm = 1.3337, lr_0 = 9.9867e-04
Validation binary_cross_entropy = 0.102005
Epoch 2361
Loss = 7.1267e-02, PNorm = 407.6712, GNorm = 0.2837, lr_0 = 9.9867e-04
Validation binary_cross_entropy = 0.056664
Epoch 2362
Loss = 2.9344e-02, PNorm = 407.8339, GNorm = 0.1620, lr_0 = 9.9867e-04
Validation binary_cross_entropy = 0.058454
Epoch 2363
Loss = 4.3046e-02, PNorm = 407.9568, GNorm = 0.6099, lr_0 = 9.9867e-04
Validation binary_cross_entropy = 0.074163
Epoch 2364
Loss = 1.1166e-02, PNorm = 408.0557, GNorm = 0.0191, lr_0 = 9.9867e-04
Validation binary_cross_entropy = 0.124166
Epoch 2365
Loss = 4.6166e-02, PNorm = 408.1177, GNorm = 1.3610, lr_0 = 9.9867e-04
Validation binary_cross_entropy = 0.082212
Epoch 2366
Loss = 2.3597e-02, PNorm = 408.1736, GNorm = 0.2957, lr_0 = 9.9867e-04
Validation binary_cross_entropy = 0.073198
Epoch 2367
Loss = 8.9435e-03, PNorm = 408.2726, GNorm = 0.0777, lr_0 = 9.9867e-04
Validation binary_cross_entropy = 0.096039
Epoch 2368
Loss = 2.4618e-02, PNorm = 408.4379, GNorm = 0.8951, lr_0 = 9.9867e-04
Validation binary_cross_entropy = 0.092238
Epoch 2369
Loss = 1.7429e-02, PNorm = 408.5824, GNorm = 0.5078, lr_0 = 9.9867e-04
Loss = 2.6760e-02, PNorm = 408.7081, GNorm = 0.2830, lr_0 = 9.9867e-04
Validation binary_cross_entropy = 0.074248
Epoch 2370
Loss = 2.5085e-02, PNorm = 408.8094, GNorm = 2.6817, lr_0 = 9.9867e-04
Validation binary_cross_entropy = 0.152585
Epoch 2371
Loss = 2.8263e-02, PNorm = 408.8796, GNorm = 1.1363, lr_0 = 9.9867e-04
Validation binary_cross_entropy = 0.074505
Epoch 2372
Loss = 4.0446e-02, PNorm = 409.0034, GNorm = 0.0390, lr_0 = 9.9866e-04
Validation binary_cross_entropy = 0.095349
Epoch 2373
Loss = 2.6184e-02, PNorm = 409.1136, GNorm = 0.0721, lr_0 = 9.9866e-04
Validation binary_cross_entropy = 0.090622
Epoch 2374
Loss = 3.9641e-02, PNorm = 409.1917, GNorm = 0.3833, lr_0 = 9.9866e-04
Validation binary_cross_entropy = 0.068504
Epoch 2375
Loss = 4.3045e-02, PNorm = 409.2585, GNorm = 3.1043, lr_0 = 9.9866e-04
Validation binary_cross_entropy = 0.074421
Epoch 2376
Loss = 9.6057e-03, PNorm = 409.3673, GNorm = 0.0406, lr_0 = 9.9866e-04
Validation binary_cross_entropy = 0.097067
Epoch 2377
Loss = 8.3307e-04, PNorm = 409.4425, GNorm = 0.0598, lr_0 = 9.9866e-04
Validation binary_cross_entropy = 0.094615
Epoch 2378
Loss = 5.6752e-04, PNorm = 409.5016, GNorm = 0.0366, lr_0 = 9.9866e-04
Validation binary_cross_entropy = 0.068639
Epoch 2379
Loss = 4.6657e-02, PNorm = 409.5545, GNorm = 1.4321, lr_0 = 9.9866e-04
Loss = 2.6890e-02, PNorm = 409.6318, GNorm = 0.7050, lr_0 = 9.9866e-04
Validation binary_cross_entropy = 0.057434
Epoch 2380
Loss = 3.0555e-02, PNorm = 409.7144, GNorm = 1.3077, lr_0 = 9.9866e-04
Validation binary_cross_entropy = 0.075425
Epoch 2381
Loss = 2.8308e-02, PNorm = 409.7766, GNorm = 0.8120, lr_0 = 9.9866e-04
Validation binary_cross_entropy = 0.067218
Epoch 2382
Loss = 1.2113e-02, PNorm = 409.8311, GNorm = 0.1360, lr_0 = 9.9866e-04
Validation binary_cross_entropy = 0.072058
Epoch 2383
Loss = 5.7829e-02, PNorm = 409.8814, GNorm = 0.1655, lr_0 = 9.9866e-04
Validation binary_cross_entropy = 0.061794
Epoch 2384
Loss = 2.1132e-02, PNorm = 409.9480, GNorm = 0.9432, lr_0 = 9.9866e-04
Validation binary_cross_entropy = 0.068028
Epoch 2385
Loss = 5.7444e-02, PNorm = 410.0062, GNorm = 0.2115, lr_0 = 9.9865e-04
Validation binary_cross_entropy = 0.061127
Epoch 2386
Loss = 1.9814e-02, PNorm = 410.0506, GNorm = 0.6352, lr_0 = 9.9865e-04
Validation binary_cross_entropy = 0.062969
Epoch 2387
Loss = 4.3260e-03, PNorm = 410.1225, GNorm = 0.1023, lr_0 = 9.9865e-04
Validation binary_cross_entropy = 0.071360
Epoch 2388
Loss = 1.4825e-02, PNorm = 410.1759, GNorm = 0.9822, lr_0 = 9.9865e-04
Validation binary_cross_entropy = 0.072995
Epoch 2389
Loss = 3.0954e-02, PNorm = 410.2299, GNorm = 1.5716, lr_0 = 9.9865e-04
Loss = 2.8832e-02, PNorm = 410.2768, GNorm = 0.3353, lr_0 = 9.9865e-04
Validation binary_cross_entropy = 0.072448
Epoch 2390
Loss = 4.6385e-02, PNorm = 410.3498, GNorm = 3.9673, lr_0 = 9.9865e-04
Validation binary_cross_entropy = 0.078363
Epoch 2391
Loss = 2.0168e-02, PNorm = 410.4516, GNorm = 0.9720, lr_0 = 9.9865e-04
Validation binary_cross_entropy = 0.105810
Epoch 2392
Loss = 2.2930e-02, PNorm = 410.5253, GNorm = 0.0205, lr_0 = 9.9865e-04
Validation binary_cross_entropy = 0.074699
Epoch 2393
Loss = 3.1907e-02, PNorm = 410.5973, GNorm = 0.1285, lr_0 = 9.9865e-04
Validation binary_cross_entropy = 0.110779
Epoch 2394
Loss = 4.1191e-02, PNorm = 410.6795, GNorm = 0.2055, lr_0 = 9.9865e-04
Validation binary_cross_entropy = 0.095636
Epoch 2395
Loss = 1.9082e-02, PNorm = 410.7647, GNorm = 0.5640, lr_0 = 9.9865e-04
Validation binary_cross_entropy = 0.070865
Epoch 2396
Loss = 9.7523e-03, PNorm = 410.8557, GNorm = 0.1836, lr_0 = 9.9865e-04
Validation binary_cross_entropy = 0.076191
Epoch 2397
Loss = 2.8847e-03, PNorm = 410.9320, GNorm = 0.1289, lr_0 = 9.9865e-04
Validation binary_cross_entropy = 0.077948
Epoch 2398
Loss = 1.3865e-01, PNorm = 410.9948, GNorm = 2.8986, lr_0 = 9.9865e-04
Validation binary_cross_entropy = 0.085360
Epoch 2399
Loss = 3.9645e-02, PNorm = 411.0735, GNorm = 0.8855, lr_0 = 9.9864e-04
Loss = 3.6267e-02, PNorm = 411.1645, GNorm = 0.4184, lr_0 = 9.9864e-04
Validation binary_cross_entropy = 0.084581
Epoch 2400
Loss = 2.5320e-02, PNorm = 411.2605, GNorm = 1.2232, lr_0 = 9.9864e-04
Validation binary_cross_entropy = 0.097515
Epoch 2401
Loss = 3.4841e-02, PNorm = 411.3477, GNorm = 0.6206, lr_0 = 9.9864e-04
Validation binary_cross_entropy = 0.063058
Epoch 2402
Loss = 2.4678e-02, PNorm = 411.4358, GNorm = 0.2827, lr_0 = 9.9864e-04
Validation binary_cross_entropy = 0.069187
Epoch 2403
Loss = 1.8556e-02, PNorm = 411.5196, GNorm = 0.1186, lr_0 = 9.9864e-04
Validation binary_cross_entropy = 0.091582
Epoch 2404
Loss = 5.4226e-03, PNorm = 411.5882, GNorm = 0.4547, lr_0 = 9.9864e-04
Validation binary_cross_entropy = 0.103402
Epoch 2405
Loss = 9.6918e-03, PNorm = 411.6302, GNorm = 0.0064, lr_0 = 9.9864e-04
Validation binary_cross_entropy = 0.081462
Epoch 2406
Loss = 5.0852e-02, PNorm = 411.6853, GNorm = 1.1073, lr_0 = 9.9864e-04
Validation binary_cross_entropy = 0.080275
Epoch 2407
Loss = 5.0367e-02, PNorm = 411.7955, GNorm = 0.3083, lr_0 = 9.9864e-04
Validation binary_cross_entropy = 0.093437
Epoch 2408
Loss = 2.0002e-03, PNorm = 411.8914, GNorm = 0.1241, lr_0 = 9.9864e-04
Validation binary_cross_entropy = 0.089190
Epoch 2409
Loss = 6.8504e-03, PNorm = 411.9536, GNorm = 0.1902, lr_0 = 9.9864e-04
Loss = 1.7112e-02, PNorm = 412.0115, GNorm = 1.1526, lr_0 = 9.9864e-04
Validation binary_cross_entropy = 0.075044
Epoch 2410
Loss = 4.0981e-02, PNorm = 412.0920, GNorm = 0.1702, lr_0 = 9.9864e-04
Validation binary_cross_entropy = 0.076624
Epoch 2411
Loss = 1.5856e-02, PNorm = 412.1978, GNorm = 0.0954, lr_0 = 9.9863e-04
Validation binary_cross_entropy = 0.090117
Epoch 2412
Loss = 6.0755e-02, PNorm = 412.3035, GNorm = 0.3595, lr_0 = 9.9863e-04
Validation binary_cross_entropy = 0.076129
Epoch 2413
Loss = 4.9490e-02, PNorm = 412.3893, GNorm = 0.5881, lr_0 = 9.9863e-04
Validation binary_cross_entropy = 0.068357
Epoch 2414
Loss = 6.9464e-03, PNorm = 412.4884, GNorm = 0.0739, lr_0 = 9.9863e-04
Validation binary_cross_entropy = 0.070510
Epoch 2415
Loss = 1.6252e-02, PNorm = 412.5705, GNorm = 0.0178, lr_0 = 9.9863e-04
Validation binary_cross_entropy = 0.073290
Epoch 2416
Loss = 1.8715e-02, PNorm = 412.6309, GNorm = 0.2124, lr_0 = 9.9863e-04
Validation binary_cross_entropy = 0.079750
Epoch 2417
Loss = 2.1378e-02, PNorm = 412.6972, GNorm = 0.2954, lr_0 = 9.9863e-04
Validation binary_cross_entropy = 0.055973
Epoch 2418
Loss = 2.8418e-02, PNorm = 412.7657, GNorm = 0.1652, lr_0 = 9.9863e-04
Validation binary_cross_entropy = 0.061480
Epoch 2419
Loss = 2.3696e-02, PNorm = 412.8699, GNorm = 0.8462, lr_0 = 9.9863e-04
Loss = 3.5387e-02, PNorm = 412.9650, GNorm = 0.2606, lr_0 = 9.9863e-04
Validation binary_cross_entropy = 0.050535
Epoch 2420
Loss = 1.3870e-02, PNorm = 413.0505, GNorm = 0.4196, lr_0 = 9.9863e-04
Validation binary_cross_entropy = 0.057334
Epoch 2421
Loss = 1.6737e-02, PNorm = 413.1231, GNorm = 0.0372, lr_0 = 9.9863e-04
Validation binary_cross_entropy = 0.061590
Epoch 2422
Loss = 1.5439e-02, PNorm = 413.1912, GNorm = 0.0019, lr_0 = 9.9863e-04
Validation binary_cross_entropy = 0.112497
Epoch 2423
Loss = 4.2284e-03, PNorm = 413.2478, GNorm = 1.3589, lr_0 = 9.9863e-04
Validation binary_cross_entropy = 0.111756
Epoch 2424
Loss = 3.0409e-03, PNorm = 413.2970, GNorm = 0.0963, lr_0 = 9.9863e-04
Validation binary_cross_entropy = 0.090640
Epoch 2425
Loss = 6.6464e-02, PNorm = 413.3384, GNorm = 0.3804, lr_0 = 9.9862e-04
Validation binary_cross_entropy = 0.062083
Epoch 2426
Loss = 2.0756e-02, PNorm = 413.4061, GNorm = 0.0376, lr_0 = 9.9862e-04
Validation binary_cross_entropy = 0.073748
Epoch 2427
Loss = 2.1553e-02, PNorm = 413.4904, GNorm = 0.8772, lr_0 = 9.9862e-04
Validation binary_cross_entropy = 0.058661
Epoch 2428
Loss = 8.2307e-03, PNorm = 413.5766, GNorm = 0.0768, lr_0 = 9.9862e-04
Validation binary_cross_entropy = 0.069478
Epoch 2429
Loss = 3.1634e-02, PNorm = 413.6645, GNorm = 0.4357, lr_0 = 9.9862e-04
Loss = 2.3076e-02, PNorm = 413.7383, GNorm = 0.9782, lr_0 = 9.9862e-04
Validation binary_cross_entropy = 0.063596
Epoch 2430
Loss = 3.4738e-02, PNorm = 413.8002, GNorm = 0.2229, lr_0 = 9.9862e-04
Validation binary_cross_entropy = 0.048726
Epoch 2431
Loss = 3.1884e-02, PNorm = 413.8742, GNorm = 0.2519, lr_0 = 9.9862e-04
Validation binary_cross_entropy = 0.056698
Epoch 2432
Loss = 1.9130e-02, PNorm = 413.9330, GNorm = 0.4064, lr_0 = 9.9862e-04
Validation binary_cross_entropy = 0.056048
Epoch 2433
Loss = 6.4205e-03, PNorm = 413.9946, GNorm = 0.1914, lr_0 = 9.9862e-04
Validation binary_cross_entropy = 0.063883
Epoch 2434
Loss = 2.3323e-02, PNorm = 414.0396, GNorm = 0.0471, lr_0 = 9.9862e-04
Validation binary_cross_entropy = 0.068816
Epoch 2435
Loss = 4.7889e-03, PNorm = 414.0892, GNorm = 0.1383, lr_0 = 9.9862e-04
Validation binary_cross_entropy = 0.069988
Epoch 2436
Loss = 6.3994e-02, PNorm = 414.1274, GNorm = 1.1480, lr_0 = 9.9862e-04
Validation binary_cross_entropy = 0.063591
Epoch 2437
Loss = 4.5379e-03, PNorm = 414.1741, GNorm = 0.0397, lr_0 = 9.9862e-04
Validation binary_cross_entropy = 0.064800
Epoch 2438
Loss = 3.6250e-03, PNorm = 414.2115, GNorm = 0.3261, lr_0 = 9.9861e-04
Validation binary_cross_entropy = 0.068208
Epoch 2439
Loss = 2.3757e-03, PNorm = 414.2500, GNorm = 0.1230, lr_0 = 9.9861e-04
Loss = 1.6211e-02, PNorm = 414.2902, GNorm = 0.2466, lr_0 = 9.9861e-04
Validation binary_cross_entropy = 0.074067
Epoch 2440
Loss = 3.3120e-02, PNorm = 414.3255, GNorm = 0.3061, lr_0 = 9.9861e-04
Validation binary_cross_entropy = 0.058458
Epoch 2441
Loss = 2.5958e-02, PNorm = 414.3764, GNorm = 0.0818, lr_0 = 9.9861e-04
Validation binary_cross_entropy = 0.053437
Epoch 2442
Loss = 1.7790e-02, PNorm = 414.4563, GNorm = 0.1329, lr_0 = 9.9861e-04
Validation binary_cross_entropy = 0.055507
Epoch 2443
Loss = 1.0488e-02, PNorm = 414.5370, GNorm = 0.8246, lr_0 = 9.9861e-04
Validation binary_cross_entropy = 0.080463
Epoch 2444
Loss = 5.2612e-02, PNorm = 414.5985, GNorm = 1.8126, lr_0 = 9.9861e-04
Validation binary_cross_entropy = 0.056799
Epoch 2445
Loss = 5.8519e-03, PNorm = 414.6691, GNorm = 0.3274, lr_0 = 9.9861e-04
Validation binary_cross_entropy = 0.057842
Epoch 2446
Loss = 2.6892e-02, PNorm = 414.7344, GNorm = 0.6338, lr_0 = 9.9861e-04
Validation binary_cross_entropy = 0.065405
Epoch 2447
Loss = 5.3270e-03, PNorm = 414.7990, GNorm = 0.0066, lr_0 = 9.9861e-04
Validation binary_cross_entropy = 0.103963
Epoch 2448
Loss = 3.6040e-02, PNorm = 414.8504, GNorm = 2.3412, lr_0 = 9.9861e-04
Validation binary_cross_entropy = 0.078488
Epoch 2449
Loss = 5.1946e-04, PNorm = 414.8922, GNorm = 0.0227, lr_0 = 9.9861e-04
Loss = 1.4495e-02, PNorm = 414.9316, GNorm = 6.6064, lr_0 = 9.9861e-04
Validation binary_cross_entropy = 0.071624
Epoch 2450
Loss = 5.8619e-02, PNorm = 414.9938, GNorm = 0.0123, lr_0 = 9.9861e-04
Validation binary_cross_entropy = 0.071669
Epoch 2451
Loss = 1.1621e-02, PNorm = 415.0659, GNorm = 0.0851, lr_0 = 9.9860e-04
Validation binary_cross_entropy = 0.048662
Epoch 2452
Loss = 6.9975e-03, PNorm = 415.1366, GNorm = 0.0713, lr_0 = 9.9860e-04
Validation binary_cross_entropy = 0.054336
Epoch 2453
Loss = 3.5899e-02, PNorm = 415.1987, GNorm = 0.4791, lr_0 = 9.9860e-04
Validation binary_cross_entropy = 0.062472
Epoch 2454
Loss = 6.4057e-03, PNorm = 415.2538, GNorm = 0.5254, lr_0 = 9.9860e-04
Validation binary_cross_entropy = 0.056468
Epoch 2455
Loss = 4.8080e-03, PNorm = 415.3033, GNorm = 0.2587, lr_0 = 9.9860e-04
Validation binary_cross_entropy = 0.059368
Epoch 2456
Loss = 1.1532e-02, PNorm = 415.3588, GNorm = 0.7281, lr_0 = 9.9860e-04
Validation binary_cross_entropy = 0.115045
Epoch 2457
Loss = 2.8729e-02, PNorm = 415.4165, GNorm = 0.0723, lr_0 = 9.9860e-04
Validation binary_cross_entropy = 0.068988
Epoch 2458
Loss = 5.9051e-02, PNorm = 415.4620, GNorm = 3.0386, lr_0 = 9.9860e-04
Validation binary_cross_entropy = 0.057419
Epoch 2459
Loss = 2.6944e-03, PNorm = 415.5093, GNorm = 0.1161, lr_0 = 9.9860e-04
Loss = 2.3059e-02, PNorm = 415.5710, GNorm = 1.2232, lr_0 = 9.9860e-04
Validation binary_cross_entropy = 0.060193
Epoch 2460
Loss = 2.7183e-02, PNorm = 415.6449, GNorm = 0.4430, lr_0 = 9.9860e-04
Validation binary_cross_entropy = 0.052614
Epoch 2461
Loss = 8.8886e-03, PNorm = 415.7266, GNorm = 0.2461, lr_0 = 9.9860e-04
Validation binary_cross_entropy = 0.069428
Epoch 2462
Loss = 1.2442e-02, PNorm = 415.7839, GNorm = 2.6830, lr_0 = 9.9860e-04
Validation binary_cross_entropy = 0.084179
Epoch 2463
Loss = 1.6749e-02, PNorm = 415.8369, GNorm = 1.5680, lr_0 = 9.9860e-04
Validation binary_cross_entropy = 0.078042
Epoch 2464
Loss = 1.3961e-02, PNorm = 415.9192, GNorm = 0.9475, lr_0 = 9.9859e-04
Validation binary_cross_entropy = 0.075707
Epoch 2465
Loss = 1.9792e-02, PNorm = 415.9806, GNorm = 1.5365, lr_0 = 9.9859e-04
Validation binary_cross_entropy = 0.070979
Epoch 2466
Loss = 9.7534e-03, PNorm = 416.0439, GNorm = 0.8240, lr_0 = 9.9859e-04
Validation binary_cross_entropy = 0.076486
Epoch 2467
Loss = 2.9250e-02, PNorm = 416.1016, GNorm = 0.6987, lr_0 = 9.9859e-04
Validation binary_cross_entropy = 0.061419
Epoch 2468
Loss = 2.3970e-03, PNorm = 416.1670, GNorm = 0.0413, lr_0 = 9.9859e-04
Validation binary_cross_entropy = 0.065604
Epoch 2469
Loss = 1.5327e-03, PNorm = 416.2250, GNorm = 0.0443, lr_0 = 9.9859e-04
Loss = 2.0059e-02, PNorm = 416.2901, GNorm = 0.4781, lr_0 = 9.9859e-04
Validation binary_cross_entropy = 0.048761
Epoch 2470
Loss = 1.3123e-02, PNorm = 416.3616, GNorm = 0.0870, lr_0 = 9.9859e-04
Validation binary_cross_entropy = 0.054316
Epoch 2471
Loss = 1.3094e-02, PNorm = 416.4252, GNorm = 0.0422, lr_0 = 9.9859e-04
Validation binary_cross_entropy = 0.063181
Epoch 2472
Loss = 2.0771e-02, PNorm = 416.4767, GNorm = 9.6721, lr_0 = 9.9859e-04
Validation binary_cross_entropy = 0.059189
Epoch 2473
Loss = 1.7757e-02, PNorm = 416.5908, GNorm = 1.8774, lr_0 = 9.9859e-04
Validation binary_cross_entropy = 0.085228
Epoch 2474
Loss = 2.3159e-02, PNorm = 416.7089, GNorm = 0.1748, lr_0 = 9.9859e-04
Validation binary_cross_entropy = 0.062983
Epoch 2475
Loss = 2.6455e-02, PNorm = 416.8223, GNorm = 0.3557, lr_0 = 9.9859e-04
Validation binary_cross_entropy = 0.058104
Epoch 2476
Loss = 2.7292e-02, PNorm = 416.9308, GNorm = 1.7072, lr_0 = 9.9859e-04
Validation binary_cross_entropy = 0.053788
Epoch 2477
Loss = 5.0062e-02, PNorm = 417.0427, GNorm = 0.0298, lr_0 = 9.9859e-04
Validation binary_cross_entropy = 0.054541
Epoch 2478
Loss = 1.8813e-02, PNorm = 417.1402, GNorm = 0.8748, lr_0 = 9.9858e-04
Validation binary_cross_entropy = 0.052165
Epoch 2479
Loss = 1.7125e-02, PNorm = 417.2270, GNorm = 0.3212, lr_0 = 9.9858e-04
Loss = 2.5708e-02, PNorm = 417.3082, GNorm = 1.7768, lr_0 = 9.9858e-04
Validation binary_cross_entropy = 0.051576
Epoch 2480
Loss = 2.9627e-02, PNorm = 417.4000, GNorm = 0.0400, lr_0 = 9.9858e-04
Validation binary_cross_entropy = 0.068413
Epoch 2481
Loss = 4.0003e-02, PNorm = 417.4809, GNorm = 0.6929, lr_0 = 9.9858e-04
Validation binary_cross_entropy = 0.047192
Epoch 2482
Loss = 1.4898e-02, PNorm = 417.5729, GNorm = 0.3518, lr_0 = 9.9858e-04
Validation binary_cross_entropy = 0.056937
Epoch 2483
Loss = 3.1001e-02, PNorm = 417.6515, GNorm = 0.8227, lr_0 = 9.9858e-04
Validation binary_cross_entropy = 0.056609
Epoch 2484
Loss = 4.8273e-02, PNorm = 417.7177, GNorm = 0.0982, lr_0 = 9.9858e-04
Validation binary_cross_entropy = 0.050723
Epoch 2485
Loss = 7.7271e-02, PNorm = 417.7847, GNorm = 0.2973, lr_0 = 9.9858e-04
Validation binary_cross_entropy = 0.054693
Epoch 2486
Loss = 1.3275e-02, PNorm = 417.8826, GNorm = 0.4449, lr_0 = 9.9858e-04
Validation binary_cross_entropy = 0.053227
Epoch 2487
Loss = 4.2973e-03, PNorm = 417.9668, GNorm = 0.3265, lr_0 = 9.9858e-04
Validation binary_cross_entropy = 0.051672
Epoch 2488
Loss = 4.5923e-02, PNorm = 418.0407, GNorm = 2.3049, lr_0 = 9.9858e-04
Validation binary_cross_entropy = 0.048122
Epoch 2489
Loss = 2.5934e-02, PNorm = 418.1400, GNorm = 0.7757, lr_0 = 9.9858e-04
Loss = 4.9255e-02, PNorm = 418.2455, GNorm = 2.8239, lr_0 = 9.9858e-04
Validation binary_cross_entropy = 0.047368
Epoch 2490
Loss = 2.8427e-02, PNorm = 418.3574, GNorm = 0.5573, lr_0 = 9.9858e-04
Validation binary_cross_entropy = 0.078105
Epoch 2491
Loss = 4.8786e-02, PNorm = 418.4563, GNorm = 0.2444, lr_0 = 9.9857e-04
Validation binary_cross_entropy = 0.055247
Epoch 2492
Loss = 3.9075e-02, PNorm = 418.5468, GNorm = 1.4780, lr_0 = 9.9857e-04
Validation binary_cross_entropy = 0.046947
Epoch 2493
Loss = 1.0861e-02, PNorm = 418.6480, GNorm = 0.4289, lr_0 = 9.9857e-04
Validation binary_cross_entropy = 0.061329
Epoch 2494
Loss = 2.3474e-02, PNorm = 418.7203, GNorm = 1.5542, lr_0 = 9.9857e-04
Validation binary_cross_entropy = 0.061710
Epoch 2495
Loss = 5.8334e-03, PNorm = 418.7837, GNorm = 0.2910, lr_0 = 9.9857e-04
Validation binary_cross_entropy = 0.047464
Epoch 2496
Loss = 4.3716e-03, PNorm = 418.8393, GNorm = 0.1317, lr_0 = 9.9857e-04
Validation binary_cross_entropy = 0.048375
Epoch 2497
Loss = 9.4706e-03, PNorm = 418.9028, GNorm = 0.2675, lr_0 = 9.9857e-04
Validation binary_cross_entropy = 0.083732
Epoch 2498
Loss = 4.8003e-02, PNorm = 418.9669, GNorm = 0.6417, lr_0 = 9.9857e-04
Validation binary_cross_entropy = 0.058312
Epoch 2499
Loss = 1.3894e-03, PNorm = 419.0230, GNorm = 0.0705, lr_0 = 9.9857e-04
Loss = 3.5757e-02, PNorm = 419.0921, GNorm = 0.9534, lr_0 = 9.9857e-04
Validation binary_cross_entropy = 0.052674
Epoch 2500
Loss = 4.8251e-02, PNorm = 419.1685, GNorm = 0.1764, lr_0 = 9.9857e-04
Validation binary_cross_entropy = 0.046443
Epoch 2501
Loss = 1.9737e-02, PNorm = 419.2488, GNorm = 0.1298, lr_0 = 9.9857e-04
Validation binary_cross_entropy = 0.050706
Epoch 2502
Loss = 2.6919e-02, PNorm = 419.3375, GNorm = 1.3046, lr_0 = 9.9857e-04
Validation binary_cross_entropy = 0.057714
Epoch 2503
Loss = 1.7852e-02, PNorm = 419.4156, GNorm = 1.5222, lr_0 = 9.9857e-04
Validation binary_cross_entropy = 0.056179
Epoch 2504
Loss = 8.0688e-03, PNorm = 419.4733, GNorm = 0.7449, lr_0 = 9.9856e-04
Validation binary_cross_entropy = 0.061641
Epoch 2505
Loss = 3.1720e-03, PNorm = 419.5280, GNorm = 0.0343, lr_0 = 9.9856e-04
Validation binary_cross_entropy = 0.095508
Epoch 2506
Loss = 3.7505e-04, PNorm = 419.5684, GNorm = 0.0474, lr_0 = 9.9856e-04
Validation binary_cross_entropy = 0.086662
Epoch 2507
Loss = 1.0379e-01, PNorm = 419.5985, GNorm = 3.2040, lr_0 = 9.9856e-04
Validation binary_cross_entropy = 0.055947
Epoch 2508
Loss = 1.4968e-02, PNorm = 419.7256, GNorm = 0.0627, lr_0 = 9.9856e-04
Validation binary_cross_entropy = 0.089474
Epoch 2509
Loss = 2.8259e-02, PNorm = 419.8618, GNorm = 0.7650, lr_0 = 9.9856e-04
Loss = 4.6943e-02, PNorm = 419.9855, GNorm = 1.5647, lr_0 = 9.9856e-04
Validation binary_cross_entropy = 0.059187
Epoch 2510
Loss = 2.3724e-02, PNorm = 420.0942, GNorm = 0.6874, lr_0 = 9.9856e-04
Validation binary_cross_entropy = 0.061868
Epoch 2511
Loss = 4.4416e-02, PNorm = 420.2096, GNorm = 0.4106, lr_0 = 9.9856e-04
Validation binary_cross_entropy = 0.058277
Epoch 2512
Loss = 2.9771e-02, PNorm = 420.3268, GNorm = 1.0644, lr_0 = 9.9856e-04
Validation binary_cross_entropy = 0.059116
Epoch 2513
Loss = 2.6177e-02, PNorm = 420.4233, GNorm = 0.1646, lr_0 = 9.9856e-04
Validation binary_cross_entropy = 0.058903
Epoch 2514
Loss = 3.1188e-02, PNorm = 420.5202, GNorm = 0.0981, lr_0 = 9.9856e-04
Validation binary_cross_entropy = 0.072848
Epoch 2515
Loss = 3.1289e-02, PNorm = 420.6150, GNorm = 2.4536, lr_0 = 9.9856e-04
Validation binary_cross_entropy = 0.071890
Epoch 2516
Loss = 1.4781e-02, PNorm = 420.6885, GNorm = 0.8039, lr_0 = 9.9856e-04
Validation binary_cross_entropy = 0.052456
Epoch 2517
Loss = 3.2449e-02, PNorm = 420.7704, GNorm = 2.0254, lr_0 = 9.9856e-04
Validation binary_cross_entropy = 0.060559
Epoch 2518
Loss = 9.8567e-03, PNorm = 420.8560, GNorm = 0.3138, lr_0 = 9.9855e-04
Validation binary_cross_entropy = 0.050184
Epoch 2519
Loss = 4.7558e-03, PNorm = 420.9239, GNorm = 0.1095, lr_0 = 9.9855e-04
Loss = 1.2141e-02, PNorm = 420.9912, GNorm = 0.1154, lr_0 = 9.9855e-04
Validation binary_cross_entropy = 0.047155
Epoch 2520
Loss = 2.9821e-02, PNorm = 421.0554, GNorm = 1.0358, lr_0 = 9.9855e-04
Validation binary_cross_entropy = 0.057275
Epoch 2521
Loss = 2.7388e-02, PNorm = 421.1401, GNorm = 0.4538, lr_0 = 9.9855e-04
Validation binary_cross_entropy = 0.072185
Epoch 2522
Loss = 1.1371e-02, PNorm = 421.2135, GNorm = 0.4308, lr_0 = 9.9855e-04
Validation binary_cross_entropy = 0.076240
Epoch 2523
Loss = 7.5216e-02, PNorm = 421.2681, GNorm = 0.0921, lr_0 = 9.9855e-04
Validation binary_cross_entropy = 0.048445
Epoch 2524
Loss = 4.4551e-02, PNorm = 421.3732, GNorm = 0.4494, lr_0 = 9.9855e-04
Validation binary_cross_entropy = 0.058127
Epoch 2525
Loss = 1.7309e-02, PNorm = 421.4879, GNorm = 0.4114, lr_0 = 9.9855e-04
Validation binary_cross_entropy = 0.072276
Epoch 2526
Loss = 5.7549e-03, PNorm = 421.5824, GNorm = 0.4345, lr_0 = 9.9855e-04
Validation binary_cross_entropy = 0.088038
Epoch 2527
Loss = 5.2027e-03, PNorm = 421.6594, GNorm = 0.3741, lr_0 = 9.9855e-04
Validation binary_cross_entropy = 0.091767
Epoch 2528
Loss = 3.0526e-02, PNorm = 421.7242, GNorm = 0.2757, lr_0 = 9.9855e-04
Validation binary_cross_entropy = 0.076054
Epoch 2529
Loss = 1.4572e-03, PNorm = 421.8056, GNorm = 0.0345, lr_0 = 9.9855e-04
Loss = 1.5397e-02, PNorm = 421.8787, GNorm = 1.1829, lr_0 = 9.9855e-04
Validation binary_cross_entropy = 0.096725
Epoch 2530
Loss = 2.5662e-02, PNorm = 421.9344, GNorm = 0.2781, lr_0 = 9.9854e-04
Validation binary_cross_entropy = 0.066525
Epoch 2531
Loss = 7.2364e-03, PNorm = 421.9957, GNorm = 0.5714, lr_0 = 9.9854e-04
Validation binary_cross_entropy = 0.083412
Epoch 2532
Loss = 1.6705e-02, PNorm = 422.0450, GNorm = 1.2497, lr_0 = 9.9854e-04
Validation binary_cross_entropy = 0.102760
Epoch 2533
Loss = 2.2289e-03, PNorm = 422.0965, GNorm = 0.1374, lr_0 = 9.9854e-04
Validation binary_cross_entropy = 0.052637
Epoch 2534
Loss = 6.9885e-02, PNorm = 422.2766, GNorm = 0.4081, lr_0 = 9.9854e-04
Validation binary_cross_entropy = 0.139990
Epoch 2535
Loss = 7.0131e-02, PNorm = 422.4541, GNorm = 2.8756, lr_0 = 9.9854e-04
Validation binary_cross_entropy = 0.051989
Epoch 2536
Loss = 8.8182e-02, PNorm = 422.6012, GNorm = 2.0637, lr_0 = 9.9854e-04
Validation binary_cross_entropy = 0.070389
Epoch 2537
Loss = 1.2387e-02, PNorm = 422.7372, GNorm = 0.1338, lr_0 = 9.9854e-04
Validation binary_cross_entropy = 0.075047
Epoch 2538
Loss = 1.4681e-02, PNorm = 422.8333, GNorm = 0.7488, lr_0 = 9.9854e-04
Validation binary_cross_entropy = 0.073067
Epoch 2539
Loss = 3.8922e-03, PNorm = 422.9239, GNorm = 0.1193, lr_0 = 9.9854e-04
Loss = 2.6812e-02, PNorm = 422.9943, GNorm = 0.4251, lr_0 = 9.9854e-04
Validation binary_cross_entropy = 0.072286
Epoch 2540
Loss = 2.7730e-02, PNorm = 423.0635, GNorm = 0.4606, lr_0 = 9.9854e-04
Validation binary_cross_entropy = 0.054491
Epoch 2541
Loss = 2.8705e-02, PNorm = 423.1368, GNorm = 1.3942, lr_0 = 9.9854e-04
Validation binary_cross_entropy = 0.071637
Epoch 2542
Loss = 2.3657e-02, PNorm = 423.2090, GNorm = 3.5213, lr_0 = 9.9854e-04
Validation binary_cross_entropy = 0.078752
Epoch 2543
Loss = 1.1115e-02, PNorm = 423.2691, GNorm = 0.0510, lr_0 = 9.9854e-04
Validation binary_cross_entropy = 0.074056
Epoch 2544
Loss = 2.1994e-02, PNorm = 423.3205, GNorm = 0.5561, lr_0 = 9.9853e-04
Validation binary_cross_entropy = 0.076056
Epoch 2545
Loss = 6.0896e-02, PNorm = 423.3758, GNorm = 1.6709, lr_0 = 9.9853e-04
Validation binary_cross_entropy = 0.071332
Epoch 2546
Loss = 3.6588e-02, PNorm = 423.4448, GNorm = 0.7258, lr_0 = 9.9853e-04
Validation binary_cross_entropy = 0.077792
Epoch 2547
Loss = 2.1922e-02, PNorm = 423.5055, GNorm = 1.3766, lr_0 = 9.9853e-04
Validation binary_cross_entropy = 0.065610
Epoch 2548
Loss = 1.7774e-02, PNorm = 423.5601, GNorm = 0.7382, lr_0 = 9.9853e-04
Validation binary_cross_entropy = 0.067491
Epoch 2549
Loss = 1.9739e-03, PNorm = 423.6286, GNorm = 0.0536, lr_0 = 9.9853e-04
Loss = 3.5148e-02, PNorm = 423.7016, GNorm = 1.0684, lr_0 = 9.9853e-04
Validation binary_cross_entropy = 0.060745
Epoch 2550
Loss = 4.0263e-02, PNorm = 423.7653, GNorm = 0.7905, lr_0 = 9.9853e-04
Validation binary_cross_entropy = 0.066879
Epoch 2551
Loss = 3.6397e-02, PNorm = 423.8274, GNorm = 0.5686, lr_0 = 9.9853e-04
Validation binary_cross_entropy = 0.053331
Epoch 2552
Loss = 1.8462e-02, PNorm = 423.8871, GNorm = 0.5008, lr_0 = 9.9853e-04
Validation binary_cross_entropy = 0.059528
Epoch 2553
Loss = 5.4283e-02, PNorm = 423.9358, GNorm = 0.8676, lr_0 = 9.9853e-04
Validation binary_cross_entropy = 0.056421
Epoch 2554
Loss = 2.3711e-02, PNorm = 423.9938, GNorm = 0.9385, lr_0 = 9.9853e-04
Validation binary_cross_entropy = 0.058327
Epoch 2555
Loss = 4.5307e-03, PNorm = 424.0545, GNorm = 0.4069, lr_0 = 9.9853e-04
Validation binary_cross_entropy = 0.077774
Epoch 2556
Loss = 1.0845e-02, PNorm = 424.1016, GNorm = 0.0487, lr_0 = 9.9853e-04
Validation binary_cross_entropy = 0.084155
Epoch 2557
Loss = 1.4803e-02, PNorm = 424.1441, GNorm = 1.1460, lr_0 = 9.9852e-04
Validation binary_cross_entropy = 0.057697
Epoch 2558
Loss = 3.0690e-03, PNorm = 424.2132, GNorm = 0.3217, lr_0 = 9.9852e-04
Validation binary_cross_entropy = 0.051242
Epoch 2559
Loss = 6.2351e-03, PNorm = 424.3202, GNorm = 0.1928, lr_0 = 9.9852e-04
Loss = 2.9940e-02, PNorm = 424.4186, GNorm = 0.8331, lr_0 = 9.9852e-04
Validation binary_cross_entropy = 0.058046
Epoch 2560
Loss = 2.3906e-02, PNorm = 424.5106, GNorm = 1.3263, lr_0 = 9.9852e-04
Validation binary_cross_entropy = 0.057298
Epoch 2561
Loss = 8.5618e-02, PNorm = 424.6180, GNorm = 1.2689, lr_0 = 9.9852e-04
Validation binary_cross_entropy = 0.073026
Epoch 2562
Loss = 3.3682e-02, PNorm = 424.7353, GNorm = 0.1591, lr_0 = 9.9852e-04
Validation binary_cross_entropy = 0.052894
Epoch 2563
Loss = 1.5859e-02, PNorm = 424.8422, GNorm = 0.3893, lr_0 = 9.9852e-04
Validation binary_cross_entropy = 0.054616
Epoch 2564
Loss = 7.5950e-03, PNorm = 424.9278, GNorm = 0.1341, lr_0 = 9.9852e-04
Validation binary_cross_entropy = 0.074901
Epoch 2565
Loss = 3.7313e-02, PNorm = 424.9926, GNorm = 1.1524, lr_0 = 9.9852e-04
Validation binary_cross_entropy = 0.065198
Epoch 2566
Loss = 6.4929e-03, PNorm = 425.0412, GNorm = 0.0710, lr_0 = 9.9852e-04
Validation binary_cross_entropy = 0.060212
Epoch 2567
Loss = 4.8015e-03, PNorm = 425.1156, GNorm = 0.1280, lr_0 = 9.9852e-04
Validation binary_cross_entropy = 0.067335
Epoch 2568
Loss = 8.7559e-02, PNorm = 425.1929, GNorm = 1.2070, lr_0 = 9.9852e-04
Validation binary_cross_entropy = 0.090343
Epoch 2569
Loss = 1.5939e-02, PNorm = 425.2752, GNorm = 0.4580, lr_0 = 9.9852e-04
Loss = 2.0962e-02, PNorm = 425.3512, GNorm = 2.6898, lr_0 = 9.9852e-04
Validation binary_cross_entropy = 0.076238
Epoch 2570
Loss = 2.8238e-02, PNorm = 425.4157, GNorm = 2.1410, lr_0 = 9.9851e-04
Validation binary_cross_entropy = 0.064421
Epoch 2571
Loss = 9.5396e-03, PNorm = 425.4872, GNorm = 0.0717, lr_0 = 9.9851e-04
Validation binary_cross_entropy = 0.064353
Epoch 2572
Loss = 6.2101e-02, PNorm = 425.5607, GNorm = 2.0188, lr_0 = 9.9851e-04
Validation binary_cross_entropy = 0.056410
Epoch 2573
Loss = 6.8159e-03, PNorm = 425.6461, GNorm = 0.2911, lr_0 = 9.9851e-04
Validation binary_cross_entropy = 0.070105
Epoch 2574
Loss = 6.5188e-02, PNorm = 425.7107, GNorm = 1.7415, lr_0 = 9.9851e-04
Validation binary_cross_entropy = 0.059050
Epoch 2575
Loss = 6.5006e-02, PNorm = 425.7897, GNorm = 0.0696, lr_0 = 9.9851e-04
Validation binary_cross_entropy = 0.067953
Epoch 2576
Loss = 1.4442e-02, PNorm = 425.8833, GNorm = 0.0895, lr_0 = 9.9851e-04
Validation binary_cross_entropy = 0.064001
Epoch 2577
Loss = 2.2252e-02, PNorm = 425.9601, GNorm = 0.2408, lr_0 = 9.9851e-04
Validation binary_cross_entropy = 0.070496
Epoch 2578
Loss = 4.1552e-02, PNorm = 426.0409, GNorm = 1.3995, lr_0 = 9.9851e-04
Validation binary_cross_entropy = 0.077242
Epoch 2579
Loss = 1.6604e-02, PNorm = 426.1206, GNorm = 0.6721, lr_0 = 9.9851e-04
Loss = 3.8176e-02, PNorm = 426.1758, GNorm = 0.6116, lr_0 = 9.9851e-04
Validation binary_cross_entropy = 0.064606
Epoch 2580
Loss = 2.1419e-02, PNorm = 426.2654, GNorm = 0.2760, lr_0 = 9.9851e-04
Validation binary_cross_entropy = 0.057792
Epoch 2581
Loss = 2.5623e-02, PNorm = 426.3719, GNorm = 0.5825, lr_0 = 9.9851e-04
Validation binary_cross_entropy = 0.069112
Epoch 2582
Loss = 2.2003e-02, PNorm = 426.4446, GNorm = 3.1006, lr_0 = 9.9851e-04
Validation binary_cross_entropy = 0.069607
Epoch 2583
Loss = 2.3329e-02, PNorm = 426.5198, GNorm = 0.2824, lr_0 = 9.9850e-04
Validation binary_cross_entropy = 0.066072
Epoch 2584
Loss = 4.3333e-02, PNorm = 426.5946, GNorm = 23.7044, lr_0 = 9.9850e-04
Validation binary_cross_entropy = 0.081883
Epoch 2585
Loss = 2.4437e-02, PNorm = 426.7151, GNorm = 1.7221, lr_0 = 9.9850e-04
Validation binary_cross_entropy = 0.066637
Epoch 2586
Loss = 7.5628e-02, PNorm = 426.8537, GNorm = 1.3863, lr_0 = 9.9850e-04
Validation binary_cross_entropy = 0.083075
Epoch 2587
Loss = 1.2629e-02, PNorm = 426.9984, GNorm = 0.3510, lr_0 = 9.9850e-04
Validation binary_cross_entropy = 0.092711
Epoch 2588
Loss = 4.7684e-02, PNorm = 427.0825, GNorm = 1.9941, lr_0 = 9.9850e-04
Validation binary_cross_entropy = 0.087171
Epoch 2589
Loss = 9.1759e-03, PNorm = 427.1481, GNorm = 0.4213, lr_0 = 9.9850e-04
Loss = 5.0555e-02, PNorm = 427.2245, GNorm = 1.5307, lr_0 = 9.9850e-04
Validation binary_cross_entropy = 0.095478
Epoch 2590
Loss = 4.1620e-02, PNorm = 427.2945, GNorm = 2.0101, lr_0 = 9.9850e-04
Validation binary_cross_entropy = 0.058298
Epoch 2591
Loss = 4.3719e-02, PNorm = 427.3863, GNorm = 0.2121, lr_0 = 9.9850e-04
Validation binary_cross_entropy = 0.058228
Epoch 2592
Loss = 1.3085e-02, PNorm = 427.5007, GNorm = 0.8159, lr_0 = 9.9850e-04
Validation binary_cross_entropy = 0.067451
Epoch 2593
Loss = 4.0436e-02, PNorm = 427.5955, GNorm = 1.4366, lr_0 = 9.9850e-04
Validation binary_cross_entropy = 0.073406
Epoch 2594
Loss = 6.4810e-02, PNorm = 427.6956, GNorm = 1.6607, lr_0 = 9.9850e-04
Validation binary_cross_entropy = 0.055620
Epoch 2595
Loss = 1.5194e-02, PNorm = 427.8015, GNorm = 0.1317, lr_0 = 9.9850e-04
Validation binary_cross_entropy = 0.063868
Epoch 2596
Loss = 2.8185e-02, PNorm = 427.8965, GNorm = 0.1668, lr_0 = 9.9850e-04
Validation binary_cross_entropy = 0.058516
Epoch 2597
Loss = 6.7412e-02, PNorm = 427.9997, GNorm = 1.4045, lr_0 = 9.9849e-04
Validation binary_cross_entropy = 0.083009
Epoch 2598
Loss = 6.6944e-03, PNorm = 428.1240, GNorm = 0.3172, lr_0 = 9.9849e-04
Validation binary_cross_entropy = 0.081586
Epoch 2599
Loss = 6.4067e-03, PNorm = 428.2281, GNorm = 0.3528, lr_0 = 9.9849e-04
Loss = 5.7376e-03, PNorm = 428.3158, GNorm = 0.0650, lr_0 = 9.9849e-04
Validation binary_cross_entropy = 0.093704
Epoch 2600
Loss = 3.0232e-02, PNorm = 428.3854, GNorm = 0.1206, lr_0 = 9.9849e-04
Validation binary_cross_entropy = 0.058816
Epoch 2601
Loss = 5.2385e-02, PNorm = 428.4740, GNorm = 1.0883, lr_0 = 9.9849e-04
Validation binary_cross_entropy = 0.071435
Epoch 2602
Loss = 5.6012e-02, PNorm = 428.5814, GNorm = 1.5351, lr_0 = 9.9849e-04
Validation binary_cross_entropy = 0.055655
Epoch 2603
Loss = 5.5271e-02, PNorm = 428.6878, GNorm = 0.3065, lr_0 = 9.9849e-04
Validation binary_cross_entropy = 0.065555
Epoch 2604
Loss = 3.2903e-02, PNorm = 428.7916, GNorm = 1.2929, lr_0 = 9.9849e-04
Validation binary_cross_entropy = 0.065978
Epoch 2605
Loss = 1.5883e-02, PNorm = 428.9090, GNorm = 0.1723, lr_0 = 9.9849e-04
Validation binary_cross_entropy = 0.110364
Epoch 2606
Loss = 2.2463e-02, PNorm = 429.0198, GNorm = 1.6430, lr_0 = 9.9849e-04
Validation binary_cross_entropy = 0.081270
Epoch 2607
Loss = 1.0690e-03, PNorm = 429.1068, GNorm = 0.0145, lr_0 = 9.9849e-04
Validation binary_cross_entropy = 0.119486
Epoch 2608
Loss = 5.6184e-02, PNorm = 429.1903, GNorm = 2.9588, lr_0 = 9.9849e-04
Validation binary_cross_entropy = 0.107440
Epoch 2609
Loss = 2.3304e-03, PNorm = 429.2708, GNorm = 0.1394, lr_0 = 9.9849e-04
Loss = 4.6208e-02, PNorm = 429.3516, GNorm = 0.1727, lr_0 = 9.9848e-04
Validation binary_cross_entropy = 0.087839
Epoch 2610
Loss = 1.8793e-02, PNorm = 429.4422, GNorm = 0.0628, lr_0 = 9.9848e-04
Validation binary_cross_entropy = 0.071926
Epoch 2611
Loss = 5.0500e-02, PNorm = 429.5191, GNorm = 0.1221, lr_0 = 9.9848e-04
Validation binary_cross_entropy = 0.067375
Epoch 2612
Loss = 3.3399e-02, PNorm = 429.6018, GNorm = 0.1100, lr_0 = 9.9848e-04
Validation binary_cross_entropy = 0.069549
Epoch 2613
Loss = 1.4941e-02, PNorm = 429.6738, GNorm = 0.0307, lr_0 = 9.9848e-04
Validation binary_cross_entropy = 0.094672
Epoch 2614
Loss = 4.0328e-02, PNorm = 429.7216, GNorm = 0.4234, lr_0 = 9.9848e-04
Validation binary_cross_entropy = 0.081236
Epoch 2615
Loss = 2.5678e-03, PNorm = 429.7557, GNorm = 0.0407, lr_0 = 9.9848e-04
Validation binary_cross_entropy = 0.076238
Epoch 2616
Loss = 4.7042e-03, PNorm = 429.8022, GNorm = 0.2668, lr_0 = 9.9848e-04
Validation binary_cross_entropy = 0.144214
Epoch 2617
Loss = 5.3266e-02, PNorm = 429.8686, GNorm = 1.4497, lr_0 = 9.9848e-04
Validation binary_cross_entropy = 0.088074
Epoch 2618
Loss = 1.3974e-03, PNorm = 429.9156, GNorm = 0.0090, lr_0 = 9.9848e-04
Validation binary_cross_entropy = 0.059953
Epoch 2619
Loss = 1.8131e-02, PNorm = 429.9756, GNorm = 0.2864, lr_0 = 9.9848e-04
Loss = 3.0308e-02, PNorm = 430.0731, GNorm = 2.4440, lr_0 = 9.9848e-04
Validation binary_cross_entropy = 0.085665
Epoch 2620
Loss = 1.0578e-02, PNorm = 430.1488, GNorm = 0.0665, lr_0 = 9.9848e-04
Validation binary_cross_entropy = 0.087211
Epoch 2621
Loss = 7.4782e-02, PNorm = 430.2280, GNorm = 0.4504, lr_0 = 9.9848e-04
Validation binary_cross_entropy = 0.068614
Epoch 2622
Loss = 1.4935e-02, PNorm = 430.3605, GNorm = 0.5026, lr_0 = 9.9848e-04
Validation binary_cross_entropy = 0.137311
Epoch 2623
Loss = 5.6755e-02, PNorm = 430.4592, GNorm = 0.2342, lr_0 = 9.9847e-04
Validation binary_cross_entropy = 0.074720
Epoch 2624
Loss = 2.6321e-02, PNorm = 430.5663, GNorm = 0.0553, lr_0 = 9.9847e-04
Validation binary_cross_entropy = 0.088414
Epoch 2625
Loss = 5.0589e-03, PNorm = 430.6648, GNorm = 0.1752, lr_0 = 9.9847e-04
Validation binary_cross_entropy = 0.076292
Epoch 2626
Loss = 3.9016e-03, PNorm = 430.7558, GNorm = 0.0765, lr_0 = 9.9847e-04
Validation binary_cross_entropy = 0.090335
Epoch 2627
Loss = 8.3095e-03, PNorm = 430.8551, GNorm = 0.3573, lr_0 = 9.9847e-04
Validation binary_cross_entropy = 0.108766
Epoch 2628
Loss = 1.2384e-02, PNorm = 430.9356, GNorm = 1.3089, lr_0 = 9.9847e-04
Validation binary_cross_entropy = 0.102897
Epoch 2629
Loss = 7.7793e-03, PNorm = 431.0082, GNorm = 0.5107, lr_0 = 9.9847e-04
Loss = 6.2018e-02, PNorm = 431.0965, GNorm = 1.7028, lr_0 = 9.9847e-04
Validation binary_cross_entropy = 0.088163
Epoch 2630
Loss = 1.6124e-02, PNorm = 431.2032, GNorm = 0.0380, lr_0 = 9.9847e-04
Validation binary_cross_entropy = 0.127487
Epoch 2631
Loss = 8.2883e-02, PNorm = 431.2855, GNorm = 2.8521, lr_0 = 9.9847e-04
Validation binary_cross_entropy = 0.074211
Epoch 2632
Loss = 3.1944e-02, PNorm = 431.3816, GNorm = 2.3756, lr_0 = 9.9847e-04
Validation binary_cross_entropy = 0.103994
Epoch 2633
Loss = 5.5891e-02, PNorm = 431.4825, GNorm = 1.0141, lr_0 = 9.9847e-04
Validation binary_cross_entropy = 0.075256
Epoch 2634
Loss = 1.9785e-02, PNorm = 431.5898, GNorm = 0.4014, lr_0 = 9.9847e-04
Validation binary_cross_entropy = 0.088693
Epoch 2635
Loss = 4.7328e-02, PNorm = 431.6780, GNorm = 3.0639, lr_0 = 9.9847e-04
Validation binary_cross_entropy = 0.088653
Epoch 2636
Loss = 7.1326e-02, PNorm = 431.7626, GNorm = 1.8348, lr_0 = 9.9847e-04
Validation binary_cross_entropy = 0.063191
Epoch 2637
Loss = 3.0242e-02, PNorm = 431.8548, GNorm = 0.0841, lr_0 = 9.9846e-04
Validation binary_cross_entropy = 0.086185
Epoch 2638
Loss = 4.2197e-02, PNorm = 431.9343, GNorm = 1.6519, lr_0 = 9.9846e-04
Validation binary_cross_entropy = 0.102618
Epoch 2639
Loss = 8.1549e-04, PNorm = 431.9876, GNorm = 0.0289, lr_0 = 9.9846e-04
Loss = 5.2688e-02, PNorm = 432.0425, GNorm = 0.0334, lr_0 = 9.9846e-04
Validation binary_cross_entropy = 0.072198
Epoch 2640
Loss = 2.5017e-02, PNorm = 432.1582, GNorm = 0.1223, lr_0 = 9.9846e-04
Validation binary_cross_entropy = 0.085132
Epoch 2641
Loss = 1.8742e-02, PNorm = 432.2528, GNorm = 0.1022, lr_0 = 9.9846e-04
Validation binary_cross_entropy = 0.125029
Epoch 2642
Loss = 2.7408e-02, PNorm = 432.3241, GNorm = 0.0479, lr_0 = 9.9846e-04
Validation binary_cross_entropy = 0.099688
Epoch 2643
Loss = 1.9507e-02, PNorm = 432.3929, GNorm = 2.3083, lr_0 = 9.9846e-04
Validation binary_cross_entropy = 0.161168
Epoch 2644
Loss = 4.4393e-02, PNorm = 432.4796, GNorm = 0.0019, lr_0 = 9.9846e-04
Validation binary_cross_entropy = 0.119698
Epoch 2645
Loss = 1.0923e-02, PNorm = 432.5433, GNorm = 0.2002, lr_0 = 9.9846e-04
Validation binary_cross_entropy = 0.076915
Epoch 2646
Loss = 4.6483e-02, PNorm = 432.6212, GNorm = 0.7061, lr_0 = 9.9846e-04
Validation binary_cross_entropy = 0.085255
Epoch 2647
Loss = 2.2591e-02, PNorm = 432.6983, GNorm = 1.2299, lr_0 = 9.9846e-04
Validation binary_cross_entropy = 0.071810
Epoch 2648
Loss = 5.4206e-03, PNorm = 432.7558, GNorm = 0.1563, lr_0 = 9.9846e-04
Validation binary_cross_entropy = 0.075263
Epoch 2649
Loss = 1.8017e-02, PNorm = 432.8107, GNorm = 0.4099, lr_0 = 9.9846e-04
Loss = 8.5363e-03, PNorm = 432.8632, GNorm = 0.2691, lr_0 = 9.9845e-04
Validation binary_cross_entropy = 0.091669
Epoch 2650
Loss = 1.3011e-02, PNorm = 432.8998, GNorm = 0.2075, lr_0 = 9.9845e-04
Validation binary_cross_entropy = 0.093870
Epoch 2651
Loss = 6.5279e-02, PNorm = 432.9383, GNorm = 0.3941, lr_0 = 9.9845e-04
Validation binary_cross_entropy = 0.068327
Epoch 2652
Loss = 9.2078e-03, PNorm = 433.0011, GNorm = 0.1217, lr_0 = 9.9845e-04
Validation binary_cross_entropy = 0.077879
Epoch 2653
Loss = 7.2286e-03, PNorm = 433.0641, GNorm = 0.6104, lr_0 = 9.9845e-04
Validation binary_cross_entropy = 0.114840
Epoch 2654
Loss = 1.5677e-02, PNorm = 433.1051, GNorm = 2.6682, lr_0 = 9.9845e-04
Validation binary_cross_entropy = 0.096395
Epoch 2655
Loss = 6.9745e-02, PNorm = 433.1466, GNorm = 6.7781, lr_0 = 9.9845e-04
Validation binary_cross_entropy = 0.101880
Epoch 2656
Loss = 3.3751e-02, PNorm = 433.2504, GNorm = 0.7108, lr_0 = 9.9845e-04
Validation binary_cross_entropy = 0.102594
Epoch 2657
Loss = 9.7636e-03, PNorm = 433.3507, GNorm = 0.4883, lr_0 = 9.9845e-04
Validation binary_cross_entropy = 0.118377
Epoch 2658
Loss = 3.2609e-03, PNorm = 433.4336, GNorm = 0.1228, lr_0 = 9.9845e-04
Validation binary_cross_entropy = 0.065385
Epoch 2659
Loss = 4.8408e-02, PNorm = 433.5139, GNorm = 1.0435, lr_0 = 9.9845e-04
Loss = 5.2321e-02, PNorm = 433.6511, GNorm = 0.7110, lr_0 = 9.9845e-04
Validation binary_cross_entropy = 0.112504
Epoch 2660
Loss = 6.2331e-02, PNorm = 433.7626, GNorm = 0.6359, lr_0 = 9.9845e-04
Validation binary_cross_entropy = 0.073014
Epoch 2661
Loss = 3.2554e-02, PNorm = 433.8690, GNorm = 1.0923, lr_0 = 9.9845e-04
Validation binary_cross_entropy = 0.067948
Epoch 2662
Loss = 4.6552e-02, PNorm = 433.9715, GNorm = 0.0186, lr_0 = 9.9845e-04
Validation binary_cross_entropy = 0.085276
Epoch 2663
Loss = 6.2122e-03, PNorm = 434.0479, GNorm = 0.0574, lr_0 = 9.9844e-04
Validation binary_cross_entropy = 0.077743
Epoch 2664
Loss = 1.1291e-02, PNorm = 434.1032, GNorm = 1.6505, lr_0 = 9.9844e-04
Validation binary_cross_entropy = 0.104960
Epoch 2665
Loss = 3.5633e-02, PNorm = 434.1583, GNorm = 0.0891, lr_0 = 9.9844e-04
Validation binary_cross_entropy = 0.085288
Epoch 2666
Loss = 2.5701e-02, PNorm = 434.2188, GNorm = 0.4567, lr_0 = 9.9844e-04
Validation binary_cross_entropy = 0.085125
Epoch 2667
Loss = 1.5702e-02, PNorm = 434.2736, GNorm = 0.0330, lr_0 = 9.9844e-04
Validation binary_cross_entropy = 0.100887
Epoch 2668
Loss = 1.0709e-01, PNorm = 434.3417, GNorm = 0.3199, lr_0 = 9.9844e-04
Validation binary_cross_entropy = 0.087934
Epoch 2669
Loss = 6.2043e-03, PNorm = 434.4098, GNorm = 0.3059, lr_0 = 9.9844e-04
Loss = 3.0033e-02, PNorm = 434.4696, GNorm = 4.6797, lr_0 = 9.9844e-04
Validation binary_cross_entropy = 0.091212
Epoch 2670
Loss = 6.5804e-03, PNorm = 434.5381, GNorm = 0.2270, lr_0 = 9.9844e-04
Validation binary_cross_entropy = 0.129255
Epoch 2671
Loss = 1.7814e-02, PNorm = 434.5984, GNorm = 0.0154, lr_0 = 9.9844e-04
Validation binary_cross_entropy = 0.121472
Epoch 2672
Loss = 2.4873e-02, PNorm = 434.6591, GNorm = 5.0905, lr_0 = 9.9844e-04
Validation binary_cross_entropy = 0.099755
Epoch 2673
Loss = 4.7638e-02, PNorm = 434.7425, GNorm = 0.9704, lr_0 = 9.9844e-04
Validation binary_cross_entropy = 0.093946
Epoch 2674
Loss = 2.4323e-02, PNorm = 434.8293, GNorm = 0.7539, lr_0 = 9.9844e-04
Validation binary_cross_entropy = 0.116581
Epoch 2675
Loss = 5.3665e-02, PNorm = 434.8987, GNorm = 1.1720, lr_0 = 9.9844e-04
Validation binary_cross_entropy = 0.085261
Epoch 2676
Loss = 3.0046e-02, PNorm = 434.9593, GNorm = 0.2086, lr_0 = 9.9843e-04
Validation binary_cross_entropy = 0.084749
Epoch 2677
Loss = 1.1070e-02, PNorm = 435.0479, GNorm = 0.1896, lr_0 = 9.9843e-04
Validation binary_cross_entropy = 0.105053
Epoch 2678
Loss = 3.5050e-02, PNorm = 435.1184, GNorm = 0.0734, lr_0 = 9.9843e-04
Validation binary_cross_entropy = 0.081205
Epoch 2679
Loss = 2.0929e-03, PNorm = 435.1945, GNorm = 0.0552, lr_0 = 9.9843e-04
Loss = 3.4997e-02, PNorm = 435.2828, GNorm = 2.2081, lr_0 = 9.9843e-04
Validation binary_cross_entropy = 0.084322
Epoch 2680
Loss = 3.6027e-02, PNorm = 435.3846, GNorm = 0.1358, lr_0 = 9.9843e-04
Validation binary_cross_entropy = 0.103440
Epoch 2681
Loss = 1.3697e-01, PNorm = 435.5602, GNorm = 2.6525, lr_0 = 9.9843e-04
Validation binary_cross_entropy = 0.069731
Epoch 2682
Loss = 4.5901e-02, PNorm = 435.7916, GNorm = 1.3208, lr_0 = 9.9843e-04
Validation binary_cross_entropy = 0.122986
Epoch 2683
Loss = 3.2532e-02, PNorm = 435.9482, GNorm = 0.1496, lr_0 = 9.9843e-04
Validation binary_cross_entropy = 0.083589
Epoch 2684
Loss = 6.2603e-02, PNorm = 436.0576, GNorm = 0.3862, lr_0 = 9.9843e-04
Validation binary_cross_entropy = 0.107615
Epoch 2685
Loss = 4.2413e-02, PNorm = 436.1879, GNorm = 0.1681, lr_0 = 9.9843e-04
Validation binary_cross_entropy = 0.085833
Epoch 2686
Loss = 2.0613e-02, PNorm = 436.2972, GNorm = 0.3829, lr_0 = 9.9843e-04
Validation binary_cross_entropy = 0.058600
Epoch 2687
Loss = 5.9792e-02, PNorm = 436.4003, GNorm = 0.9819, lr_0 = 9.9843e-04
Validation binary_cross_entropy = 0.086820
Epoch 2688
Loss = 8.9897e-02, PNorm = 436.5162, GNorm = 0.8745, lr_0 = 9.9843e-04
Validation binary_cross_entropy = 0.079813
Epoch 2689
Loss = 5.6348e-03, PNorm = 436.6098, GNorm = 0.2003, lr_0 = 9.9843e-04
Loss = 2.3112e-02, PNorm = 436.6753, GNorm = 0.9542, lr_0 = 9.9842e-04
Validation binary_cross_entropy = 0.093058
Epoch 2690
Loss = 7.2558e-02, PNorm = 436.7561, GNorm = 0.2285, lr_0 = 9.9842e-04
Validation binary_cross_entropy = 0.100811
Epoch 2691
Loss = 3.0155e-02, PNorm = 436.9029, GNorm = 0.5318, lr_0 = 9.9842e-04
Validation binary_cross_entropy = 0.102213
Epoch 2692
Loss = 2.0711e-02, PNorm = 437.0212, GNorm = 0.0282, lr_0 = 9.9842e-04
Validation binary_cross_entropy = 0.101658
Epoch 2693
Loss = 1.9700e-02, PNorm = 437.1155, GNorm = 0.1299, lr_0 = 9.9842e-04
Validation binary_cross_entropy = 0.111596
Epoch 2694
Loss = 3.3098e-02, PNorm = 437.1884, GNorm = 0.9120, lr_0 = 9.9842e-04
Validation binary_cross_entropy = 0.101487
Epoch 2695
Loss = 1.6815e-02, PNorm = 437.2576, GNorm = 0.4235, lr_0 = 9.9842e-04
Validation binary_cross_entropy = 0.099083
Epoch 2696
Loss = 5.1500e-02, PNorm = 437.3133, GNorm = 2.5463, lr_0 = 9.9842e-04
Validation binary_cross_entropy = 0.099763
Epoch 2697
Loss = 3.7238e-02, PNorm = 437.3805, GNorm = 1.0489, lr_0 = 9.9842e-04
Validation binary_cross_entropy = 0.133676
Epoch 2698
Loss = 3.4324e-03, PNorm = 437.4396, GNorm = 0.2903, lr_0 = 9.9842e-04
Validation binary_cross_entropy = 0.089820
Epoch 2699
Loss = 2.6777e-03, PNorm = 437.4787, GNorm = 0.1449, lr_0 = 9.9842e-04
Loss = 2.1385e-02, PNorm = 437.5251, GNorm = 0.3277, lr_0 = 9.9842e-04
Validation binary_cross_entropy = 0.088776
Epoch 2700
Loss = 9.4384e-03, PNorm = 437.5749, GNorm = 0.3348, lr_0 = 9.9842e-04
Validation binary_cross_entropy = 0.109627
Epoch 2701
Loss = 5.1841e-02, PNorm = 437.6153, GNorm = 0.6889, lr_0 = 9.9842e-04
Validation binary_cross_entropy = 0.068581
Epoch 2702
Loss = 2.6971e-02, PNorm = 437.6808, GNorm = 0.2615, lr_0 = 9.9841e-04
Validation binary_cross_entropy = 0.086140
Epoch 2703
Loss = 1.5677e-02, PNorm = 437.7653, GNorm = 0.3937, lr_0 = 9.9841e-04
Validation binary_cross_entropy = 0.085237
Epoch 2704
Loss = 1.4709e-02, PNorm = 437.8449, GNorm = 0.0642, lr_0 = 9.9841e-04
Validation binary_cross_entropy = 0.078886
Epoch 2705
Loss = 5.4481e-02, PNorm = 437.9231, GNorm = 1.6390, lr_0 = 9.9841e-04
Validation binary_cross_entropy = 0.083122
Epoch 2706
Loss = 1.5459e-02, PNorm = 437.9930, GNorm = 0.6937, lr_0 = 9.9841e-04
Validation binary_cross_entropy = 0.080057
Epoch 2707
Loss = 5.8425e-02, PNorm = 438.0594, GNorm = 1.3466, lr_0 = 9.9841e-04
Validation binary_cross_entropy = 0.100309
Epoch 2708
Loss = 5.9837e-04, PNorm = 438.1324, GNorm = 0.0104, lr_0 = 9.9841e-04
Validation binary_cross_entropy = 0.086096
Epoch 2709
Loss = 3.0657e-03, PNorm = 438.1792, GNorm = 0.0781, lr_0 = 9.9841e-04
Loss = 1.0288e-02, PNorm = 438.2356, GNorm = 0.0754, lr_0 = 9.9841e-04
Validation binary_cross_entropy = 0.082046
Epoch 2710
Loss = 5.9770e-03, PNorm = 438.2894, GNorm = 0.0143, lr_0 = 9.9841e-04
Validation binary_cross_entropy = 0.114488
Epoch 2711
Loss = 3.7857e-02, PNorm = 438.3188, GNorm = 1.7904, lr_0 = 9.9841e-04
Validation binary_cross_entropy = 0.086264
Epoch 2712
Loss = 2.4594e-02, PNorm = 438.3791, GNorm = 0.5000, lr_0 = 9.9841e-04
Validation binary_cross_entropy = 0.080806
Epoch 2713
Loss = 6.7842e-02, PNorm = 438.4654, GNorm = 1.2243, lr_0 = 9.9841e-04
Validation binary_cross_entropy = 0.080864
Epoch 2714
Loss = 3.6060e-02, PNorm = 438.5507, GNorm = 2.1433, lr_0 = 9.9841e-04
Validation binary_cross_entropy = 0.082561
Epoch 2715
Loss = 3.2919e-02, PNorm = 438.6563, GNorm = 3.1204, lr_0 = 9.9841e-04
Validation binary_cross_entropy = 0.144064
Epoch 2716
Loss = 4.8573e-02, PNorm = 438.7564, GNorm = 4.4439, lr_0 = 9.9840e-04
Validation binary_cross_entropy = 0.083134
Epoch 2717
Loss = 1.8932e-02, PNorm = 438.8516, GNorm = 1.9122, lr_0 = 9.9840e-04
Validation binary_cross_entropy = 0.132388
Epoch 2718
Loss = 6.0562e-03, PNorm = 438.9468, GNorm = 0.2213, lr_0 = 9.9840e-04
Validation binary_cross_entropy = 0.079523
Epoch 2719
Loss = 4.6479e-02, PNorm = 439.0161, GNorm = 2.8520, lr_0 = 9.9840e-04
Loss = 3.1315e-02, PNorm = 439.1094, GNorm = 0.1783, lr_0 = 9.9840e-04
Validation binary_cross_entropy = 0.078263
Epoch 2720
Loss = 2.3605e-02, PNorm = 439.1844, GNorm = 0.0254, lr_0 = 9.9840e-04
Validation binary_cross_entropy = 0.073432
Epoch 2721
Loss = 9.0354e-03, PNorm = 439.2480, GNorm = 0.3968, lr_0 = 9.9840e-04
Validation binary_cross_entropy = 0.090439
Epoch 2722
Loss = 2.6892e-02, PNorm = 439.3147, GNorm = 2.0231, lr_0 = 9.9840e-04
Validation binary_cross_entropy = 0.094891
Epoch 2723
Loss = 2.1528e-02, PNorm = 439.3902, GNorm = 1.9509, lr_0 = 9.9840e-04
Validation binary_cross_entropy = 0.103494
Epoch 2724
Loss = 3.1050e-03, PNorm = 439.4620, GNorm = 0.0244, lr_0 = 9.9840e-04
Validation binary_cross_entropy = 0.128807
Epoch 2725
Loss = 3.1612e-02, PNorm = 439.5082, GNorm = 0.0304, lr_0 = 9.9840e-04
Validation binary_cross_entropy = 0.083511
Epoch 2726
Loss = 6.9514e-02, PNorm = 439.5679, GNorm = 0.3612, lr_0 = 9.9840e-04
Validation binary_cross_entropy = 0.081750
Epoch 2727
Loss = 2.6138e-02, PNorm = 439.6617, GNorm = 0.2046, lr_0 = 9.9840e-04
Validation binary_cross_entropy = 0.098858
Epoch 2728
Loss = 3.5914e-02, PNorm = 439.7341, GNorm = 0.9613, lr_0 = 9.9840e-04
Validation binary_cross_entropy = 0.106978
Epoch 2729
Loss = 1.2335e-02, PNorm = 439.7936, GNorm = 0.4038, lr_0 = 9.9839e-04
Loss = 3.9485e-03, PNorm = 439.8434, GNorm = 0.0305, lr_0 = 9.9839e-04
Validation binary_cross_entropy = 0.152223
Epoch 2730
Loss = 1.6045e-02, PNorm = 439.8823, GNorm = 0.3145, lr_0 = 9.9839e-04
Validation binary_cross_entropy = 0.163768
Epoch 2731
Loss = 7.5703e-03, PNorm = 439.9371, GNorm = 0.2405, lr_0 = 9.9839e-04
Validation binary_cross_entropy = 0.126661
Epoch 2732
Loss = 1.1579e-03, PNorm = 439.9930, GNorm = 0.2689, lr_0 = 9.9839e-04
Validation binary_cross_entropy = 0.150414
Epoch 2733
Loss = 9.1357e-02, PNorm = 440.0490, GNorm = 2.2228, lr_0 = 9.9839e-04
Validation binary_cross_entropy = 0.106769
Epoch 2734
Loss = 5.2448e-02, PNorm = 440.1497, GNorm = 0.3809, lr_0 = 9.9839e-04
Validation binary_cross_entropy = 0.091247
Epoch 2735
Loss = 7.3607e-02, PNorm = 440.2690, GNorm = 0.9128, lr_0 = 9.9839e-04
Validation binary_cross_entropy = 0.072711
Epoch 2736
Loss = 7.0330e-02, PNorm = 440.3942, GNorm = 2.5807, lr_0 = 9.9839e-04
Validation binary_cross_entropy = 0.133981
Epoch 2737
Loss = 1.9319e-02, PNorm = 440.5405, GNorm = 0.7105, lr_0 = 9.9839e-04
Validation binary_cross_entropy = 0.100423
Epoch 2738
Loss = 4.9792e-03, PNorm = 440.6420, GNorm = 0.2688, lr_0 = 9.9839e-04
Validation binary_cross_entropy = 0.087466
Epoch 2739
Loss = 2.0703e-02, PNorm = 440.7315, GNorm = 0.5967, lr_0 = 9.9839e-04
Loss = 2.9625e-02, PNorm = 440.8226, GNorm = 0.3734, lr_0 = 9.9839e-04
Validation binary_cross_entropy = 0.094463
Epoch 2740
Loss = 3.2040e-02, PNorm = 440.9017, GNorm = 0.4868, lr_0 = 9.9839e-04
Validation binary_cross_entropy = 0.099370
Epoch 2741
Loss = 2.1904e-02, PNorm = 440.9703, GNorm = 0.5213, lr_0 = 9.9839e-04
Validation binary_cross_entropy = 0.098061
Epoch 2742
Loss = 2.9145e-02, PNorm = 441.0506, GNorm = 0.6930, lr_0 = 9.9838e-04
Validation binary_cross_entropy = 0.108704
Epoch 2743
Loss = 3.3874e-02, PNorm = 441.1078, GNorm = 0.2087, lr_0 = 9.9838e-04
Validation binary_cross_entropy = 0.085279
Epoch 2744
Loss = 1.2989e-02, PNorm = 441.1696, GNorm = 0.3445, lr_0 = 9.9838e-04
Validation binary_cross_entropy = 0.097066
Epoch 2745
Loss = 2.0444e-02, PNorm = 441.2323, GNorm = 2.0685, lr_0 = 9.9838e-04
Validation binary_cross_entropy = 0.102098
Epoch 2746
Loss = 3.2444e-02, PNorm = 441.2792, GNorm = 0.6269, lr_0 = 9.9838e-04
Validation binary_cross_entropy = 0.076375
Epoch 2747
Loss = 1.0445e-02, PNorm = 441.3460, GNorm = 0.1056, lr_0 = 9.9838e-04
Validation binary_cross_entropy = 0.081796
Epoch 2748
Loss = 6.9139e-03, PNorm = 441.4273, GNorm = 0.1343, lr_0 = 9.9838e-04
Validation binary_cross_entropy = 0.090843
Epoch 2749
Loss = 2.7337e-02, PNorm = 441.4941, GNorm = 1.5942, lr_0 = 9.9838e-04
Loss = 3.5716e-03, PNorm = 441.5490, GNorm = 0.1327, lr_0 = 9.9838e-04
Validation binary_cross_entropy = 0.089578
Epoch 2750
Loss = 3.3732e-02, PNorm = 441.5998, GNorm = 2.6357, lr_0 = 9.9838e-04
Validation binary_cross_entropy = 0.098477
Epoch 2751
Loss = 2.9233e-02, PNorm = 441.6757, GNorm = 1.0179, lr_0 = 9.9838e-04
Validation binary_cross_entropy = 0.089064
Epoch 2752
Loss = 3.3174e-02, PNorm = 441.7672, GNorm = 0.9798, lr_0 = 9.9838e-04
Validation binary_cross_entropy = 0.077905
Epoch 2753
Loss = 3.5002e-02, PNorm = 441.8599, GNorm = 0.2221, lr_0 = 9.9838e-04
Validation binary_cross_entropy = 0.083610
Epoch 2754
Loss = 4.2447e-02, PNorm = 441.9431, GNorm = 0.5369, lr_0 = 9.9838e-04
Validation binary_cross_entropy = 0.091343
Epoch 2755
Loss = 5.7881e-03, PNorm = 442.0135, GNorm = 0.2768, lr_0 = 9.9838e-04
Validation binary_cross_entropy = 0.071329
Epoch 2756
Loss = 3.5658e-02, PNorm = 442.0730, GNorm = 0.2606, lr_0 = 9.9837e-04
Validation binary_cross_entropy = 0.061202
Epoch 2757
Loss = 2.6370e-02, PNorm = 442.1463, GNorm = 1.2114, lr_0 = 9.9837e-04
Validation binary_cross_entropy = 0.065785
Epoch 2758
Loss = 2.4710e-03, PNorm = 442.2210, GNorm = 0.0643, lr_0 = 9.9837e-04
Validation binary_cross_entropy = 0.067677
Epoch 2759
Loss = 5.2553e-03, PNorm = 442.2745, GNorm = 0.1573, lr_0 = 9.9837e-04
Loss = 1.7583e-02, PNorm = 442.3189, GNorm = 0.0805, lr_0 = 9.9837e-04
Validation binary_cross_entropy = 0.069356
Epoch 2760
Loss = 1.5158e-02, PNorm = 442.3874, GNorm = 0.1180, lr_0 = 9.9837e-04
Validation binary_cross_entropy = 0.083656
Epoch 2761
Loss = 2.1786e-02, PNorm = 442.4390, GNorm = 0.1780, lr_0 = 9.9837e-04
Validation binary_cross_entropy = 0.092421
Epoch 2762
Loss = 1.6291e-02, PNorm = 442.4848, GNorm = 0.1312, lr_0 = 9.9837e-04
Validation binary_cross_entropy = 0.081710
Epoch 2763
Loss = 1.1753e-02, PNorm = 442.5282, GNorm = 1.0575, lr_0 = 9.9837e-04
Validation binary_cross_entropy = 0.080515
Epoch 2764
Loss = 2.2459e-02, PNorm = 442.5747, GNorm = 0.1369, lr_0 = 9.9837e-04
Validation binary_cross_entropy = 0.070723
Epoch 2765
Loss = 1.9540e-02, PNorm = 442.6247, GNorm = 0.9498, lr_0 = 9.9837e-04
Validation binary_cross_entropy = 0.074611
Epoch 2766
Loss = 1.1853e-02, PNorm = 442.6880, GNorm = 0.1158, lr_0 = 9.9837e-04
Validation binary_cross_entropy = 0.080217
Epoch 2767
Loss = 1.1441e-02, PNorm = 442.7434, GNorm = 0.0534, lr_0 = 9.9837e-04
Validation binary_cross_entropy = 0.079197
Epoch 2768
Loss = 9.9751e-03, PNorm = 442.7903, GNorm = 0.5988, lr_0 = 9.9837e-04
Validation binary_cross_entropy = 0.082198
Epoch 2769
Loss = 1.3354e-03, PNorm = 442.8415, GNorm = 0.0393, lr_0 = 9.9836e-04
Loss = 2.7042e-02, PNorm = 442.8883, GNorm = 0.3608, lr_0 = 9.9836e-04
Validation binary_cross_entropy = 0.069703
Epoch 2770
Loss = 3.1270e-02, PNorm = 442.9442, GNorm = 0.0659, lr_0 = 9.9836e-04
Validation binary_cross_entropy = 0.083754
Epoch 2771
Loss = 1.5036e-02, PNorm = 442.9867, GNorm = 1.9557, lr_0 = 9.9836e-04
Validation binary_cross_entropy = 0.086896
Epoch 2772
Loss = 5.4122e-03, PNorm = 443.0333, GNorm = 0.2950, lr_0 = 9.9836e-04
Validation binary_cross_entropy = 0.093059
Epoch 2773
Loss = 3.6297e-03, PNorm = 443.0723, GNorm = 0.0223, lr_0 = 9.9836e-04
Validation binary_cross_entropy = 0.108555
Epoch 2774
Loss = 1.0377e-01, PNorm = 443.1021, GNorm = 4.2964, lr_0 = 9.9836e-04
Validation binary_cross_entropy = 0.082104
Epoch 2775
Loss = 4.5352e-02, PNorm = 443.1611, GNorm = 1.5337, lr_0 = 9.9836e-04
Validation binary_cross_entropy = 0.078991
Epoch 2776
Loss = 8.5176e-03, PNorm = 443.2616, GNorm = 0.1587, lr_0 = 9.9836e-04
Validation binary_cross_entropy = 0.115229
Epoch 2777
Loss = 6.2025e-02, PNorm = 443.3521, GNorm = 0.8972, lr_0 = 9.9836e-04
Validation binary_cross_entropy = 0.090996
Epoch 2778
Loss = 5.3192e-02, PNorm = 443.4369, GNorm = 0.9349, lr_0 = 9.9836e-04
Validation binary_cross_entropy = 0.083813
Epoch 2779
Loss = 7.9584e-03, PNorm = 443.5100, GNorm = 0.5542, lr_0 = 9.9836e-04
Loss = 3.7387e-02, PNorm = 443.5689, GNorm = 1.9745, lr_0 = 9.9836e-04
Validation binary_cross_entropy = 0.081194
Epoch 2780
Loss = 3.6801e-02, PNorm = 443.6378, GNorm = 4.9500, lr_0 = 9.9836e-04
Validation binary_cross_entropy = 0.066857
Epoch 2781
Loss = 2.6314e-02, PNorm = 443.7234, GNorm = 0.7453, lr_0 = 9.9836e-04
Validation binary_cross_entropy = 0.067776
Epoch 2782
Loss = 3.2604e-02, PNorm = 443.7914, GNorm = 3.0918, lr_0 = 9.9835e-04
Validation binary_cross_entropy = 0.079054
Epoch 2783
Loss = 1.2670e-02, PNorm = 443.8530, GNorm = 3.0251, lr_0 = 9.9835e-04
Validation binary_cross_entropy = 0.082337
Epoch 2784
Loss = 4.9805e-02, PNorm = 443.9037, GNorm = 1.7373, lr_0 = 9.9835e-04
Validation binary_cross_entropy = 0.081043
Epoch 2785
Loss = 3.9706e-02, PNorm = 443.9604, GNorm = 0.2865, lr_0 = 9.9835e-04
Validation binary_cross_entropy = 0.069143
Epoch 2786
Loss = 9.5489e-03, PNorm = 444.0282, GNorm = 0.3062, lr_0 = 9.9835e-04
Validation binary_cross_entropy = 0.081431
Epoch 2787
Loss = 1.2603e-02, PNorm = 444.0991, GNorm = 0.3343, lr_0 = 9.9835e-04
Validation binary_cross_entropy = 0.079882
Epoch 2788
Loss = 4.6424e-03, PNorm = 444.1813, GNorm = 0.1417, lr_0 = 9.9835e-04
Validation binary_cross_entropy = 0.125086
Epoch 2789
Loss = 1.8111e-02, PNorm = 444.2592, GNorm = 0.4151, lr_0 = 9.9835e-04
Loss = 4.4978e-02, PNorm = 444.3413, GNorm = 1.2338, lr_0 = 9.9835e-04
Validation binary_cross_entropy = 0.079058
Epoch 2790
Loss = 1.6225e-02, PNorm = 444.4425, GNorm = 0.6097, lr_0 = 9.9835e-04
Validation binary_cross_entropy = 0.139393
Epoch 2791
Loss = 4.6948e-02, PNorm = 444.5072, GNorm = 0.1888, lr_0 = 9.9835e-04
Validation binary_cross_entropy = 0.070411
Epoch 2792
Loss = 5.9698e-02, PNorm = 444.5955, GNorm = 3.1504, lr_0 = 9.9835e-04
Validation binary_cross_entropy = 0.070379
Epoch 2793
Loss = 2.9967e-02, PNorm = 444.6755, GNorm = 0.2187, lr_0 = 9.9835e-04
Validation binary_cross_entropy = 0.062758
Epoch 2794
Loss = 1.6676e-02, PNorm = 444.7632, GNorm = 0.6287, lr_0 = 9.9835e-04
Validation binary_cross_entropy = 0.073001
Epoch 2795
Loss = 1.5099e-02, PNorm = 444.8324, GNorm = 0.6712, lr_0 = 9.9834e-04
Validation binary_cross_entropy = 0.066330
Epoch 2796
Loss = 9.5662e-03, PNorm = 444.8963, GNorm = 0.3831, lr_0 = 9.9834e-04
Validation binary_cross_entropy = 0.071175
Epoch 2797
Loss = 6.5364e-02, PNorm = 444.9624, GNorm = 1.7249, lr_0 = 9.9834e-04
Validation binary_cross_entropy = 0.074926
Epoch 2798
Loss = 3.6196e-02, PNorm = 445.0392, GNorm = 0.8342, lr_0 = 9.9834e-04
Validation binary_cross_entropy = 0.086295
Epoch 2799
Loss = 7.6573e-03, PNorm = 445.1099, GNorm = 0.3395, lr_0 = 9.9834e-04
Loss = 1.6674e-02, PNorm = 445.1691, GNorm = 0.0046, lr_0 = 9.9834e-04
Validation binary_cross_entropy = 0.082979
Epoch 2800
Loss = 7.0065e-02, PNorm = 445.2319, GNorm = 0.1544, lr_0 = 9.9834e-04
Validation binary_cross_entropy = 0.063650
Epoch 2801
Loss = 5.5632e-02, PNorm = 445.3171, GNorm = 0.3069, lr_0 = 9.9834e-04
Validation binary_cross_entropy = 0.057368
Epoch 2802
Loss = 3.0691e-02, PNorm = 445.4003, GNorm = 0.3713, lr_0 = 9.9834e-04
Validation binary_cross_entropy = 0.057593
Epoch 2803
Loss = 1.3680e-02, PNorm = 445.4665, GNorm = 0.2215, lr_0 = 9.9834e-04
Validation binary_cross_entropy = 0.060023
Epoch 2804
Loss = 9.7075e-03, PNorm = 445.5249, GNorm = 0.3639, lr_0 = 9.9834e-04
Validation binary_cross_entropy = 0.079959
Epoch 2805
Loss = 2.6031e-03, PNorm = 445.5723, GNorm = 0.0135, lr_0 = 9.9834e-04
Validation binary_cross_entropy = 0.072951
Epoch 2806
Loss = 1.4937e-03, PNorm = 445.6023, GNorm = 0.1059, lr_0 = 9.9834e-04
Validation binary_cross_entropy = 0.067360
Epoch 2807
Loss = 1.7860e-03, PNorm = 445.6560, GNorm = 0.0629, lr_0 = 9.9834e-04
Validation binary_cross_entropy = 0.062411
Epoch 2808
Loss = 1.5042e-02, PNorm = 445.6974, GNorm = 1.3062, lr_0 = 9.9834e-04
Validation binary_cross_entropy = 0.069467
Epoch 2809
Loss = 6.2593e-02, PNorm = 445.7583, GNorm = 1.3655, lr_0 = 9.9833e-04
Loss = 2.3931e-02, PNorm = 445.8046, GNorm = 0.0227, lr_0 = 9.9833e-04
Validation binary_cross_entropy = 0.070311
Epoch 2810
Loss = 1.0353e-02, PNorm = 445.8433, GNorm = 0.1816, lr_0 = 9.9833e-04
Validation binary_cross_entropy = 0.073641
Epoch 2811
Loss = 2.1639e-02, PNorm = 445.8805, GNorm = 0.2884, lr_0 = 9.9833e-04
Validation binary_cross_entropy = 0.075991
Epoch 2812
Loss = 9.4632e-03, PNorm = 445.9136, GNorm = 1.8186, lr_0 = 9.9833e-04
Validation binary_cross_entropy = 0.068330
Epoch 2813
Loss = 1.7923e-02, PNorm = 445.9529, GNorm = 0.2457, lr_0 = 9.9833e-04
Validation binary_cross_entropy = 0.066022
Epoch 2814
Loss = 8.9943e-03, PNorm = 446.0069, GNorm = 0.4415, lr_0 = 9.9833e-04
Validation binary_cross_entropy = 0.070022
Epoch 2815
Loss = 7.5279e-03, PNorm = 446.0706, GNorm = 0.1731, lr_0 = 9.9833e-04
Validation binary_cross_entropy = 0.081047
Epoch 2816
Loss = 3.0027e-02, PNorm = 446.1501, GNorm = 0.0559, lr_0 = 9.9833e-04
Validation binary_cross_entropy = 0.104135
Epoch 2817
Loss = 9.8214e-02, PNorm = 446.2206, GNorm = 1.5894, lr_0 = 9.9833e-04
Validation binary_cross_entropy = 0.082909
Epoch 2818
Loss = 2.7996e-03, PNorm = 446.2709, GNorm = 0.0814, lr_0 = 9.9833e-04
Validation binary_cross_entropy = 0.095148
Epoch 2819
Loss = 4.4921e-03, PNorm = 446.3403, GNorm = 0.1271, lr_0 = 9.9833e-04
Loss = 6.9072e-03, PNorm = 446.4040, GNorm = 1.0484, lr_0 = 9.9833e-04
Validation binary_cross_entropy = 0.126915
Epoch 2820
Loss = 2.2747e-02, PNorm = 446.4483, GNorm = 0.3235, lr_0 = 9.9833e-04
Validation binary_cross_entropy = 0.113765
Epoch 2821
Loss = 4.1548e-02, PNorm = 446.4959, GNorm = 1.9673, lr_0 = 9.9832e-04
Validation binary_cross_entropy = 0.098983
Epoch 2822
Loss = 1.4424e-02, PNorm = 446.5560, GNorm = 0.2079, lr_0 = 9.9832e-04
Validation binary_cross_entropy = 0.129921
Epoch 2823
Loss = 3.3376e-02, PNorm = 446.6304, GNorm = 0.1100, lr_0 = 9.9832e-04
Validation binary_cross_entropy = 0.120454
Epoch 2824
Loss = 3.4105e-02, PNorm = 446.6859, GNorm = 0.2005, lr_0 = 9.9832e-04
Validation binary_cross_entropy = 0.115192
Epoch 2825
Loss = 1.2014e-02, PNorm = 446.7546, GNorm = 0.0737, lr_0 = 9.9832e-04
Validation binary_cross_entropy = 0.164343
Epoch 2826
Loss = 6.5236e-02, PNorm = 446.8204, GNorm = 2.2118, lr_0 = 9.9832e-04
Validation binary_cross_entropy = 0.101966
Epoch 2827
Loss = 3.6190e-02, PNorm = 446.8845, GNorm = 0.3700, lr_0 = 9.9832e-04
Validation binary_cross_entropy = 0.127545
Epoch 2828
Loss = 2.7690e-03, PNorm = 446.9625, GNorm = 0.0491, lr_0 = 9.9832e-04
Validation binary_cross_entropy = 0.062469
Epoch 2829
Loss = 1.6419e-02, PNorm = 447.0225, GNorm = 0.2977, lr_0 = 9.9832e-04
Loss = 3.2212e-02, PNorm = 447.1454, GNorm = 0.1110, lr_0 = 9.9832e-04
Validation binary_cross_entropy = 0.079547
Epoch 2830
Loss = 3.6190e-02, PNorm = 447.2381, GNorm = 0.6286, lr_0 = 9.9832e-04
Validation binary_cross_entropy = 0.090244
Epoch 2831
Loss = 3.0083e-02, PNorm = 447.2923, GNorm = 0.1982, lr_0 = 9.9832e-04
Validation binary_cross_entropy = 0.059999
Epoch 2832
Loss = 1.7962e-02, PNorm = 447.3486, GNorm = 0.3254, lr_0 = 9.9832e-04
Validation binary_cross_entropy = 0.065184
Epoch 2833
Loss = 2.7152e-02, PNorm = 447.4076, GNorm = 0.5537, lr_0 = 9.9832e-04
Validation binary_cross_entropy = 0.067805
Epoch 2834
Loss = 6.2202e-03, PNorm = 447.4771, GNorm = 0.0717, lr_0 = 9.9832e-04
Validation binary_cross_entropy = 0.076726
Epoch 2835
Loss = 6.2577e-02, PNorm = 447.5488, GNorm = 0.0220, lr_0 = 9.9831e-04
Validation binary_cross_entropy = 0.075209
Epoch 2836
Loss = 1.9857e-02, PNorm = 447.6119, GNorm = 0.1220, lr_0 = 9.9831e-04
Validation binary_cross_entropy = 0.077727
Epoch 2837
Loss = 1.4894e-02, PNorm = 447.6670, GNorm = 0.1947, lr_0 = 9.9831e-04
Validation binary_cross_entropy = 0.119909
Epoch 2838
Loss = 8.6323e-02, PNorm = 447.7472, GNorm = 0.0959, lr_0 = 9.9831e-04
Validation binary_cross_entropy = 0.069393
Epoch 2839
Loss = 3.7254e-03, PNorm = 447.8149, GNorm = 0.1126, lr_0 = 9.9831e-04
Loss = 2.4966e-02, PNorm = 447.8803, GNorm = 0.3659, lr_0 = 9.9831e-04
Validation binary_cross_entropy = 0.058816
Epoch 2840
Loss = 3.1919e-02, PNorm = 447.9612, GNorm = 0.5197, lr_0 = 9.9831e-04
Validation binary_cross_entropy = 0.066089
Epoch 2841
Loss = 5.3105e-03, PNorm = 448.0252, GNorm = 0.2094, lr_0 = 9.9831e-04
Validation binary_cross_entropy = 0.074425
Epoch 2842
Loss = 1.0178e-02, PNorm = 448.0804, GNorm = 0.0139, lr_0 = 9.9831e-04
Validation binary_cross_entropy = 0.107868
Epoch 2843
Loss = 2.0584e-03, PNorm = 448.1170, GNorm = 0.0228, lr_0 = 9.9831e-04
Validation binary_cross_entropy = 0.111326
Epoch 2844
Loss = 1.2072e-01, PNorm = 448.1360, GNorm = 0.0887, lr_0 = 9.9831e-04
Validation binary_cross_entropy = 0.068212
Epoch 2845
Loss = 4.0218e-02, PNorm = 448.2034, GNorm = 1.1789, lr_0 = 9.9831e-04
Validation binary_cross_entropy = 0.070791
Epoch 2846
Loss = 2.9258e-02, PNorm = 448.3180, GNorm = 0.7590, lr_0 = 9.9831e-04
Validation binary_cross_entropy = 0.073534
Epoch 2847
Loss = 4.5870e-03, PNorm = 448.4128, GNorm = 0.1122, lr_0 = 9.9831e-04
Validation binary_cross_entropy = 0.062765
Epoch 2848
Loss = 1.2349e-02, PNorm = 448.4881, GNorm = 0.2400, lr_0 = 9.9830e-04
Validation binary_cross_entropy = 0.072228
Epoch 2849
Loss = 2.2231e-03, PNorm = 448.5588, GNorm = 0.0653, lr_0 = 9.9830e-04
Loss = 4.1425e-03, PNorm = 448.6182, GNorm = 0.0017, lr_0 = 9.9830e-04
Validation binary_cross_entropy = 0.112128
Epoch 2850
Loss = 4.7936e-02, PNorm = 448.6546, GNorm = 0.0784, lr_0 = 9.9830e-04
Validation binary_cross_entropy = 0.080649
Epoch 2851
Loss = 3.8282e-02, PNorm = 448.7323, GNorm = 1.1396, lr_0 = 9.9830e-04
Validation binary_cross_entropy = 0.088159
Epoch 2852
Loss = 2.3944e-02, PNorm = 448.8046, GNorm = 1.4419, lr_0 = 9.9830e-04
Validation binary_cross_entropy = 0.070181
Epoch 2853
Loss = 1.3360e-02, PNorm = 448.8728, GNorm = 0.2689, lr_0 = 9.9830e-04
Validation binary_cross_entropy = 0.068202
Epoch 2854
Loss = 1.1280e-02, PNorm = 448.9386, GNorm = 0.1766, lr_0 = 9.9830e-04
Validation binary_cross_entropy = 0.080644
Epoch 2855
Loss = 3.8514e-02, PNorm = 448.9994, GNorm = 2.9508, lr_0 = 9.9830e-04
Validation binary_cross_entropy = 0.083771
Epoch 2856
Loss = 1.4203e-02, PNorm = 449.0521, GNorm = 0.0119, lr_0 = 9.9830e-04
Validation binary_cross_entropy = 0.085928
Epoch 2857
Loss = 4.1187e-03, PNorm = 449.0930, GNorm = 0.0245, lr_0 = 9.9830e-04
Validation binary_cross_entropy = 0.086417
Epoch 2858
Loss = 1.6576e-03, PNorm = 449.1336, GNorm = 0.0262, lr_0 = 9.9830e-04
Validation binary_cross_entropy = 0.109354
Epoch 2859
Loss = 1.8217e-04, PNorm = 449.1715, GNorm = 0.0067, lr_0 = 9.9830e-04
Loss = 8.9170e-03, PNorm = 449.2121, GNorm = 3.5353, lr_0 = 9.9830e-04
Validation binary_cross_entropy = 0.198021
Epoch 2860
Loss = 8.8006e-02, PNorm = 449.2939, GNorm = 5.4911, lr_0 = 9.9830e-04
Validation binary_cross_entropy = 0.084871
Epoch 2861
Loss = 4.2228e-02, PNorm = 449.4534, GNorm = 0.8947, lr_0 = 9.9829e-04
Validation binary_cross_entropy = 0.108275
Epoch 2862
Loss = 7.5323e-02, PNorm = 449.5805, GNorm = 2.8265, lr_0 = 9.9829e-04
Validation binary_cross_entropy = 0.079544
Epoch 2863
Loss = 1.3118e-02, PNorm = 449.6832, GNorm = 1.1089, lr_0 = 9.9829e-04
Validation binary_cross_entropy = 0.068124
Epoch 2864
Loss = 9.3618e-03, PNorm = 449.7947, GNorm = 0.0941, lr_0 = 9.9829e-04
Validation binary_cross_entropy = 0.090704
Epoch 2865
Loss = 1.5207e-02, PNorm = 449.8760, GNorm = 0.7060, lr_0 = 9.9829e-04
Validation binary_cross_entropy = 0.105680
Epoch 2866
Loss = 2.6832e-02, PNorm = 449.9233, GNorm = 0.1999, lr_0 = 9.9829e-04
Validation binary_cross_entropy = 0.112532
Epoch 2867
Loss = 2.0787e-02, PNorm = 449.9694, GNorm = 1.1327, lr_0 = 9.9829e-04
Validation binary_cross_entropy = 0.140306
Epoch 2868
Loss = 6.7715e-04, PNorm = 450.0082, GNorm = 0.0858, lr_0 = 9.9829e-04
Validation binary_cross_entropy = 0.124353
Epoch 2869
Loss = 6.5625e-04, PNorm = 450.0361, GNorm = 0.0814, lr_0 = 9.9829e-04
Loss = 2.9542e-02, PNorm = 450.0797, GNorm = 2.3590, lr_0 = 9.9829e-04
Validation binary_cross_entropy = 0.113019
Epoch 2870
Loss = 1.5496e-02, PNorm = 450.1377, GNorm = 0.4526, lr_0 = 9.9829e-04
Validation binary_cross_entropy = 0.122178
Epoch 2871
Loss = 8.4756e-02, PNorm = 450.1975, GNorm = 0.0299, lr_0 = 9.9829e-04
Validation binary_cross_entropy = 0.072947
Epoch 2872
Loss = 6.2007e-02, PNorm = 450.2858, GNorm = 0.5943, lr_0 = 9.9829e-04
Validation binary_cross_entropy = 0.077548
Epoch 2873
Loss = 3.1614e-02, PNorm = 450.3785, GNorm = 1.3844, lr_0 = 9.9829e-04
Validation binary_cross_entropy = 0.081705
Epoch 2874
Loss = 6.8121e-03, PNorm = 450.4610, GNorm = 0.0228, lr_0 = 9.9828e-04
Validation binary_cross_entropy = 0.106205
Epoch 2875
Loss = 1.4576e-02, PNorm = 450.5318, GNorm = 0.5661, lr_0 = 9.9828e-04
Validation binary_cross_entropy = 0.109830
Epoch 2876
Loss = 4.3479e-03, PNorm = 450.5949, GNorm = 0.2940, lr_0 = 9.9828e-04
Validation binary_cross_entropy = 0.105565
Epoch 2877
Loss = 1.3046e-01, PNorm = 450.6521, GNorm = 3.2522, lr_0 = 9.9828e-04
Validation binary_cross_entropy = 0.109819
Epoch 2878
Loss = 1.1760e-01, PNorm = 450.7522, GNorm = 1.8187, lr_0 = 9.9828e-04
Validation binary_cross_entropy = 0.098534
Epoch 2879
Loss = 5.6089e-02, PNorm = 450.8456, GNorm = 1.2367, lr_0 = 9.9828e-04
Loss = 2.5071e-02, PNorm = 450.9310, GNorm = 0.1704, lr_0 = 9.9828e-04
Validation binary_cross_entropy = 0.090065
Epoch 2880
Loss = 6.0646e-02, PNorm = 451.0154, GNorm = 1.5124, lr_0 = 9.9828e-04
Validation binary_cross_entropy = 0.079701
Epoch 2881
Loss = 1.9737e-02, PNorm = 451.1193, GNorm = 1.1760, lr_0 = 9.9828e-04
Validation binary_cross_entropy = 0.075485
Epoch 2882
Loss = 2.2913e-02, PNorm = 451.1977, GNorm = 0.1626, lr_0 = 9.9828e-04
Validation binary_cross_entropy = 0.074400
Epoch 2883
Loss = 2.7355e-02, PNorm = 451.2754, GNorm = 2.3249, lr_0 = 9.9828e-04
Validation binary_cross_entropy = 0.095302
Epoch 2884
Loss = 4.9346e-02, PNorm = 451.3540, GNorm = 0.8850, lr_0 = 9.9828e-04
Validation binary_cross_entropy = 0.067501
Epoch 2885
Loss = 4.3787e-02, PNorm = 451.4602, GNorm = 0.5605, lr_0 = 9.9828e-04
Validation binary_cross_entropy = 0.078457
Epoch 2886
Loss = 2.1427e-02, PNorm = 451.5613, GNorm = 2.1182, lr_0 = 9.9828e-04
Validation binary_cross_entropy = 0.088622
Epoch 2887
Loss = 7.3434e-03, PNorm = 451.6507, GNorm = 0.0469, lr_0 = 9.9828e-04
Validation binary_cross_entropy = 0.110607
Epoch 2888
Loss = 5.7370e-02, PNorm = 451.7251, GNorm = 0.2011, lr_0 = 9.9827e-04
Validation binary_cross_entropy = 0.088224
Epoch 2889
Loss = 4.6070e-02, PNorm = 451.8024, GNorm = 0.7953, lr_0 = 9.9827e-04
Loss = 2.3549e-02, PNorm = 451.8911, GNorm = 1.3032, lr_0 = 9.9827e-04
Validation binary_cross_entropy = 0.060363
Epoch 2890
Loss = 2.4061e-02, PNorm = 451.9772, GNorm = 1.0888, lr_0 = 9.9827e-04
Validation binary_cross_entropy = 0.083393
Epoch 2891
Loss = 1.4865e-02, PNorm = 452.0482, GNorm = 0.2566, lr_0 = 9.9827e-04
Validation binary_cross_entropy = 0.114374
Epoch 2892
Loss = 1.0016e-02, PNorm = 452.1094, GNorm = 0.0491, lr_0 = 9.9827e-04
Validation binary_cross_entropy = 0.155324
Epoch 2893
Loss = 1.6566e-02, PNorm = 452.1507, GNorm = 0.7581, lr_0 = 9.9827e-04
Validation binary_cross_entropy = 0.111607
Epoch 2894
Loss = 2.3048e-02, PNorm = 452.2091, GNorm = 0.0379, lr_0 = 9.9827e-04
Validation binary_cross_entropy = 0.103342
Epoch 2895
Loss = 6.0676e-02, PNorm = 452.2889, GNorm = 0.0026, lr_0 = 9.9827e-04
Validation binary_cross_entropy = 0.095597
Epoch 2896
Loss = 2.2890e-02, PNorm = 452.3707, GNorm = 0.3726, lr_0 = 9.9827e-04
Validation binary_cross_entropy = 0.077799
Epoch 2897
Loss = 2.4760e-01, PNorm = 452.4426, GNorm = 3.7144, lr_0 = 9.9827e-04
Validation binary_cross_entropy = 0.060274
Epoch 2898
Loss = 3.2103e-02, PNorm = 452.5243, GNorm = 0.7860, lr_0 = 9.9827e-04
Validation binary_cross_entropy = 0.069562
Epoch 2899
Loss = 7.3716e-03, PNorm = 452.6064, GNorm = 0.2587, lr_0 = 9.9827e-04
Loss = 2.1381e-02, PNorm = 452.6720, GNorm = 0.3580, lr_0 = 9.9827e-04
Validation binary_cross_entropy = 0.069140
Epoch 2900
Loss = 1.7203e-02, PNorm = 452.7186, GNorm = 0.0401, lr_0 = 9.9827e-04
Validation binary_cross_entropy = 0.105673
Epoch 2901
Loss = 4.4230e-02, PNorm = 452.7576, GNorm = 1.1320, lr_0 = 9.9826e-04
Validation binary_cross_entropy = 0.065855
Epoch 2902
Loss = 3.1964e-02, PNorm = 452.8200, GNorm = 0.6145, lr_0 = 9.9826e-04
Validation binary_cross_entropy = 0.064649
Epoch 2903
Loss = 3.5651e-02, PNorm = 452.8942, GNorm = 2.2394, lr_0 = 9.9826e-04
Validation binary_cross_entropy = 0.077618
Epoch 2904
Loss = 2.2496e-02, PNorm = 452.9670, GNorm = 0.0390, lr_0 = 9.9826e-04
Validation binary_cross_entropy = 0.074156
Epoch 2905
Loss = 7.9916e-02, PNorm = 453.0296, GNorm = 2.4271, lr_0 = 9.9826e-04
Validation binary_cross_entropy = 0.064595
Epoch 2906
Loss = 3.7552e-02, PNorm = 453.0961, GNorm = 0.1117, lr_0 = 9.9826e-04
Validation binary_cross_entropy = 0.086883
Epoch 2907
Loss = 1.1427e-02, PNorm = 453.1797, GNorm = 0.0315, lr_0 = 9.9826e-04
Validation binary_cross_entropy = 0.125109
Epoch 2908
Loss = 1.7365e-03, PNorm = 453.2399, GNorm = 0.0215, lr_0 = 9.9826e-04
Validation binary_cross_entropy = 0.073543
Epoch 2909
Loss = 2.5177e-03, PNorm = 453.2779, GNorm = 0.0591, lr_0 = 9.9826e-04
Loss = 5.0137e-02, PNorm = 453.3705, GNorm = 0.3952, lr_0 = 9.9826e-04
Validation binary_cross_entropy = 0.091941
Epoch 2910
Loss = 2.0515e-02, PNorm = 453.4611, GNorm = 1.9187, lr_0 = 9.9826e-04
Validation binary_cross_entropy = 0.140050
Epoch 2911
Loss = 4.7695e-02, PNorm = 453.5394, GNorm = 1.1565, lr_0 = 9.9826e-04
Validation binary_cross_entropy = 0.070788
Epoch 2912
Loss = 1.1654e-02, PNorm = 453.6573, GNorm = 0.0313, lr_0 = 9.9826e-04
Validation binary_cross_entropy = 0.110937
Epoch 2913
Loss = 4.2418e-02, PNorm = 453.7299, GNorm = 0.8809, lr_0 = 9.9826e-04
Validation binary_cross_entropy = 0.090590
Epoch 2914
Loss = 3.3709e-02, PNorm = 453.8030, GNorm = 0.3894, lr_0 = 9.9825e-04
Validation binary_cross_entropy = 0.116420
Epoch 2915
Loss = 2.1678e-02, PNorm = 453.8907, GNorm = 0.8892, lr_0 = 9.9825e-04
Validation binary_cross_entropy = 0.133257
Epoch 2916
Loss = 7.3963e-03, PNorm = 453.9639, GNorm = 0.9577, lr_0 = 9.9825e-04
Validation binary_cross_entropy = 0.130346
Epoch 2917
Loss = 2.3441e-02, PNorm = 454.0146, GNorm = 0.2601, lr_0 = 9.9825e-04
Validation binary_cross_entropy = 0.114154
Epoch 2918
Loss = 5.1769e-02, PNorm = 454.0716, GNorm = 2.2260, lr_0 = 9.9825e-04
Validation binary_cross_entropy = 0.144187
Epoch 2919
Loss = 4.3758e-03, PNorm = 454.1327, GNorm = 0.3794, lr_0 = 9.9825e-04
Loss = 1.6586e-02, PNorm = 454.2026, GNorm = 0.3318, lr_0 = 9.9825e-04
Validation binary_cross_entropy = 0.114010
Epoch 2920
Loss = 3.1745e-02, PNorm = 454.2771, GNorm = 0.0152, lr_0 = 9.9825e-04
Validation binary_cross_entropy = 0.106413
Epoch 2921
Loss = 5.9358e-02, PNorm = 454.3469, GNorm = 0.3254, lr_0 = 9.9825e-04
Validation binary_cross_entropy = 0.075768
Epoch 2922
Loss = 8.9429e-02, PNorm = 454.4367, GNorm = 0.4681, lr_0 = 9.9825e-04
Validation binary_cross_entropy = 0.052159
Epoch 2923
Loss = 5.5055e-02, PNorm = 454.5766, GNorm = 0.7923, lr_0 = 9.9825e-04
Validation binary_cross_entropy = 0.078923
Epoch 2924
Loss = 4.1439e-02, PNorm = 454.6896, GNorm = 0.0730, lr_0 = 9.9825e-04
Validation binary_cross_entropy = 0.066818
Epoch 2925
Loss = 8.3702e-03, PNorm = 454.7739, GNorm = 0.2859, lr_0 = 9.9825e-04
Validation binary_cross_entropy = 0.094742
Epoch 2926
Loss = 2.2409e-02, PNorm = 454.8485, GNorm = 2.4321, lr_0 = 9.9825e-04
Validation binary_cross_entropy = 0.153054
Epoch 2927
Loss = 6.6765e-04, PNorm = 454.9008, GNorm = 0.0357, lr_0 = 9.9825e-04
Validation binary_cross_entropy = 0.157010
Epoch 2928
Loss = 1.3459e-03, PNorm = 454.9397, GNorm = 0.3341, lr_0 = 9.9824e-04
Validation binary_cross_entropy = 0.193875
Epoch 2929
Loss = 2.3257e-01, PNorm = 454.9869, GNorm = 2.9555, lr_0 = 9.9824e-04
Loss = 4.4556e-02, PNorm = 455.0577, GNorm = 0.0553, lr_0 = 9.9824e-04
Validation binary_cross_entropy = 0.174232
Epoch 2930
Loss = 6.6079e-02, PNorm = 455.1530, GNorm = 0.6898, lr_0 = 9.9824e-04
Validation binary_cross_entropy = 0.123974
Epoch 2931
Loss = 9.0500e-02, PNorm = 455.2754, GNorm = 1.2318, lr_0 = 9.9824e-04
Validation binary_cross_entropy = 0.134684
Epoch 2932
Loss = 6.3181e-02, PNorm = 455.4044, GNorm = 0.5355, lr_0 = 9.9824e-04
Validation binary_cross_entropy = 0.087617
Epoch 2933
Loss = 3.7250e-02, PNorm = 455.5206, GNorm = 0.1156, lr_0 = 9.9824e-04
Validation binary_cross_entropy = 0.112249
Epoch 2934
Loss = 2.0134e-02, PNorm = 455.6083, GNorm = 0.2881, lr_0 = 9.9824e-04
Validation binary_cross_entropy = 0.141231
Epoch 2935
Loss = 1.2349e-02, PNorm = 455.6691, GNorm = 0.0685, lr_0 = 9.9824e-04
Validation binary_cross_entropy = 0.170986
Epoch 2936
Loss = 4.4336e-03, PNorm = 455.7180, GNorm = 0.0065, lr_0 = 9.9824e-04
Validation binary_cross_entropy = 0.176904
Epoch 2937
Loss = 7.7643e-03, PNorm = 455.7553, GNorm = 0.4400, lr_0 = 9.9824e-04
Validation binary_cross_entropy = 0.150475
Epoch 2938
Loss = 1.2207e-02, PNorm = 455.8040, GNorm = 0.2109, lr_0 = 9.9824e-04
Validation binary_cross_entropy = 0.190275
Epoch 2939
Loss = 5.1596e-02, PNorm = 455.8644, GNorm = 2.0235, lr_0 = 9.9824e-04
Loss = 2.5250e-02, PNorm = 455.9226, GNorm = 2.0075, lr_0 = 9.9824e-04
Validation binary_cross_entropy = 0.255319
Epoch 2940
Loss = 2.1311e-02, PNorm = 455.9925, GNorm = 1.8611, lr_0 = 9.9823e-04
Validation binary_cross_entropy = 0.163203
Epoch 2941
Loss = 6.3560e-02, PNorm = 456.0617, GNorm = 1.7940, lr_0 = 9.9823e-04
Validation binary_cross_entropy = 0.082199
Epoch 2942
Loss = 6.5230e-02, PNorm = 456.1587, GNorm = 1.1664, lr_0 = 9.9823e-04
Validation binary_cross_entropy = 0.082139
Epoch 2943
Loss = 6.3788e-02, PNorm = 456.2654, GNorm = 1.2220, lr_0 = 9.9823e-04
Validation binary_cross_entropy = 0.062140
Epoch 2944
Loss = 2.0937e-02, PNorm = 456.4059, GNorm = 0.1370, lr_0 = 9.9823e-04
Validation binary_cross_entropy = 0.093744
Epoch 2945
Loss = 3.2240e-01, PNorm = 456.5639, GNorm = 0.9319, lr_0 = 9.9823e-04
Validation binary_cross_entropy = 0.104395
Epoch 2946
Loss = 3.1515e-02, PNorm = 456.8683, GNorm = 0.9648, lr_0 = 9.9823e-04
Validation binary_cross_entropy = 0.068239
Epoch 2947
Loss = 2.4047e-02, PNorm = 457.0654, GNorm = 0.7755, lr_0 = 9.9823e-04
Validation binary_cross_entropy = 0.075341
Epoch 2948
Loss = 1.5347e-02, PNorm = 457.1922, GNorm = 0.6290, lr_0 = 9.9823e-04
Validation binary_cross_entropy = 0.079861
Epoch 2949
Loss = 3.7339e-02, PNorm = 457.2902, GNorm = 0.9258, lr_0 = 9.9823e-04
Loss = 2.7668e-02, PNorm = 457.3557, GNorm = 0.2515, lr_0 = 9.9823e-04
Validation binary_cross_entropy = 0.090843
Epoch 2950
Loss = 3.4510e-02, PNorm = 457.4106, GNorm = 2.0408, lr_0 = 9.9823e-04
Validation binary_cross_entropy = 0.078523
Epoch 2951
Loss = 4.9150e-02, PNorm = 457.4739, GNorm = 1.3188, lr_0 = 9.9823e-04
Validation binary_cross_entropy = 0.081857
Epoch 2952
Loss = 2.0377e-02, PNorm = 457.5307, GNorm = 0.9798, lr_0 = 9.9823e-04
Validation binary_cross_entropy = 0.083590
Epoch 2953
Loss = 1.6651e-02, PNorm = 457.5800, GNorm = 0.0454, lr_0 = 9.9823e-04
Validation binary_cross_entropy = 0.089438
Epoch 2954
Loss = 2.2274e-02, PNorm = 457.6326, GNorm = 0.8945, lr_0 = 9.9822e-04
Validation binary_cross_entropy = 0.112381
Epoch 2955
Loss = 2.9305e-02, PNorm = 457.6862, GNorm = 0.9389, lr_0 = 9.9822e-04
Validation binary_cross_entropy = 0.104975
Epoch 2956
Loss = 1.2453e-02, PNorm = 457.7340, GNorm = 0.0253, lr_0 = 9.9822e-04
Validation binary_cross_entropy = 0.119507
Epoch 2957
Loss = 2.0433e-02, PNorm = 457.7840, GNorm = 0.0539, lr_0 = 9.9822e-04
Validation binary_cross_entropy = 0.102287
Epoch 2958
Loss = 4.2226e-03, PNorm = 457.8274, GNorm = 0.0340, lr_0 = 9.9822e-04
Validation binary_cross_entropy = 0.128913
Epoch 2959
Loss = 1.5489e-01, PNorm = 457.8916, GNorm = 1.6020, lr_0 = 9.9822e-04
Loss = 4.3273e-02, PNorm = 457.9516, GNorm = 0.3443, lr_0 = 9.9822e-04
Validation binary_cross_entropy = 0.106686
Epoch 2960
Loss = 7.7111e-02, PNorm = 458.0239, GNorm = 0.9163, lr_0 = 9.9822e-04
Validation binary_cross_entropy = 0.084549
Epoch 2961
Loss = 2.2703e-02, PNorm = 458.1214, GNorm = 0.1332, lr_0 = 9.9822e-04
Validation binary_cross_entropy = 0.104410
Epoch 2962
Loss = 3.7002e-02, PNorm = 458.1889, GNorm = 1.1292, lr_0 = 9.9822e-04
Validation binary_cross_entropy = 0.080799
Epoch 2963
Loss = 7.2355e-02, PNorm = 458.2645, GNorm = 0.1898, lr_0 = 9.9822e-04
Validation binary_cross_entropy = 0.069280
Epoch 2964
Loss = 2.3694e-02, PNorm = 458.3403, GNorm = 0.1135, lr_0 = 9.9822e-04
Validation binary_cross_entropy = 0.076595
Epoch 2965
Loss = 1.6389e-02, PNorm = 458.4037, GNorm = 0.5513, lr_0 = 9.9822e-04
Validation binary_cross_entropy = 0.091952
Epoch 2966
Loss = 2.0429e-02, PNorm = 458.4510, GNorm = 0.9117, lr_0 = 9.9822e-04
Validation binary_cross_entropy = 0.090918
Epoch 2967
Loss = 2.7558e-03, PNorm = 458.4914, GNorm = 0.2437, lr_0 = 9.9821e-04
Validation binary_cross_entropy = 0.087342
Epoch 2968
Loss = 4.2959e-03, PNorm = 458.5373, GNorm = 0.4427, lr_0 = 9.9821e-04
Validation binary_cross_entropy = 0.129608
Epoch 2969
Loss = 1.7004e-02, PNorm = 458.5816, GNorm = 1.0613, lr_0 = 9.9821e-04
Loss = 5.1830e-02, PNorm = 458.6204, GNorm = 2.9907, lr_0 = 9.9821e-04
Validation binary_cross_entropy = 0.122267
Epoch 2970
Loss = 2.3280e-02, PNorm = 458.6849, GNorm = 2.6807, lr_0 = 9.9821e-04
Validation binary_cross_entropy = 0.105923
Epoch 2971
Loss = 4.6281e-02, PNorm = 458.7524, GNorm = 2.7583, lr_0 = 9.9821e-04
Validation binary_cross_entropy = 0.113747
Epoch 2972
Loss = 3.0510e-02, PNorm = 458.8067, GNorm = 1.1692, lr_0 = 9.9821e-04
Validation binary_cross_entropy = 0.089552
Epoch 2973
Loss = 5.2841e-02, PNorm = 458.8679, GNorm = 0.3951, lr_0 = 9.9821e-04
Validation binary_cross_entropy = 0.099018
Epoch 2974
Loss = 6.6702e-02, PNorm = 458.9506, GNorm = 0.2439, lr_0 = 9.9821e-04
Validation binary_cross_entropy = 0.069853
Epoch 2975
Loss = 8.7379e-02, PNorm = 459.0743, GNorm = 0.9069, lr_0 = 9.9821e-04
Validation binary_cross_entropy = 0.121024
Epoch 2976
Loss = 2.6697e-02, PNorm = 459.1829, GNorm = 0.6808, lr_0 = 9.9821e-04
Validation binary_cross_entropy = 0.096336
Epoch 2977
Loss = 4.6139e-02, PNorm = 459.2471, GNorm = 0.8930, lr_0 = 9.9821e-04
Validation binary_cross_entropy = 0.112338
Epoch 2978
Loss = 7.7687e-03, PNorm = 459.3061, GNorm = 0.4315, lr_0 = 9.9821e-04
Validation binary_cross_entropy = 0.101609
Epoch 2979
Loss = 4.1692e-03, PNorm = 459.3674, GNorm = 0.1895, lr_0 = 9.9821e-04
Loss = 2.4023e-02, PNorm = 459.4358, GNorm = 0.1709, lr_0 = 9.9821e-04
Validation binary_cross_entropy = 0.130730
Epoch 2980
Loss = 4.7448e-02, PNorm = 459.5190, GNorm = 0.0632, lr_0 = 9.9820e-04
Validation binary_cross_entropy = 0.154074
Epoch 2981
Loss = 1.6114e-02, PNorm = 459.5811, GNorm = 0.5875, lr_0 = 9.9820e-04
Validation binary_cross_entropy = 0.127267
Epoch 2982
Loss = 1.1670e-02, PNorm = 459.6296, GNorm = 3.5887, lr_0 = 9.9820e-04
Validation binary_cross_entropy = 0.112775
Epoch 2983
Loss = 2.8740e-02, PNorm = 459.6794, GNorm = 1.0616, lr_0 = 9.9820e-04
Validation binary_cross_entropy = 0.098104
Epoch 2984
Loss = 1.8557e-02, PNorm = 459.7619, GNorm = 2.3561, lr_0 = 9.9820e-04
Validation binary_cross_entropy = 0.129035
Epoch 2985
Loss = 4.5130e-02, PNorm = 459.8372, GNorm = 0.1164, lr_0 = 9.9820e-04
Validation binary_cross_entropy = 0.133948
Epoch 2986
Loss = 4.4067e-03, PNorm = 459.8984, GNorm = 0.1128, lr_0 = 9.9820e-04
Validation binary_cross_entropy = 0.111427
Epoch 2987
Loss = 1.7901e-02, PNorm = 459.9548, GNorm = 0.7192, lr_0 = 9.9820e-04
Validation binary_cross_entropy = 0.126903
Epoch 2988
Loss = 4.4631e-03, PNorm = 460.0129, GNorm = 0.1295, lr_0 = 9.9820e-04
Validation binary_cross_entropy = 0.134923
Epoch 2989
Loss = 3.9347e-03, PNorm = 460.0715, GNorm = 0.1021, lr_0 = 9.9820e-04
Loss = 1.8101e-02, PNorm = 460.1282, GNorm = 0.6781, lr_0 = 9.9820e-04
Validation binary_cross_entropy = 0.109838
Epoch 2990
Loss = 1.3572e-02, PNorm = 460.1842, GNorm = 0.1371, lr_0 = 9.9820e-04
Validation binary_cross_entropy = 0.125738
Epoch 2991
Loss = 3.1874e-02, PNorm = 460.2412, GNorm = 0.3451, lr_0 = 9.9820e-04
Validation binary_cross_entropy = 0.173355
Epoch 2992
Loss = 1.5694e-01, PNorm = 460.2921, GNorm = 0.0183, lr_0 = 9.9820e-04
Validation binary_cross_entropy = 0.098932
Epoch 2993
Loss = 3.3389e-02, PNorm = 460.4097, GNorm = 0.4239, lr_0 = 9.9819e-04
Validation binary_cross_entropy = 0.105984
Epoch 2994
Loss = 9.0821e-02, PNorm = 460.5437, GNorm = 0.3298, lr_0 = 9.9819e-04
Validation binary_cross_entropy = 0.074141
Epoch 2995
Loss = 3.3814e-02, PNorm = 460.6460, GNorm = 0.5259, lr_0 = 9.9819e-04
Validation binary_cross_entropy = 0.075296
Epoch 2996
Loss = 5.4558e-02, PNorm = 460.7365, GNorm = 3.0777, lr_0 = 9.9819e-04
Validation binary_cross_entropy = 0.127942
Epoch 2997
Loss = 4.7772e-02, PNorm = 460.8146, GNorm = 1.1610, lr_0 = 9.9819e-04
Validation binary_cross_entropy = 0.095591
Epoch 2998
Loss = 1.2356e-02, PNorm = 460.8750, GNorm = 0.4043, lr_0 = 9.9819e-04
Validation binary_cross_entropy = 0.110084
Epoch 2999
Loss = 2.1277e-03, PNorm = 460.9341, GNorm = 0.0633, lr_0 = 9.9819e-04
Loss = 1.9592e-02, PNorm = 460.9807, GNorm = 0.6655, lr_0 = 9.9819e-04
Validation binary_cross_entropy = 0.119799
Epoch 3000
Loss = 2.1478e-02, PNorm = 461.0184, GNorm = 0.3419, lr_0 = 9.9819e-04
Validation binary_cross_entropy = 0.107236
Epoch 3001
Loss = 2.6530e-02, PNorm = 461.0697, GNorm = 0.3477, lr_0 = 9.9819e-04
Validation binary_cross_entropy = 0.139680
Epoch 3002
Loss = 1.3730e-01, PNorm = 461.1344, GNorm = 0.4320, lr_0 = 9.9819e-04
Validation binary_cross_entropy = 0.084063
Epoch 3003
Loss = 7.1492e-02, PNorm = 461.2335, GNorm = 0.3447, lr_0 = 9.9819e-04
Validation binary_cross_entropy = 0.111257
Epoch 3004
Loss = 5.1130e-02, PNorm = 461.3067, GNorm = 0.9466, lr_0 = 9.9819e-04
Validation binary_cross_entropy = 0.069657
Epoch 3005
Loss = 3.4213e-02, PNorm = 461.3805, GNorm = 0.2182, lr_0 = 9.9819e-04
Validation binary_cross_entropy = 0.076547
Epoch 3006
Loss = 1.6478e-02, PNorm = 461.4640, GNorm = 0.4567, lr_0 = 9.9819e-04
Validation binary_cross_entropy = 0.092892
Epoch 3007
Loss = 4.7863e-03, PNorm = 461.5297, GNorm = 0.0996, lr_0 = 9.9818e-04
Validation binary_cross_entropy = 0.101195
Epoch 3008
Loss = 1.7397e-02, PNorm = 461.5795, GNorm = 0.0568, lr_0 = 9.9818e-04
Validation binary_cross_entropy = 0.100275
Epoch 3009
Loss = 7.1683e-02, PNorm = 461.6216, GNorm = 4.5583, lr_0 = 9.9818e-04
Loss = 2.9464e-02, PNorm = 461.6821, GNorm = 0.3004, lr_0 = 9.9818e-04
Validation binary_cross_entropy = 0.093145
Epoch 3010
Loss = 2.6216e-02, PNorm = 461.7472, GNorm = 0.1494, lr_0 = 9.9818e-04
Validation binary_cross_entropy = 0.111870
Epoch 3011
Loss = 2.1633e-02, PNorm = 461.8091, GNorm = 0.9499, lr_0 = 9.9818e-04
Validation binary_cross_entropy = 0.106469
Epoch 3012
Loss = 5.4479e-02, PNorm = 461.8714, GNorm = 0.1615, lr_0 = 9.9818e-04
Validation binary_cross_entropy = 0.100393
Epoch 3013
Loss = 3.0625e-02, PNorm = 461.9336, GNorm = 0.1370, lr_0 = 9.9818e-04
Validation binary_cross_entropy = 0.070764
Epoch 3014
Loss = 3.0567e-02, PNorm = 462.0034, GNorm = 0.0757, lr_0 = 9.9818e-04
Validation binary_cross_entropy = 0.061380
Epoch 3015
Loss = 2.7709e-02, PNorm = 462.1065, GNorm = 0.1548, lr_0 = 9.9818e-04
Validation binary_cross_entropy = 0.083054
Epoch 3016
Loss = 4.5234e-02, PNorm = 462.2117, GNorm = 1.5907, lr_0 = 9.9818e-04
Validation binary_cross_entropy = 0.067524
Epoch 3017
Loss = 2.5625e-02, PNorm = 462.2920, GNorm = 0.1864, lr_0 = 9.9818e-04
Validation binary_cross_entropy = 0.064391
Epoch 3018
Loss = 6.4403e-03, PNorm = 462.3565, GNorm = 0.6374, lr_0 = 9.9818e-04
Validation binary_cross_entropy = 0.079140
Epoch 3019
Loss = 1.3020e-03, PNorm = 462.4101, GNorm = 0.0611, lr_0 = 9.9818e-04
Loss = 2.1028e-02, PNorm = 462.4446, GNorm = 1.3825, lr_0 = 9.9817e-04
Validation binary_cross_entropy = 0.078692
Epoch 3020
Loss = 2.8275e-02, PNorm = 462.5094, GNorm = 1.6935, lr_0 = 9.9817e-04
Validation binary_cross_entropy = 0.083848
Epoch 3021
Loss = 3.4247e-02, PNorm = 462.5788, GNorm = 0.9918, lr_0 = 9.9817e-04
Validation binary_cross_entropy = 0.115030
Epoch 3022
Loss = 3.4579e-02, PNorm = 462.6253, GNorm = 0.9345, lr_0 = 9.9817e-04
Validation binary_cross_entropy = 0.089334
Epoch 3023
Loss = 1.0532e-02, PNorm = 462.6749, GNorm = 1.5412, lr_0 = 9.9817e-04
Validation binary_cross_entropy = 0.119735
Epoch 3024
Loss = 1.2552e-02, PNorm = 462.7210, GNorm = 0.7166, lr_0 = 9.9817e-04
Validation binary_cross_entropy = 0.149693
Epoch 3025
Loss = 2.6983e-02, PNorm = 462.7610, GNorm = 0.0523, lr_0 = 9.9817e-04
Validation binary_cross_entropy = 0.081072
Epoch 3026
Loss = 6.9043e-02, PNorm = 462.8442, GNorm = 3.3361, lr_0 = 9.9817e-04
Validation binary_cross_entropy = 0.173259
Epoch 3027
Loss = 3.5670e-02, PNorm = 462.9525, GNorm = 0.8585, lr_0 = 9.9817e-04
Validation binary_cross_entropy = 0.123215
Epoch 3028
Loss = 4.1969e-02, PNorm = 463.0173, GNorm = 0.7748, lr_0 = 9.9817e-04
Validation binary_cross_entropy = 0.095816
Epoch 3029
Loss = 7.2516e-03, PNorm = 463.0899, GNorm = 0.2052, lr_0 = 9.9817e-04
Loss = 2.4256e-02, PNorm = 463.1959, GNorm = 0.4869, lr_0 = 9.9817e-04
Validation binary_cross_entropy = 0.241794
Epoch 3030
Loss = 4.7434e-02, PNorm = 463.2699, GNorm = 0.2543, lr_0 = 9.9817e-04
Validation binary_cross_entropy = 0.194528
Epoch 3031
Loss = 7.5135e-02, PNorm = 463.3539, GNorm = 23.0687, lr_0 = 9.9817e-04
Validation binary_cross_entropy = 0.071677
Epoch 3032
Loss = 6.0560e-02, PNorm = 463.5403, GNorm = 0.1239, lr_0 = 9.9817e-04
Validation binary_cross_entropy = 0.121129
Epoch 3033
Loss = 3.3311e-02, PNorm = 463.6880, GNorm = 0.2868, lr_0 = 9.9816e-04
Validation binary_cross_entropy = 0.065417
Epoch 3034
Loss = 5.1223e-02, PNorm = 463.8030, GNorm = 0.7000, lr_0 = 9.9816e-04
Validation binary_cross_entropy = 0.091443
Epoch 3035
Loss = 5.2082e-02, PNorm = 463.9024, GNorm = 3.4634, lr_0 = 9.9816e-04
Validation binary_cross_entropy = 0.093048
Epoch 3036
Loss = 2.7891e-03, PNorm = 463.9859, GNorm = 0.0199, lr_0 = 9.9816e-04
Validation binary_cross_entropy = 0.072124
Epoch 3037
Loss = 4.1763e-02, PNorm = 464.0435, GNorm = 0.2536, lr_0 = 9.9816e-04
Validation binary_cross_entropy = 0.072298
Epoch 3038
Loss = 2.5855e-02, PNorm = 464.1172, GNorm = 0.3366, lr_0 = 9.9816e-04
Validation binary_cross_entropy = 0.101499
Epoch 3039
Loss = 7.9862e-03, PNorm = 464.1969, GNorm = 0.7130, lr_0 = 9.9816e-04
Loss = 7.9493e-03, PNorm = 464.2506, GNorm = 0.2571, lr_0 = 9.9816e-04
Validation binary_cross_entropy = 0.094843
Epoch 3040
Loss = 4.3361e-02, PNorm = 464.3026, GNorm = 0.3566, lr_0 = 9.9816e-04
Validation binary_cross_entropy = 0.086810
Epoch 3041
Loss = 7.7086e-03, PNorm = 464.3572, GNorm = 0.0421, lr_0 = 9.9816e-04
Validation binary_cross_entropy = 0.102125
Epoch 3042
Loss = 1.0737e-02, PNorm = 464.4012, GNorm = 0.0348, lr_0 = 9.9816e-04
Validation binary_cross_entropy = 0.116742
Epoch 3043
Loss = 2.6778e-02, PNorm = 464.4402, GNorm = 0.0082, lr_0 = 9.9816e-04
Validation binary_cross_entropy = 0.117943
Epoch 3044
Loss = 2.3441e-02, PNorm = 464.4944, GNorm = 0.1749, lr_0 = 9.9816e-04
Validation binary_cross_entropy = 0.143506
Epoch 3045
Loss = 1.0156e-02, PNorm = 464.6171, GNorm = 1.2610, lr_0 = 9.9816e-04
Validation binary_cross_entropy = 0.220939
Epoch 3046
Loss = 9.9314e-02, PNorm = 464.7240, GNorm = 1.1061, lr_0 = 9.9816e-04
Validation binary_cross_entropy = 0.123373
Epoch 3047
Loss = 1.9194e-02, PNorm = 464.8583, GNorm = 0.5813, lr_0 = 9.9815e-04
Validation binary_cross_entropy = 0.142557
Epoch 3048
Loss = 2.2275e-02, PNorm = 464.9769, GNorm = 0.1078, lr_0 = 9.9815e-04
Validation binary_cross_entropy = 0.146301
Epoch 3049
Loss = 2.5625e-02, PNorm = 465.0771, GNorm = 0.6335, lr_0 = 9.9815e-04
Loss = 6.5279e-02, PNorm = 465.1614, GNorm = 1.6669, lr_0 = 9.9815e-04
Validation binary_cross_entropy = 0.106320
Epoch 3050
Loss = 2.3756e-02, PNorm = 465.2421, GNorm = 0.2982, lr_0 = 9.9815e-04
Validation binary_cross_entropy = 0.153369
Epoch 3051
Loss = 3.6666e-02, PNorm = 465.3117, GNorm = 0.5686, lr_0 = 9.9815e-04
Validation binary_cross_entropy = 0.205020
Epoch 3052
Loss = 1.6912e-02, PNorm = 465.3579, GNorm = 2.4765, lr_0 = 9.9815e-04
Validation binary_cross_entropy = 0.224628
Epoch 3053
Loss = 1.5540e-02, PNorm = 465.3980, GNorm = 0.1249, lr_0 = 9.9815e-04
Validation binary_cross_entropy = 0.086018
Epoch 3054
Loss = 3.4323e-02, PNorm = 465.5771, GNorm = 0.6660, lr_0 = 9.9815e-04
Validation binary_cross_entropy = 0.103403
Epoch 3055
Loss = 6.4591e-02, PNorm = 465.7654, GNorm = 1.1069, lr_0 = 9.9815e-04
Validation binary_cross_entropy = 0.089764
Epoch 3056
Loss = 2.6037e-02, PNorm = 465.9236, GNorm = 0.6877, lr_0 = 9.9815e-04
Validation binary_cross_entropy = 0.068977
Epoch 3057
Loss = 6.9789e-02, PNorm = 466.0474, GNorm = 1.1654, lr_0 = 9.9815e-04
Validation binary_cross_entropy = 0.131557
Epoch 3058
Loss = 4.6092e-02, PNorm = 466.1925, GNorm = 4.1165, lr_0 = 9.9815e-04
Validation binary_cross_entropy = 0.069528
Epoch 3059
Loss = 1.3684e-02, PNorm = 466.3288, GNorm = 0.8035, lr_0 = 9.9815e-04
Loss = 5.8449e-02, PNorm = 466.4560, GNorm = 0.3135, lr_0 = 9.9814e-04
Validation binary_cross_entropy = 0.075364
Epoch 3060
Loss = 4.1347e-02, PNorm = 466.5553, GNorm = 1.5588, lr_0 = 9.9814e-04
Validation binary_cross_entropy = 0.086430
Epoch 3061
Loss = 3.6499e-02, PNorm = 466.6281, GNorm = 1.1251, lr_0 = 9.9814e-04
Validation binary_cross_entropy = 0.085267
Epoch 3062
Loss = 3.5909e-02, PNorm = 466.7070, GNorm = 0.1114, lr_0 = 9.9814e-04
Validation binary_cross_entropy = 0.080181
Epoch 3063
Loss = 3.7967e-02, PNorm = 466.7955, GNorm = 0.5459, lr_0 = 9.9814e-04
Validation binary_cross_entropy = 0.094243
Epoch 3064
Loss = 1.2097e-02, PNorm = 466.8738, GNorm = 0.5089, lr_0 = 9.9814e-04
Validation binary_cross_entropy = 0.080855
Epoch 3065
Loss = 9.3709e-03, PNorm = 466.9357, GNorm = 1.3127, lr_0 = 9.9814e-04
Validation binary_cross_entropy = 0.091394
Epoch 3066
Loss = 2.0058e-03, PNorm = 467.0185, GNorm = 0.0229, lr_0 = 9.9814e-04
Validation binary_cross_entropy = 0.111054
Epoch 3067
Loss = 2.4830e-02, PNorm = 467.0765, GNorm = 0.0599, lr_0 = 9.9814e-04
Validation binary_cross_entropy = 0.082874
Epoch 3068
Loss = 2.0710e-02, PNorm = 467.1205, GNorm = 0.1919, lr_0 = 9.9814e-04
Validation binary_cross_entropy = 0.083232
Epoch 3069
Loss = 2.5405e-02, PNorm = 467.2010, GNorm = 1.1310, lr_0 = 9.9814e-04
Loss = 3.9988e-02, PNorm = 467.2804, GNorm = 0.0549, lr_0 = 9.9814e-04
Validation binary_cross_entropy = 0.103683
Epoch 3070
Loss = 1.8598e-02, PNorm = 467.3319, GNorm = 0.9195, lr_0 = 9.9814e-04
Validation binary_cross_entropy = 0.086226
Epoch 3071
Loss = 2.1895e-02, PNorm = 467.3828, GNorm = 0.3412, lr_0 = 9.9814e-04
Validation binary_cross_entropy = 0.092096
Epoch 3072
Loss = 6.6662e-03, PNorm = 467.4372, GNorm = 0.0073, lr_0 = 9.9814e-04
Validation binary_cross_entropy = 0.109396
Epoch 3073
Loss = 4.3832e-02, PNorm = 467.4755, GNorm = 0.4596, lr_0 = 9.9813e-04
Validation binary_cross_entropy = 0.080529
Epoch 3074
Loss = 4.3373e-02, PNorm = 467.5723, GNorm = 0.3100, lr_0 = 9.9813e-04
Validation binary_cross_entropy = 0.096555
Epoch 3075
Loss = 2.5747e-02, PNorm = 467.6725, GNorm = 0.0292, lr_0 = 9.9813e-04
Validation binary_cross_entropy = 0.077426
Epoch 3076
Loss = 3.3949e-02, PNorm = 467.7527, GNorm = 1.4262, lr_0 = 9.9813e-04
Validation binary_cross_entropy = 0.087540
Epoch 3077
Loss = 2.4222e-02, PNorm = 467.8621, GNorm = 1.0927, lr_0 = 9.9813e-04
Validation binary_cross_entropy = 0.096553
Epoch 3078
Loss = 5.9644e-02, PNorm = 467.9427, GNorm = 2.4634, lr_0 = 9.9813e-04
Validation binary_cross_entropy = 0.080498
Epoch 3079
Loss = 4.5347e-03, PNorm = 468.0062, GNorm = 0.0958, lr_0 = 9.9813e-04
Loss = 1.7121e-02, PNorm = 468.0841, GNorm = 0.3762, lr_0 = 9.9813e-04
Validation binary_cross_entropy = 0.108769
Epoch 3080
Loss = 2.7227e-03, PNorm = 468.1549, GNorm = 0.0154, lr_0 = 9.9813e-04
Validation binary_cross_entropy = 0.214323
Epoch 3081
Loss = 1.1031e-02, PNorm = 468.2026, GNorm = 0.1411, lr_0 = 9.9813e-04
Validation binary_cross_entropy = 0.205203
Epoch 3082
Loss = 4.5033e-02, PNorm = 468.2484, GNorm = 1.8124, lr_0 = 9.9813e-04
Validation binary_cross_entropy = 0.113655
Epoch 3083
Loss = 3.7777e-02, PNorm = 468.3499, GNorm = 0.3218, lr_0 = 9.9813e-04
Validation binary_cross_entropy = 0.128987
Epoch 3084
Loss = 7.6725e-02, PNorm = 468.4509, GNorm = 0.7890, lr_0 = 9.9813e-04
Validation binary_cross_entropy = 0.079660
Epoch 3085
Loss = 1.6876e-02, PNorm = 468.5417, GNorm = 0.7603, lr_0 = 9.9813e-04
Validation binary_cross_entropy = 0.072751
Epoch 3086
Loss = 2.1115e-02, PNorm = 468.6223, GNorm = 0.4946, lr_0 = 9.9812e-04
Validation binary_cross_entropy = 0.073754
Epoch 3087
Loss = 3.2702e-03, PNorm = 468.6881, GNorm = 0.0553, lr_0 = 9.9812e-04
Validation binary_cross_entropy = 0.079992
Epoch 3088
Loss = 1.1498e-02, PNorm = 468.7479, GNorm = 0.0189, lr_0 = 9.9812e-04
Validation binary_cross_entropy = 0.114262
Epoch 3089
Loss = 3.2197e-02, PNorm = 468.7972, GNorm = 3.3185, lr_0 = 9.9812e-04
Loss = 3.9259e-02, PNorm = 468.8385, GNorm = 2.6618, lr_0 = 9.9812e-04
Validation binary_cross_entropy = 0.085492
Epoch 3090
Loss = 2.4350e-02, PNorm = 468.9109, GNorm = 0.1712, lr_0 = 9.9812e-04
Validation binary_cross_entropy = 0.090578
Epoch 3091
Loss = 2.2667e-02, PNorm = 468.9762, GNorm = 0.5422, lr_0 = 9.9812e-04
Validation binary_cross_entropy = 0.104938
Epoch 3092
Loss = 8.1333e-02, PNorm = 469.0262, GNorm = 0.8603, lr_0 = 9.9812e-04
Validation binary_cross_entropy = 0.085620
Epoch 3093
Loss = 4.1005e-02, PNorm = 469.0883, GNorm = 6.1306, lr_0 = 9.9812e-04
Validation binary_cross_entropy = 0.088211
Epoch 3094
Loss = 1.4834e-02, PNorm = 469.1582, GNorm = 0.1429, lr_0 = 9.9812e-04
Validation binary_cross_entropy = 0.086325
Epoch 3095
Loss = 3.8730e-03, PNorm = 469.2171, GNorm = 0.2996, lr_0 = 9.9812e-04
Validation binary_cross_entropy = 0.097273
Epoch 3096
Loss = 5.6980e-03, PNorm = 469.2848, GNorm = 0.4558, lr_0 = 9.9812e-04
Validation binary_cross_entropy = 0.101982
Epoch 3097
Loss = 5.6335e-03, PNorm = 469.3328, GNorm = 0.1945, lr_0 = 9.9812e-04
Validation binary_cross_entropy = 0.095660
Epoch 3098
Loss = 1.6196e-01, PNorm = 469.3708, GNorm = 5.8561, lr_0 = 9.9812e-04
Validation binary_cross_entropy = 0.073282
Epoch 3099
Loss = 7.0156e-03, PNorm = 469.4290, GNorm = 0.1197, lr_0 = 9.9812e-04
Loss = 2.5962e-01, PNorm = 469.5493, GNorm = 0.7968, lr_0 = 9.9811e-04
Validation binary_cross_entropy = 0.063219
Epoch 3100
Loss = 1.4603e-01, PNorm = 469.7124, GNorm = 0.3964, lr_0 = 9.9811e-04
Validation binary_cross_entropy = 0.082680
Epoch 3101
Loss = 3.6836e-02, PNorm = 469.8392, GNorm = 1.5362, lr_0 = 9.9811e-04
Validation binary_cross_entropy = 0.076307
Epoch 3102
Loss = 1.0729e-02, PNorm = 469.9206, GNorm = 0.6031, lr_0 = 9.9811e-04
Validation binary_cross_entropy = 0.094838
Epoch 3103
Loss = 7.5577e-03, PNorm = 469.9714, GNorm = 0.0125, lr_0 = 9.9811e-04
Validation binary_cross_entropy = 0.097833
Epoch 3104
Loss = 7.0086e-03, PNorm = 470.0048, GNorm = 0.0141, lr_0 = 9.9811e-04
Validation binary_cross_entropy = 0.103298
Epoch 3105
Loss = 5.1680e-02, PNorm = 470.0532, GNorm = 0.0677, lr_0 = 9.9811e-04
Validation binary_cross_entropy = 0.096885
Epoch 3106
Loss = 2.0957e-02, PNorm = 470.1066, GNorm = 1.6988, lr_0 = 9.9811e-04
Validation binary_cross_entropy = 0.100108
Epoch 3107
Loss = 1.9957e-02, PNorm = 470.1483, GNorm = 1.2565, lr_0 = 9.9811e-04
Validation binary_cross_entropy = 0.087797
Epoch 3108
Loss = 6.3064e-03, PNorm = 470.1840, GNorm = 0.1294, lr_0 = 9.9811e-04
Validation binary_cross_entropy = 0.097507
Epoch 3109
Loss = 5.0081e-02, PNorm = 470.2423, GNorm = 1.4252, lr_0 = 9.9811e-04
Loss = 2.6179e-02, PNorm = 470.2985, GNorm = 0.4666, lr_0 = 9.9811e-04
Validation binary_cross_entropy = 0.096192
Epoch 3110
Loss = 4.4482e-02, PNorm = 470.3447, GNorm = 0.4057, lr_0 = 9.9811e-04
Validation binary_cross_entropy = 0.089416
Epoch 3111
Loss = 1.5074e-02, PNorm = 470.3964, GNorm = 0.4947, lr_0 = 9.9811e-04
Validation binary_cross_entropy = 0.098305
Epoch 3112
Loss = 1.0660e-02, PNorm = 470.4470, GNorm = 0.0391, lr_0 = 9.9810e-04
Validation binary_cross_entropy = 0.125138
Epoch 3113
Loss = 6.0663e-03, PNorm = 470.4788, GNorm = 0.2896, lr_0 = 9.9810e-04
Validation binary_cross_entropy = 0.109783
Epoch 3114
Loss = 5.0697e-02, PNorm = 470.5030, GNorm = 0.8220, lr_0 = 9.9810e-04
Validation binary_cross_entropy = 0.093394
Epoch 3115
Loss = 6.1544e-03, PNorm = 470.5715, GNorm = 0.3861, lr_0 = 9.9810e-04
Validation binary_cross_entropy = 0.101373
Epoch 3116
Loss = 3.8899e-02, PNorm = 470.6210, GNorm = 0.0989, lr_0 = 9.9810e-04
Validation binary_cross_entropy = 0.090424
Epoch 3117
Loss = 2.1637e-02, PNorm = 470.6585, GNorm = 0.6000, lr_0 = 9.9810e-04
Validation binary_cross_entropy = 0.093141
Epoch 3118
Loss = 7.1070e-03, PNorm = 470.7005, GNorm = 0.1016, lr_0 = 9.9810e-04
Validation binary_cross_entropy = 0.110091
Epoch 3119
Loss = 2.4129e-03, PNorm = 470.7415, GNorm = 0.2221, lr_0 = 9.9810e-04
Loss = 3.5589e-02, PNorm = 470.7774, GNorm = 0.0121, lr_0 = 9.9810e-04
Validation binary_cross_entropy = 0.099058
Epoch 3120
Loss = 3.2358e-02, PNorm = 470.8112, GNorm = 0.0828, lr_0 = 9.9810e-04
Validation binary_cross_entropy = 0.084001
Epoch 3121
Loss = 2.2927e-02, PNorm = 470.8613, GNorm = 0.0449, lr_0 = 9.9810e-04
Validation binary_cross_entropy = 0.094174
Epoch 3122
Loss = 2.1280e-02, PNorm = 470.9028, GNorm = 2.3478, lr_0 = 9.9810e-04
Validation binary_cross_entropy = 0.095161
Epoch 3123
Loss = 2.3302e-02, PNorm = 470.9491, GNorm = 1.2340, lr_0 = 9.9810e-04
Validation binary_cross_entropy = 0.093542
Epoch 3124
Loss = 3.5811e-02, PNorm = 470.9888, GNorm = 6.8747, lr_0 = 9.9810e-04
Validation binary_cross_entropy = 0.087074
Epoch 3125
Loss = 3.4056e-02, PNorm = 471.0376, GNorm = 0.9641, lr_0 = 9.9810e-04
Validation binary_cross_entropy = 0.089156
Epoch 3126
Loss = 1.1346e-02, PNorm = 471.1230, GNorm = 0.1774, lr_0 = 9.9809e-04
Validation binary_cross_entropy = 0.111321
Epoch 3127
Loss = 5.0488e-03, PNorm = 471.1846, GNorm = 0.0468, lr_0 = 9.9809e-04
Validation binary_cross_entropy = 0.091633
Epoch 3128
Loss = 8.8265e-03, PNorm = 471.2275, GNorm = 0.3390, lr_0 = 9.9809e-04
Validation binary_cross_entropy = 0.089990
Epoch 3129
Loss = 2.8088e-02, PNorm = 471.3030, GNorm = 1.4231, lr_0 = 9.9809e-04
Loss = 9.0807e-03, PNorm = 471.3676, GNorm = 0.4024, lr_0 = 9.9809e-04
Validation binary_cross_entropy = 0.106691
Epoch 3130
Loss = 4.1803e-02, PNorm = 471.3998, GNorm = 0.7800, lr_0 = 9.9809e-04
Validation binary_cross_entropy = 0.085598
Epoch 3131
Loss = 1.2933e-02, PNorm = 471.4435, GNorm = 0.0946, lr_0 = 9.9809e-04
Validation binary_cross_entropy = 0.085580
Epoch 3132
Loss = 1.7094e-02, PNorm = 471.4791, GNorm = 0.0427, lr_0 = 9.9809e-04
Validation binary_cross_entropy = 0.086504
Epoch 3133
Loss = 9.2764e-03, PNorm = 471.5035, GNorm = 0.0538, lr_0 = 9.9809e-04
Validation binary_cross_entropy = 0.084522
Epoch 3134
Loss = 8.7054e-03, PNorm = 471.5474, GNorm = 0.5255, lr_0 = 9.9809e-04
Validation binary_cross_entropy = 0.070548
Epoch 3135
Loss = 1.5998e-02, PNorm = 471.6007, GNorm = 0.3680, lr_0 = 9.9809e-04
Validation binary_cross_entropy = 0.092938
Epoch 3136
Loss = 4.7477e-03, PNorm = 471.6557, GNorm = 0.4934, lr_0 = 9.9809e-04
Validation binary_cross_entropy = 0.086931
Epoch 3137
Loss = 5.9100e-02, PNorm = 471.7037, GNorm = 2.9224, lr_0 = 9.9809e-04
Validation binary_cross_entropy = 0.087234
Epoch 3138
Loss = 2.9528e-02, PNorm = 471.7688, GNorm = 0.9919, lr_0 = 9.9809e-04
Validation binary_cross_entropy = 0.084676
Epoch 3139
Loss = 1.0405e-03, PNorm = 471.8311, GNorm = 0.0403, lr_0 = 9.9808e-04
Loss = 3.4273e-02, PNorm = 471.8908, GNorm = 0.8955, lr_0 = 9.9808e-04
Validation binary_cross_entropy = 0.108293
Epoch 3140
Loss = 1.8920e-02, PNorm = 471.9425, GNorm = 1.8720, lr_0 = 9.9808e-04
Validation binary_cross_entropy = 0.096278
Epoch 3141
Loss = 2.2142e-02, PNorm = 471.9962, GNorm = 0.7466, lr_0 = 9.9808e-04
Validation binary_cross_entropy = 0.089192
Epoch 3142
Loss = 4.1489e-02, PNorm = 472.0758, GNorm = 1.5392, lr_0 = 9.9808e-04
Validation binary_cross_entropy = 0.088417
Epoch 3143
Loss = 3.8757e-02, PNorm = 472.1612, GNorm = 0.2960, lr_0 = 9.9808e-04
Validation binary_cross_entropy = 0.082086
Epoch 3144
Loss = 3.0489e-02, PNorm = 472.2353, GNorm = 0.0899, lr_0 = 9.9808e-04
Validation binary_cross_entropy = 0.068559
Epoch 3145
Loss = 1.0980e-02, PNorm = 472.3172, GNorm = 3.0044, lr_0 = 9.9808e-04
Validation binary_cross_entropy = 0.137196
Epoch 3146
Loss = 5.8752e-03, PNorm = 472.3885, GNorm = 0.0022, lr_0 = 9.9808e-04
Validation binary_cross_entropy = 0.101709
Epoch 3147
Loss = 2.7456e-03, PNorm = 472.4383, GNorm = 0.1808, lr_0 = 9.9808e-04
Validation binary_cross_entropy = 0.089036
Epoch 3148
Loss = 7.1708e-03, PNorm = 472.5249, GNorm = 0.2502, lr_0 = 9.9808e-04
Validation binary_cross_entropy = 0.085519
Epoch 3149
Loss = 4.8509e-03, PNorm = 472.6032, GNorm = 0.3587, lr_0 = 9.9808e-04
Loss = 2.5553e-02, PNorm = 472.6747, GNorm = 0.8250, lr_0 = 9.9808e-04
Validation binary_cross_entropy = 0.091870
Epoch 3150
Loss = 1.0813e-02, PNorm = 472.7470, GNorm = 0.1857, lr_0 = 9.9808e-04
Validation binary_cross_entropy = 0.108360
Epoch 3151
Loss = 3.7159e-02, PNorm = 472.7991, GNorm = 0.1410, lr_0 = 9.9808e-04
Validation binary_cross_entropy = 0.106207
Epoch 3152
Loss = 2.1703e-02, PNorm = 472.8725, GNorm = 4.5128, lr_0 = 9.9807e-04
Validation binary_cross_entropy = 0.114528
Epoch 3153
Loss = 3.2458e-02, PNorm = 472.9488, GNorm = 2.0678, lr_0 = 9.9807e-04
Validation binary_cross_entropy = 0.102120
Epoch 3154
Loss = 8.6535e-03, PNorm = 472.9980, GNorm = 0.2435, lr_0 = 9.9807e-04
Validation binary_cross_entropy = 0.101376
Epoch 3155
Loss = 2.8650e-02, PNorm = 473.0759, GNorm = 1.3539, lr_0 = 9.9807e-04
Validation binary_cross_entropy = 0.133120
Epoch 3156
Loss = 5.7610e-02, PNorm = 473.1437, GNorm = 1.9384, lr_0 = 9.9807e-04
Validation binary_cross_entropy = 0.093222
Epoch 3157
Loss = 6.2358e-02, PNorm = 473.2031, GNorm = 3.7796, lr_0 = 9.9807e-04
Validation binary_cross_entropy = 0.092499
Epoch 3158
Loss = 9.1302e-03, PNorm = 473.2826, GNorm = 0.0718, lr_0 = 9.9807e-04
Validation binary_cross_entropy = 0.076973
Epoch 3159
Loss = 1.0298e-02, PNorm = 473.3573, GNorm = 0.2480, lr_0 = 9.9807e-04
Loss = 6.0556e-02, PNorm = 473.4341, GNorm = 0.6529, lr_0 = 9.9807e-04
Validation binary_cross_entropy = 0.084999
Epoch 3160
Loss = 2.0405e-02, PNorm = 473.5087, GNorm = 0.2717, lr_0 = 9.9807e-04
Validation binary_cross_entropy = 0.080583
Epoch 3161
Loss = 1.6704e-02, PNorm = 473.5693, GNorm = 0.2613, lr_0 = 9.9807e-04
Validation binary_cross_entropy = 0.082671
Epoch 3162
Loss = 8.3692e-03, PNorm = 473.6271, GNorm = 0.0258, lr_0 = 9.9807e-04
Validation binary_cross_entropy = 0.093988
Epoch 3163
Loss = 3.5793e-03, PNorm = 473.6861, GNorm = 0.6734, lr_0 = 9.9807e-04
Validation binary_cross_entropy = 0.102918
Epoch 3164
Loss = 6.6683e-03, PNorm = 473.7335, GNorm = 0.4933, lr_0 = 9.9807e-04
Validation binary_cross_entropy = 0.124390
Epoch 3165
Loss = 2.9671e-02, PNorm = 473.7675, GNorm = 1.7553, lr_0 = 9.9807e-04
Validation binary_cross_entropy = 0.108087
Epoch 3166
Loss = 1.1823e-02, PNorm = 473.7941, GNorm = 0.0143, lr_0 = 9.9806e-04
Validation binary_cross_entropy = 0.085450
Epoch 3167
Loss = 1.4578e-02, PNorm = 473.8376, GNorm = 1.4862, lr_0 = 9.9806e-04
Validation binary_cross_entropy = 0.088635
Epoch 3168
Loss = 5.7938e-02, PNorm = 473.8851, GNorm = 1.3914, lr_0 = 9.9806e-04
Validation binary_cross_entropy = 0.094922
Epoch 3169
Loss = 3.2442e-02, PNorm = 473.9432, GNorm = 1.9591, lr_0 = 9.9806e-04
Loss = 3.9230e-02, PNorm = 473.9983, GNorm = 1.2847, lr_0 = 9.9806e-04
Validation binary_cross_entropy = 0.090842
Epoch 3170
Loss = 2.1888e-02, PNorm = 474.0446, GNorm = 0.2303, lr_0 = 9.9806e-04
Validation binary_cross_entropy = 0.082168
Epoch 3171
Loss = 2.5545e-02, PNorm = 474.0861, GNorm = 1.0830, lr_0 = 9.9806e-04
Validation binary_cross_entropy = 0.098681
Epoch 3172
Loss = 1.5989e-02, PNorm = 474.1354, GNorm = 0.3067, lr_0 = 9.9806e-04
Validation binary_cross_entropy = 0.106482
Epoch 3173
Loss = 1.2475e-02, PNorm = 474.2011, GNorm = 0.0246, lr_0 = 9.9806e-04
Validation binary_cross_entropy = 0.148803
Epoch 3174
Loss = 9.0981e-03, PNorm = 474.2548, GNorm = 0.6459, lr_0 = 9.9806e-04
Validation binary_cross_entropy = 0.134424
Epoch 3175
Loss = 7.0505e-04, PNorm = 474.2815, GNorm = 0.0243, lr_0 = 9.9806e-04
Validation binary_cross_entropy = 0.109747
Epoch 3176
Loss = 5.9683e-03, PNorm = 474.3290, GNorm = 0.0980, lr_0 = 9.9806e-04
Validation binary_cross_entropy = 0.127030
Epoch 3177
Loss = 4.9836e-02, PNorm = 474.3853, GNorm = 0.1134, lr_0 = 9.9806e-04
Validation binary_cross_entropy = 0.116518
Epoch 3178
Loss = 2.8158e-02, PNorm = 474.4192, GNorm = 0.0956, lr_0 = 9.9806e-04
Validation binary_cross_entropy = 0.083003
Epoch 3179
Loss = 1.5437e-02, PNorm = 474.4590, GNorm = 0.5440, lr_0 = 9.9805e-04
Loss = 4.4522e-02, PNorm = 474.5234, GNorm = 0.0578, lr_0 = 9.9805e-04
Validation binary_cross_entropy = 0.075983
Epoch 3180
Loss = 1.3929e-02, PNorm = 474.5887, GNorm = 0.4883, lr_0 = 9.9805e-04
Validation binary_cross_entropy = 0.082659
Epoch 3181
Loss = 1.8647e-02, PNorm = 474.6377, GNorm = 2.0736, lr_0 = 9.9805e-04
Validation binary_cross_entropy = 0.078486
Epoch 3182
Loss = 1.3786e-02, PNorm = 474.6856, GNorm = 0.6984, lr_0 = 9.9805e-04
Validation binary_cross_entropy = 0.090351
Epoch 3183
Loss = 9.1050e-03, PNorm = 474.7339, GNorm = 1.8354, lr_0 = 9.9805e-04
Validation binary_cross_entropy = 0.112107
Epoch 3184
Loss = 1.9866e-02, PNorm = 474.7781, GNorm = 4.2999, lr_0 = 9.9805e-04
Validation binary_cross_entropy = 0.116632
Epoch 3185
Loss = 4.2315e-02, PNorm = 474.8344, GNorm = 0.2220, lr_0 = 9.9805e-04
Validation binary_cross_entropy = 0.086574
Epoch 3186
Loss = 8.0075e-03, PNorm = 474.9619, GNorm = 0.0436, lr_0 = 9.9805e-04
Validation binary_cross_entropy = 0.113750
Epoch 3187
Loss = 3.0713e-02, PNorm = 475.0704, GNorm = 1.0096, lr_0 = 9.9805e-04
Validation binary_cross_entropy = 0.077417
Epoch 3188
Loss = 1.8210e-02, PNorm = 475.1728, GNorm = 0.7647, lr_0 = 9.9805e-04
Validation binary_cross_entropy = 0.095625
Epoch 3189
Loss = 1.1752e-02, PNorm = 475.2593, GNorm = 0.9075, lr_0 = 9.9805e-04
Loss = 2.6387e-02, PNorm = 475.3399, GNorm = 1.2397, lr_0 = 9.9805e-04
Validation binary_cross_entropy = 0.107667
Epoch 3190
Loss = 1.3601e-02, PNorm = 475.4001, GNorm = 0.1029, lr_0 = 9.9805e-04
Validation binary_cross_entropy = 0.098904
Epoch 3191
Loss = 5.9384e-02, PNorm = 475.4662, GNorm = 0.4934, lr_0 = 9.9805e-04
Validation binary_cross_entropy = 0.070285
Epoch 3192
Loss = 5.9348e-02, PNorm = 475.5501, GNorm = 0.5628, lr_0 = 9.9804e-04
Validation binary_cross_entropy = 0.069893
Epoch 3193
Loss = 2.8485e-02, PNorm = 475.6181, GNorm = 0.9343, lr_0 = 9.9804e-04
Validation binary_cross_entropy = 0.063607
Epoch 3194
Loss = 9.8127e-03, PNorm = 475.6801, GNorm = 0.3889, lr_0 = 9.9804e-04
Validation binary_cross_entropy = 0.072235
Epoch 3195
Loss = 3.1022e-02, PNorm = 475.7427, GNorm = 0.1819, lr_0 = 9.9804e-04
Validation binary_cross_entropy = 0.070379
Epoch 3196
Loss = 4.6499e-02, PNorm = 475.8176, GNorm = 0.6643, lr_0 = 9.9804e-04
Validation binary_cross_entropy = 0.100485
Epoch 3197
Loss = 3.8104e-03, PNorm = 475.9179, GNorm = 0.2162, lr_0 = 9.9804e-04
Validation binary_cross_entropy = 0.087668
Epoch 3198
Loss = 3.6775e-02, PNorm = 476.0044, GNorm = 1.7900, lr_0 = 9.9804e-04
Validation binary_cross_entropy = 0.075116
Epoch 3199
Loss = 3.9972e-03, PNorm = 476.0773, GNorm = 0.1912, lr_0 = 9.9804e-04
Loss = 3.5855e-02, PNorm = 476.1477, GNorm = 0.0961, lr_0 = 9.9804e-04
Validation binary_cross_entropy = 0.078544
Epoch 3200
Loss = 1.8907e-02, PNorm = 476.2113, GNorm = 3.6414, lr_0 = 9.9804e-04
Validation binary_cross_entropy = 0.083603
Epoch 3201
Loss = 3.3612e-02, PNorm = 476.2725, GNorm = 0.7250, lr_0 = 9.9804e-04
Validation binary_cross_entropy = 0.093800
Epoch 3202
Loss = 2.3416e-02, PNorm = 476.3405, GNorm = 0.1485, lr_0 = 9.9804e-04
Validation binary_cross_entropy = 0.085633
Epoch 3203
Loss = 3.3346e-02, PNorm = 476.4283, GNorm = 0.6517, lr_0 = 9.9804e-04
Validation binary_cross_entropy = 0.084525
Epoch 3204
Loss = 2.3636e-02, PNorm = 476.5095, GNorm = 0.1352, lr_0 = 9.9804e-04
Validation binary_cross_entropy = 0.081706
Epoch 3205
Loss = 4.0044e-02, PNorm = 476.5767, GNorm = 0.2710, lr_0 = 9.9803e-04
Validation binary_cross_entropy = 0.076998
Epoch 3206
Loss = 2.5955e-02, PNorm = 476.6476, GNorm = 0.3713, lr_0 = 9.9803e-04
Validation binary_cross_entropy = 0.069184
Epoch 3207
Loss = 1.1669e-02, PNorm = 476.7167, GNorm = 0.2008, lr_0 = 9.9803e-04
Validation binary_cross_entropy = 0.081932
Epoch 3208
Loss = 1.0812e-02, PNorm = 476.7865, GNorm = 0.8794, lr_0 = 9.9803e-04
Validation binary_cross_entropy = 0.105688
Epoch 3209
Loss = 2.3378e-02, PNorm = 476.8446, GNorm = 1.1779, lr_0 = 9.9803e-04
Loss = 1.5454e-02, PNorm = 476.9039, GNorm = 0.5000, lr_0 = 9.9803e-04
Validation binary_cross_entropy = 0.131605
Epoch 3210
Loss = 2.6495e-02, PNorm = 476.9554, GNorm = 0.1799, lr_0 = 9.9803e-04
Validation binary_cross_entropy = 0.086541
Epoch 3211
Loss = 6.9427e-02, PNorm = 477.0508, GNorm = 0.1392, lr_0 = 9.9803e-04
Validation binary_cross_entropy = 0.088254
Epoch 3212
Loss = 3.2662e-02, PNorm = 477.1312, GNorm = 0.9563, lr_0 = 9.9803e-04
Validation binary_cross_entropy = 0.075489
Epoch 3213
Loss = 3.7112e-02, PNorm = 477.1984, GNorm = 0.0277, lr_0 = 9.9803e-04
Validation binary_cross_entropy = 0.089971
Epoch 3214
Loss = 2.7401e-02, PNorm = 477.2700, GNorm = 0.0698, lr_0 = 9.9803e-04
Validation binary_cross_entropy = 0.085571
Epoch 3215
Loss = 1.5253e-02, PNorm = 477.3174, GNorm = 0.0969, lr_0 = 9.9803e-04
Validation binary_cross_entropy = 0.069169
Epoch 3216
Loss = 4.5965e-02, PNorm = 477.3689, GNorm = 0.9252, lr_0 = 9.9803e-04
Validation binary_cross_entropy = 0.073323
Epoch 3217
Loss = 2.6847e-02, PNorm = 477.4368, GNorm = 0.4384, lr_0 = 9.9803e-04
Validation binary_cross_entropy = 0.086816
Epoch 3218
Loss = 6.7887e-03, PNorm = 477.5046, GNorm = 0.2617, lr_0 = 9.9803e-04
Validation binary_cross_entropy = 0.108644
Epoch 3219
Loss = 6.4659e-03, PNorm = 477.5576, GNorm = 0.3305, lr_0 = 9.9802e-04
Loss = 1.7157e-02, PNorm = 477.5860, GNorm = 0.0306, lr_0 = 9.9802e-04
Validation binary_cross_entropy = 0.105877
Epoch 3220
Loss = 1.4447e-02, PNorm = 477.6299, GNorm = 0.0548, lr_0 = 9.9802e-04
Validation binary_cross_entropy = 0.097575
Epoch 3221
Loss = 6.8960e-03, PNorm = 477.6838, GNorm = 0.3992, lr_0 = 9.9802e-04
Validation binary_cross_entropy = 0.114634
Epoch 3222
Loss = 1.5311e-01, PNorm = 477.7671, GNorm = 3.3949, lr_0 = 9.9802e-04
Validation binary_cross_entropy = 0.061720
Epoch 3223
Loss = 2.5724e-02, PNorm = 477.9105, GNorm = 0.6740, lr_0 = 9.9802e-04
Validation binary_cross_entropy = 0.082846
Epoch 3224
Loss = 2.7015e-02, PNorm = 478.0101, GNorm = 0.5173, lr_0 = 9.9802e-04
Validation binary_cross_entropy = 0.067699
Epoch 3225
Loss = 2.8747e-02, PNorm = 478.0764, GNorm = 0.1129, lr_0 = 9.9802e-04
Validation binary_cross_entropy = 0.067085
Epoch 3226
Loss = 4.4162e-02, PNorm = 478.1502, GNorm = 0.1066, lr_0 = 9.9802e-04
Validation binary_cross_entropy = 0.087746
Epoch 3227
Loss = 2.1683e-02, PNorm = 478.2160, GNorm = 0.0341, lr_0 = 9.9802e-04
Validation binary_cross_entropy = 0.075854
Epoch 3228
Loss = 9.4769e-02, PNorm = 478.2657, GNorm = 1.0138, lr_0 = 9.9802e-04
Validation binary_cross_entropy = 0.065442
Epoch 3229
Loss = 5.6953e-03, PNorm = 478.3112, GNorm = 0.2253, lr_0 = 9.9802e-04
Loss = 3.2336e-02, PNorm = 478.3540, GNorm = 0.2570, lr_0 = 9.9802e-04
Validation binary_cross_entropy = 0.066433
Epoch 3230
Loss = 1.8813e-02, PNorm = 478.4041, GNorm = 1.4330, lr_0 = 9.9802e-04
Validation binary_cross_entropy = 0.075318
Epoch 3231
Loss = 6.0636e-03, PNorm = 478.4452, GNorm = 0.2795, lr_0 = 9.9801e-04
Validation binary_cross_entropy = 0.081469
Epoch 3232
Loss = 6.5172e-03, PNorm = 478.4784, GNorm = 1.0442, lr_0 = 9.9801e-04
Validation binary_cross_entropy = 0.090659
Epoch 3233
Loss = 3.6023e-02, PNorm = 478.5006, GNorm = 2.5553, lr_0 = 9.9801e-04
Validation binary_cross_entropy = 0.078852
Epoch 3234
Loss = 3.3063e-02, PNorm = 478.5332, GNorm = 2.0220, lr_0 = 9.9801e-04
Validation binary_cross_entropy = 0.091673
Epoch 3235
Loss = 1.6138e-02, PNorm = 478.5909, GNorm = 1.7436, lr_0 = 9.9801e-04
Validation binary_cross_entropy = 0.115180
Epoch 3236
Loss = 2.3813e-03, PNorm = 478.6402, GNorm = 0.0237, lr_0 = 9.9801e-04
Validation binary_cross_entropy = 0.085350
Epoch 3237
Loss = 8.6187e-03, PNorm = 478.6744, GNorm = 0.3312, lr_0 = 9.9801e-04
Validation binary_cross_entropy = 0.078016
Epoch 3238
Loss = 1.4939e-02, PNorm = 478.7279, GNorm = 0.6130, lr_0 = 9.9801e-04
Validation binary_cross_entropy = 0.097589
Epoch 3239
Loss = 3.6364e-03, PNorm = 478.7867, GNorm = 0.2311, lr_0 = 9.9801e-04
Loss = 3.9533e-02, PNorm = 478.8373, GNorm = 0.6351, lr_0 = 9.9801e-04
Validation binary_cross_entropy = 0.081663
Epoch 3240
Loss = 9.5461e-03, PNorm = 478.8996, GNorm = 0.0377, lr_0 = 9.9801e-04
Validation binary_cross_entropy = 0.074115
Epoch 3241
Loss = 4.5491e-02, PNorm = 478.9563, GNorm = 0.4675, lr_0 = 9.9801e-04
Validation binary_cross_entropy = 0.067590
Epoch 3242
Loss = 1.5989e-02, PNorm = 479.0203, GNorm = 0.2101, lr_0 = 9.9801e-04
Validation binary_cross_entropy = 0.068901
Epoch 3243
Loss = 1.9476e-02, PNorm = 479.0843, GNorm = 0.1836, lr_0 = 9.9801e-04
Validation binary_cross_entropy = 0.071838
Epoch 3244
Loss = 1.9228e-02, PNorm = 479.1479, GNorm = 0.4569, lr_0 = 9.9801e-04
Validation binary_cross_entropy = 0.078011
Epoch 3245
Loss = 2.0216e-02, PNorm = 479.2082, GNorm = 0.3160, lr_0 = 9.9800e-04
Validation binary_cross_entropy = 0.086432
Epoch 3246
Loss = 6.7193e-03, PNorm = 479.2734, GNorm = 0.5368, lr_0 = 9.9800e-04
Validation binary_cross_entropy = 0.102001
Epoch 3247
Loss = 2.5827e-02, PNorm = 479.3231, GNorm = 0.0151, lr_0 = 9.9800e-04
Validation binary_cross_entropy = 0.097819
Epoch 3248
Loss = 1.1196e-03, PNorm = 479.3656, GNorm = 0.0549, lr_0 = 9.9800e-04
Validation binary_cross_entropy = 0.089977
Epoch 3249
Loss = 8.8419e-04, PNorm = 479.4091, GNorm = 0.0217, lr_0 = 9.9800e-04
Loss = 1.8628e-02, PNorm = 479.4565, GNorm = 1.6140, lr_0 = 9.9800e-04
Validation binary_cross_entropy = 0.105359
Epoch 3250
Loss = 6.3782e-02, PNorm = 479.5029, GNorm = 0.1734, lr_0 = 9.9800e-04
Validation binary_cross_entropy = 0.082130
Epoch 3251
Loss = 7.2867e-02, PNorm = 479.6142, GNorm = 0.2996, lr_0 = 9.9800e-04
Validation binary_cross_entropy = 0.078230
Epoch 3252
Loss = 1.2027e-02, PNorm = 479.7219, GNorm = 0.0739, lr_0 = 9.9800e-04
Validation binary_cross_entropy = 0.080785
Epoch 3253
Loss = 2.1845e-02, PNorm = 479.7975, GNorm = 1.9150, lr_0 = 9.9800e-04
Validation binary_cross_entropy = 0.086487
Epoch 3254
Loss = 2.4370e-02, PNorm = 479.8783, GNorm = 0.1821, lr_0 = 9.9800e-04
Validation binary_cross_entropy = 0.080468
Epoch 3255
Loss = 3.2537e-02, PNorm = 479.9752, GNorm = 0.5652, lr_0 = 9.9800e-04
Validation binary_cross_entropy = 0.082103
Epoch 3256
Loss = 7.2187e-02, PNorm = 480.0478, GNorm = 1.1031, lr_0 = 9.9800e-04
Validation binary_cross_entropy = 0.084256
Epoch 3257
Loss = 6.8380e-02, PNorm = 480.1243, GNorm = 1.4829, lr_0 = 9.9800e-04
Validation binary_cross_entropy = 0.081459
Epoch 3258
Loss = 5.7768e-02, PNorm = 480.1896, GNorm = 2.0289, lr_0 = 9.9799e-04
Validation binary_cross_entropy = 0.079945
Epoch 3259
Loss = 8.3315e-03, PNorm = 480.2622, GNorm = 0.4774, lr_0 = 9.9799e-04
Loss = 4.5397e-02, PNorm = 480.3283, GNorm = 1.6779, lr_0 = 9.9799e-04
Validation binary_cross_entropy = 0.069298
Epoch 3260
Loss = 3.4858e-02, PNorm = 480.3965, GNorm = 0.4287, lr_0 = 9.9799e-04
Validation binary_cross_entropy = 0.061432
Epoch 3261
Loss = 2.9626e-02, PNorm = 480.4591, GNorm = 0.8721, lr_0 = 9.9799e-04
Validation binary_cross_entropy = 0.064715
Epoch 3262
Loss = 1.7821e-02, PNorm = 480.5208, GNorm = 0.2964, lr_0 = 9.9799e-04
Validation binary_cross_entropy = 0.068997
Epoch 3263
Loss = 1.1972e-02, PNorm = 480.5633, GNorm = 0.1276, lr_0 = 9.9799e-04
Validation binary_cross_entropy = 0.077060
Epoch 3264
Loss = 3.5132e-02, PNorm = 480.5887, GNorm = 0.9212, lr_0 = 9.9799e-04
Validation binary_cross_entropy = 0.077400
Epoch 3265
Loss = 4.8505e-03, PNorm = 480.6110, GNorm = 0.0632, lr_0 = 9.9799e-04
Validation binary_cross_entropy = 0.076655
Epoch 3266
Loss = 1.6977e-03, PNorm = 480.6532, GNorm = 0.1819, lr_0 = 9.9799e-04
Validation binary_cross_entropy = 0.078837
Epoch 3267
Loss = 1.1816e-01, PNorm = 480.6884, GNorm = 3.8349, lr_0 = 9.9799e-04
Validation binary_cross_entropy = 0.056856
Epoch 3268
Loss = 6.0749e-02, PNorm = 480.7399, GNorm = 0.6481, lr_0 = 9.9799e-04
Validation binary_cross_entropy = 0.065797
Epoch 3269
Loss = 3.2037e-01, PNorm = 480.8327, GNorm = 4.5766, lr_0 = 9.9799e-04
Loss = 3.6139e-02, PNorm = 480.9147, GNorm = 0.1866, lr_0 = 9.9799e-04
Validation binary_cross_entropy = 0.047960
Epoch 3270
Loss = 2.4372e-02, PNorm = 480.9860, GNorm = 0.0636, lr_0 = 9.9799e-04
Validation binary_cross_entropy = 0.057850
Epoch 3271
Loss = 1.7741e-02, PNorm = 481.0506, GNorm = 0.0698, lr_0 = 9.9798e-04
Validation binary_cross_entropy = 0.064745
Epoch 3272
Loss = 1.3215e-02, PNorm = 481.0975, GNorm = 0.5976, lr_0 = 9.9798e-04
Validation binary_cross_entropy = 0.066704
Epoch 3273
Loss = 5.6954e-03, PNorm = 481.1372, GNorm = 0.1719, lr_0 = 9.9798e-04
Validation binary_cross_entropy = 0.074049
Epoch 3274
Loss = 2.9257e-02, PNorm = 481.1677, GNorm = 1.4662, lr_0 = 9.9798e-04
Validation binary_cross_entropy = 0.065471
Epoch 3275
Loss = 1.1594e-02, PNorm = 481.1950, GNorm = 1.8496, lr_0 = 9.9798e-04
Validation binary_cross_entropy = 0.064372
Epoch 3276
Loss = 2.6752e-02, PNorm = 481.2347, GNorm = 0.0517, lr_0 = 9.9798e-04
Validation binary_cross_entropy = 0.068338
Epoch 3277
Loss = 2.5850e-03, PNorm = 481.2742, GNorm = 0.0215, lr_0 = 9.9798e-04
Validation binary_cross_entropy = 0.071935
Epoch 3278
Loss = 2.2900e-03, PNorm = 481.3069, GNorm = 0.1519, lr_0 = 9.9798e-04
Validation binary_cross_entropy = 0.064300
Epoch 3279
Loss = 2.8586e-03, PNorm = 481.3337, GNorm = 0.0840, lr_0 = 9.9798e-04
Loss = 2.6303e-02, PNorm = 481.3582, GNorm = 0.6199, lr_0 = 9.9798e-04
Validation binary_cross_entropy = 0.054835
Epoch 3280
Loss = 1.5892e-02, PNorm = 481.4060, GNorm = 0.1143, lr_0 = 9.9798e-04
Validation binary_cross_entropy = 0.065012
Epoch 3281
Loss = 2.2698e-02, PNorm = 481.4387, GNorm = 2.3045, lr_0 = 9.9798e-04
Validation binary_cross_entropy = 0.061058
Epoch 3282
Loss = 5.7265e-03, PNorm = 481.4824, GNorm = 0.3012, lr_0 = 9.9798e-04
Validation binary_cross_entropy = 0.071700
Epoch 3283
Loss = 4.4676e-02, PNorm = 481.5350, GNorm = 2.6422, lr_0 = 9.9798e-04
Validation binary_cross_entropy = 0.062446
Epoch 3284
Loss = 1.2954e-02, PNorm = 481.5893, GNorm = 0.3387, lr_0 = 9.9798e-04
Validation binary_cross_entropy = 0.067922
Epoch 3285
Loss = 6.6194e-02, PNorm = 481.6612, GNorm = 0.8040, lr_0 = 9.9797e-04
Validation binary_cross_entropy = 0.066323
Epoch 3286
Loss = 1.5855e-02, PNorm = 481.7375, GNorm = 0.0534, lr_0 = 9.9797e-04
Validation binary_cross_entropy = 0.072329
Epoch 3287
Loss = 8.2010e-03, PNorm = 481.7948, GNorm = 0.0596, lr_0 = 9.9797e-04
Validation binary_cross_entropy = 0.069128
Epoch 3288
Loss = 3.8840e-02, PNorm = 481.8402, GNorm = 2.1375, lr_0 = 9.9797e-04
Validation binary_cross_entropy = 0.085806
Epoch 3289
Loss = 1.4034e-01, PNorm = 481.8986, GNorm = 2.8899, lr_0 = 9.9797e-04
Loss = 1.1722e-02, PNorm = 481.9428, GNorm = 0.4734, lr_0 = 9.9797e-04
Validation binary_cross_entropy = 0.071301
Epoch 3290
Loss = 2.5215e-02, PNorm = 481.9766, GNorm = 0.5154, lr_0 = 9.9797e-04
Validation binary_cross_entropy = 0.066915
Epoch 3291
Loss = 1.8055e-02, PNorm = 482.0257, GNorm = 0.5124, lr_0 = 9.9797e-04
Validation binary_cross_entropy = 0.068794
Epoch 3292
Loss = 1.6228e-02, PNorm = 482.0958, GNorm = 0.0828, lr_0 = 9.9797e-04
Validation binary_cross_entropy = 0.081176
Epoch 3293
Loss = 1.6165e-02, PNorm = 482.1434, GNorm = 0.8205, lr_0 = 9.9797e-04
Validation binary_cross_entropy = 0.068614
Epoch 3294
Loss = 3.9745e-02, PNorm = 482.1835, GNorm = 0.8276, lr_0 = 9.9797e-04
Validation binary_cross_entropy = 0.064666
Epoch 3295
Loss = 8.6831e-03, PNorm = 482.2430, GNorm = 0.6616, lr_0 = 9.9797e-04
Validation binary_cross_entropy = 0.065739
Epoch 3296
Loss = 1.1930e-02, PNorm = 482.2939, GNorm = 1.0940, lr_0 = 9.9797e-04
Validation binary_cross_entropy = 0.066393
Epoch 3297
Loss = 1.3660e-02, PNorm = 482.3584, GNorm = 0.8271, lr_0 = 9.9797e-04
Validation binary_cross_entropy = 0.074741
Epoch 3298
Loss = 1.4150e-02, PNorm = 482.4085, GNorm = 0.0570, lr_0 = 9.9796e-04
Validation binary_cross_entropy = 0.068186
Epoch 3299
Loss = 2.5106e-02, PNorm = 482.4501, GNorm = 0.5646, lr_0 = 9.9796e-04
Loss = 1.9572e-02, PNorm = 482.5031, GNorm = 0.3711, lr_0 = 9.9796e-04
Validation binary_cross_entropy = 0.071126
Epoch 3300
Loss = 1.9459e-02, PNorm = 482.5476, GNorm = 1.9000, lr_0 = 9.9796e-04
Validation binary_cross_entropy = 0.077560
Epoch 3301
Loss = 4.3782e-01, PNorm = 482.6004, GNorm = 0.7392, lr_0 = 9.9796e-04
Validation binary_cross_entropy = 0.058362
Epoch 3302
Loss = 1.1611e-01, PNorm = 482.8315, GNorm = 1.7976, lr_0 = 9.9796e-04
Validation binary_cross_entropy = 0.099405
Epoch 3303
Loss = 7.7981e-02, PNorm = 483.0198, GNorm = 1.2930, lr_0 = 9.9796e-04
Validation binary_cross_entropy = 0.069080
Epoch 3304
Loss = 5.1446e-02, PNorm = 483.1687, GNorm = 0.6499, lr_0 = 9.9796e-04
Validation binary_cross_entropy = 0.097407
Epoch 3305
Loss = 5.1730e-02, PNorm = 483.2633, GNorm = 0.2322, lr_0 = 9.9796e-04
Validation binary_cross_entropy = 0.077773
Epoch 3306
Loss = 5.3135e-02, PNorm = 483.3485, GNorm = 0.0442, lr_0 = 9.9796e-04
Validation binary_cross_entropy = 0.092692
Epoch 3307
Loss = 2.0495e-02, PNorm = 483.4423, GNorm = 0.9755, lr_0 = 9.9796e-04
Validation binary_cross_entropy = 0.088316
Epoch 3308
Loss = 3.3353e-02, PNorm = 483.5243, GNorm = 1.9006, lr_0 = 9.9796e-04
Validation binary_cross_entropy = 0.077305
Epoch 3309
Loss = 4.7720e-03, PNorm = 483.6046, GNorm = 0.4114, lr_0 = 9.9796e-04
Loss = 1.7368e-02, PNorm = 483.6831, GNorm = 0.1206, lr_0 = 9.9796e-04
Validation binary_cross_entropy = 0.078022
Epoch 3310
Loss = 2.0817e-02, PNorm = 483.7469, GNorm = 0.0434, lr_0 = 9.9796e-04
Validation binary_cross_entropy = 0.086446
Epoch 3311
Loss = 4.3145e-02, PNorm = 483.8047, GNorm = 3.4977, lr_0 = 9.9795e-04
Validation binary_cross_entropy = 0.066996
Epoch 3312
Loss = 3.8869e-02, PNorm = 483.8969, GNorm = 0.1458, lr_0 = 9.9795e-04
Validation binary_cross_entropy = 0.089426
Epoch 3313
Loss = 3.1662e-02, PNorm = 483.9738, GNorm = 0.8538, lr_0 = 9.9795e-04
Validation binary_cross_entropy = 0.070624
Epoch 3314
Loss = 5.5145e-02, PNorm = 484.0532, GNorm = 1.6547, lr_0 = 9.9795e-04
Validation binary_cross_entropy = 0.070553
Epoch 3315
Loss = 1.1129e-02, PNorm = 484.1524, GNorm = 0.5200, lr_0 = 9.9795e-04
Validation binary_cross_entropy = 0.075060
Epoch 3316
Loss = 6.1763e-02, PNorm = 484.2099, GNorm = 2.2421, lr_0 = 9.9795e-04
Validation binary_cross_entropy = 0.063897
Epoch 3317
Loss = 2.1792e-02, PNorm = 484.2592, GNorm = 1.2774, lr_0 = 9.9795e-04
Validation binary_cross_entropy = 0.082039
Epoch 3318
Loss = 2.2144e-03, PNorm = 484.3165, GNorm = 0.1122, lr_0 = 9.9795e-04
Validation binary_cross_entropy = 0.087816
Epoch 3319
Loss = 2.4207e-02, PNorm = 484.3534, GNorm = 1.4175, lr_0 = 9.9795e-04
Loss = 2.5696e-02, PNorm = 484.3961, GNorm = 0.4955, lr_0 = 9.9795e-04
Validation binary_cross_entropy = 0.081968
Epoch 3320
Loss = 2.3984e-02, PNorm = 484.4565, GNorm = 1.2602, lr_0 = 9.9795e-04
Validation binary_cross_entropy = 0.080734
Epoch 3321
Loss = 2.3519e-02, PNorm = 484.5027, GNorm = 0.2014, lr_0 = 9.9795e-04
Validation binary_cross_entropy = 0.076863
Epoch 3322
Loss = 1.5363e-02, PNorm = 484.5520, GNorm = 0.7553, lr_0 = 9.9795e-04
Validation binary_cross_entropy = 0.086976
Epoch 3323
Loss = 5.1539e-02, PNorm = 484.6108, GNorm = 0.5040, lr_0 = 9.9795e-04
Validation binary_cross_entropy = 0.088465
Epoch 3324
Loss = 2.4703e-02, PNorm = 484.6786, GNorm = 1.3314, lr_0 = 9.9794e-04
Validation binary_cross_entropy = 0.083385
Epoch 3325
Loss = 1.4773e-02, PNorm = 484.7504, GNorm = 0.8289, lr_0 = 9.9794e-04
Validation binary_cross_entropy = 0.079133
Epoch 3326
Loss = 5.5519e-03, PNorm = 484.8299, GNorm = 0.2750, lr_0 = 9.9794e-04
Validation binary_cross_entropy = 0.139186
Epoch 3327
Loss = 9.8330e-03, PNorm = 484.9208, GNorm = 0.2134, lr_0 = 9.9794e-04
Validation binary_cross_entropy = 0.137032
Epoch 3328
Loss = 1.8728e-02, PNorm = 484.9826, GNorm = 0.8667, lr_0 = 9.9794e-04
Validation binary_cross_entropy = 0.131913
Epoch 3329
Loss = 1.5262e-02, PNorm = 485.0611, GNorm = 0.5406, lr_0 = 9.9794e-04
Loss = 5.5466e-03, PNorm = 485.1350, GNorm = 0.0259, lr_0 = 9.9794e-04
Validation binary_cross_entropy = 0.168228
Epoch 3330
Loss = 5.4433e-02, PNorm = 485.1722, GNorm = 0.1104, lr_0 = 9.9794e-04
Validation binary_cross_entropy = 0.138455
Epoch 3331
Loss = 3.7333e-02, PNorm = 485.2369, GNorm = 1.5126, lr_0 = 9.9794e-04
Validation binary_cross_entropy = 0.133024
Epoch 3332
Loss = 1.4273e-02, PNorm = 485.3149, GNorm = 0.8327, lr_0 = 9.9794e-04
Validation binary_cross_entropy = 0.143792
Epoch 3333
Loss = 4.3402e-02, PNorm = 485.3755, GNorm = 0.1673, lr_0 = 9.9794e-04
Validation binary_cross_entropy = 0.114936
Epoch 3334
Loss = 6.0883e-02, PNorm = 485.4721, GNorm = 3.9519, lr_0 = 9.9794e-04
Validation binary_cross_entropy = 0.146154
Epoch 3335
Loss = 1.0658e-01, PNorm = 485.6091, GNorm = 0.3412, lr_0 = 9.9794e-04
Validation binary_cross_entropy = 0.075244
Epoch 3336
Loss = 3.6704e-02, PNorm = 485.7215, GNorm = 1.3493, lr_0 = 9.9794e-04
Validation binary_cross_entropy = 0.063749
Epoch 3337
Loss = 3.3229e-02, PNorm = 485.8272, GNorm = 3.3612, lr_0 = 9.9794e-04
Validation binary_cross_entropy = 0.086096
Epoch 3338
Loss = 5.6643e-03, PNorm = 485.9321, GNorm = 0.3535, lr_0 = 9.9793e-04
Validation binary_cross_entropy = 0.088230
Epoch 3339
Loss = 3.2212e-03, PNorm = 485.9987, GNorm = 0.1711, lr_0 = 9.9793e-04
Loss = 4.5287e-02, PNorm = 486.0507, GNorm = 3.9612, lr_0 = 9.9793e-04
Validation binary_cross_entropy = 0.082641
Epoch 3340
Loss = 2.2459e-02, PNorm = 486.1296, GNorm = 0.2857, lr_0 = 9.9793e-04
Validation binary_cross_entropy = 0.088349
Epoch 3341
Loss = 3.2195e-02, PNorm = 486.2161, GNorm = 1.0625, lr_0 = 9.9793e-04
Validation binary_cross_entropy = 0.099617
Epoch 3342
Loss = 2.0811e-02, PNorm = 486.2925, GNorm = 1.7040, lr_0 = 9.9793e-04
Validation binary_cross_entropy = 0.111423
Epoch 3343
Loss = 8.0301e-03, PNorm = 486.3572, GNorm = 0.0576, lr_0 = 9.9793e-04
Validation binary_cross_entropy = 0.116590
Epoch 3344
Loss = 1.5341e-02, PNorm = 486.4051, GNorm = 0.7579, lr_0 = 9.9793e-04
Validation binary_cross_entropy = 0.117534
Epoch 3345
Loss = 1.3742e-02, PNorm = 486.4547, GNorm = 1.0716, lr_0 = 9.9793e-04
Validation binary_cross_entropy = 0.124369
Epoch 3346
Loss = 1.2382e-02, PNorm = 486.4910, GNorm = 0.0936, lr_0 = 9.9793e-04
Validation binary_cross_entropy = 0.121485
Epoch 3347
Loss = 1.8182e-03, PNorm = 486.5395, GNorm = 0.0338, lr_0 = 9.9793e-04
Validation binary_cross_entropy = 0.111787
Epoch 3348
Loss = 6.3657e-03, PNorm = 486.5895, GNorm = 0.1903, lr_0 = 9.9793e-04
Validation binary_cross_entropy = 0.102500
Epoch 3349
Loss = 4.0885e-03, PNorm = 486.6430, GNorm = 0.2349, lr_0 = 9.9793e-04
Loss = 3.2467e-02, PNorm = 486.6991, GNorm = 2.0501, lr_0 = 9.9793e-04
Validation binary_cross_entropy = 0.100633
Epoch 3350
Loss = 2.5964e-02, PNorm = 486.7624, GNorm = 0.7829, lr_0 = 9.9792e-04
Validation binary_cross_entropy = 0.083840
Epoch 3351
Loss = 1.6117e-02, PNorm = 486.8329, GNorm = 0.5554, lr_0 = 9.9792e-04
Validation binary_cross_entropy = 0.098787
Epoch 3352
Loss = 2.1099e-02, PNorm = 486.8828, GNorm = 0.7112, lr_0 = 9.9792e-04
Validation binary_cross_entropy = 0.083597
Epoch 3353
Loss = 1.6987e-02, PNorm = 486.9243, GNorm = 0.1242, lr_0 = 9.9792e-04
Validation binary_cross_entropy = 0.085126
Epoch 3354
Loss = 3.1123e-02, PNorm = 486.9773, GNorm = 3.8159, lr_0 = 9.9792e-04
Validation binary_cross_entropy = 0.089702
Epoch 3355
Loss = 1.1042e-02, PNorm = 487.0292, GNorm = 0.1723, lr_0 = 9.9792e-04
Validation binary_cross_entropy = 0.092272
Epoch 3356
Loss = 4.2157e-03, PNorm = 487.0901, GNorm = 0.1867, lr_0 = 9.9792e-04
Validation binary_cross_entropy = 0.096555
Epoch 3357
Loss = 1.8642e-02, PNorm = 487.1559, GNorm = 1.1184, lr_0 = 9.9792e-04
Validation binary_cross_entropy = 0.100012
Epoch 3358
Loss = 2.7824e-02, PNorm = 487.2176, GNorm = 1.2719, lr_0 = 9.9792e-04
Validation binary_cross_entropy = 0.087346
Epoch 3359
Loss = 2.0914e-02, PNorm = 487.2674, GNorm = 0.8746, lr_0 = 9.9792e-04
Loss = 4.1984e-02, PNorm = 487.3353, GNorm = 1.5388, lr_0 = 9.9792e-04
Validation binary_cross_entropy = 0.083153
Epoch 3360
Loss = 2.6402e-02, PNorm = 487.4098, GNorm = 0.0413, lr_0 = 9.9792e-04
Validation binary_cross_entropy = 0.082040
Epoch 3361
Loss = 1.2924e-02, PNorm = 487.4776, GNorm = 0.1105, lr_0 = 9.9792e-04
Validation binary_cross_entropy = 0.082450
Epoch 3362
Loss = 1.8575e-02, PNorm = 487.5401, GNorm = 0.9608, lr_0 = 9.9792e-04
Validation binary_cross_entropy = 0.092499
Epoch 3363
Loss = 1.0698e-01, PNorm = 487.6235, GNorm = 0.1477, lr_0 = 9.9792e-04
Validation binary_cross_entropy = 0.073849
Epoch 3364
Loss = 2.7263e-02, PNorm = 487.7335, GNorm = 3.1252, lr_0 = 9.9791e-04
Validation binary_cross_entropy = 0.074788
Epoch 3365
Loss = 3.7890e-02, PNorm = 487.8196, GNorm = 0.1120, lr_0 = 9.9791e-04
Validation binary_cross_entropy = 0.065529
Epoch 3366
Loss = 7.0979e-03, PNorm = 487.8915, GNorm = 0.0970, lr_0 = 9.9791e-04
Validation binary_cross_entropy = 0.063424
Epoch 3367
Loss = 3.1527e-03, PNorm = 487.9572, GNorm = 0.0573, lr_0 = 9.9791e-04
Validation binary_cross_entropy = 0.066151
Epoch 3368
Loss = 3.8265e-02, PNorm = 488.0325, GNorm = 0.1835, lr_0 = 9.9791e-04
Validation binary_cross_entropy = 0.074604
Epoch 3369
Loss = 3.1119e-03, PNorm = 488.0942, GNorm = 0.1921, lr_0 = 9.9791e-04
Loss = 2.3651e-02, PNorm = 488.1407, GNorm = 0.0862, lr_0 = 9.9791e-04
Validation binary_cross_entropy = 0.084559
Epoch 3370
Loss = 8.8389e-03, PNorm = 488.1987, GNorm = 0.4034, lr_0 = 9.9791e-04
Validation binary_cross_entropy = 0.115892
Epoch 3371
Loss = 1.4427e-02, PNorm = 488.2569, GNorm = 0.1362, lr_0 = 9.9791e-04
Validation binary_cross_entropy = 0.202770
Epoch 3372
Loss = 4.6685e-02, PNorm = 488.2857, GNorm = 0.1516, lr_0 = 9.9791e-04
Validation binary_cross_entropy = 0.078841
Epoch 3373
Loss = 1.8827e-02, PNorm = 488.3330, GNorm = 1.6028, lr_0 = 9.9791e-04
Validation binary_cross_entropy = 0.080173
Epoch 3374
Loss = 3.9877e-02, PNorm = 488.3969, GNorm = 1.1425, lr_0 = 9.9791e-04
Validation binary_cross_entropy = 0.081847
Epoch 3375
Loss = 3.0032e-02, PNorm = 488.4615, GNorm = 1.1473, lr_0 = 9.9791e-04
Validation binary_cross_entropy = 0.081925
Epoch 3376
Loss = 1.5290e-02, PNorm = 488.5066, GNorm = 0.0652, lr_0 = 9.9791e-04
Validation binary_cross_entropy = 0.086593
Epoch 3377
Loss = 5.9854e-03, PNorm = 488.5520, GNorm = 0.0285, lr_0 = 9.9790e-04
Validation binary_cross_entropy = 0.073952
Epoch 3378
Loss = 2.4538e-03, PNorm = 488.5842, GNorm = 0.0873, lr_0 = 9.9790e-04
Validation binary_cross_entropy = 0.067707
Epoch 3379
Loss = 2.1901e-02, PNorm = 488.6478, GNorm = 1.1635, lr_0 = 9.9790e-04
Loss = 3.3525e-02, PNorm = 488.7135, GNorm = 0.4748, lr_0 = 9.9790e-04
Validation binary_cross_entropy = 0.063661
Epoch 3380
Loss = 3.0949e-02, PNorm = 488.7852, GNorm = 1.3214, lr_0 = 9.9790e-04
Validation binary_cross_entropy = 0.063053
Epoch 3381
Loss = 2.7941e-02, PNorm = 488.8701, GNorm = 0.6830, lr_0 = 9.9790e-04
Validation binary_cross_entropy = 0.124921
Epoch 3382
Loss = 5.0343e-02, PNorm = 488.9573, GNorm = 1.2454, lr_0 = 9.9790e-04
Validation binary_cross_entropy = 0.061723
Epoch 3383
Loss = 1.7722e-02, PNorm = 489.0725, GNorm = 0.8863, lr_0 = 9.9790e-04
Validation binary_cross_entropy = 0.070921
Epoch 3384
Loss = 9.9167e-03, PNorm = 489.1655, GNorm = 0.6599, lr_0 = 9.9790e-04
Validation binary_cross_entropy = 0.076340
Epoch 3385
Loss = 3.4405e-02, PNorm = 489.2352, GNorm = 2.1774, lr_0 = 9.9790e-04
Validation binary_cross_entropy = 0.077486
Epoch 3386
Loss = 4.6020e-03, PNorm = 489.3001, GNorm = 0.0869, lr_0 = 9.9790e-04
Validation binary_cross_entropy = 0.078838
Epoch 3387
Loss = 2.1052e-02, PNorm = 489.3520, GNorm = 1.0304, lr_0 = 9.9790e-04
Validation binary_cross_entropy = 0.073293
Epoch 3388
Loss = 6.8288e-03, PNorm = 489.3970, GNorm = 0.2142, lr_0 = 9.9790e-04
Validation binary_cross_entropy = 0.076888
Epoch 3389
Loss = 3.3862e-02, PNorm = 489.4377, GNorm = 1.7138, lr_0 = 9.9790e-04
Loss = 2.1453e-02, PNorm = 489.4762, GNorm = 0.4368, lr_0 = 9.9790e-04
Validation binary_cross_entropy = 0.079560
Epoch 3390
Loss = 1.7757e-02, PNorm = 489.5342, GNorm = 0.0821, lr_0 = 9.9789e-04
Validation binary_cross_entropy = 0.091975
Epoch 3391
Loss = 1.5267e-02, PNorm = 489.5806, GNorm = 3.9758, lr_0 = 9.9789e-04
Validation binary_cross_entropy = 0.100903
Epoch 3392
Loss = 5.6276e-02, PNorm = 489.6355, GNorm = 0.0523, lr_0 = 9.9789e-04
Validation binary_cross_entropy = 0.098225
Epoch 3393
Loss = 3.9876e-02, PNorm = 489.6845, GNorm = 0.5091, lr_0 = 9.9789e-04
Validation binary_cross_entropy = 0.092068
Epoch 3394
Loss = 6.8014e-02, PNorm = 489.7328, GNorm = 0.2175, lr_0 = 9.9789e-04
Validation binary_cross_entropy = 0.097370
Epoch 3395
Loss = 7.2544e-03, PNorm = 489.7987, GNorm = 0.1300, lr_0 = 9.9789e-04
Validation binary_cross_entropy = 0.107088
Epoch 3396
Loss = 1.0241e-01, PNorm = 489.8520, GNorm = 2.6022, lr_0 = 9.9789e-04
Validation binary_cross_entropy = 0.065479
Epoch 3397
Loss = 5.0465e-02, PNorm = 489.9269, GNorm = 0.3016, lr_0 = 9.9789e-04
Validation binary_cross_entropy = 0.082183
Epoch 3398
Loss = 1.0234e-02, PNorm = 490.0490, GNorm = 0.1554, lr_0 = 9.9789e-04
Validation binary_cross_entropy = 0.081354
Epoch 3399
Loss = 2.4032e-02, PNorm = 490.1297, GNorm = 0.7783, lr_0 = 9.9789e-04
Loss = 1.6052e-02, PNorm = 490.1907, GNorm = 0.4086, lr_0 = 9.9789e-04
Validation binary_cross_entropy = 0.083340
Epoch 3400
Loss = 1.4582e-02, PNorm = 490.2499, GNorm = 0.9928, lr_0 = 9.9789e-04
Validation binary_cross_entropy = 0.092767
Epoch 3401
Loss = 8.7705e-03, PNorm = 490.3110, GNorm = 0.2273, lr_0 = 9.9789e-04
Validation binary_cross_entropy = 0.100223
Epoch 3402
Loss = 2.6986e-03, PNorm = 490.3568, GNorm = 0.1347, lr_0 = 9.9789e-04
Validation binary_cross_entropy = 0.107462
Epoch 3403
Loss = 9.8482e-02, PNorm = 490.3921, GNorm = 0.0316, lr_0 = 9.9789e-04
Validation binary_cross_entropy = 0.100910
Epoch 3404
Loss = 4.4715e-02, PNorm = 490.4595, GNorm = 1.0999, lr_0 = 9.9788e-04
Validation binary_cross_entropy = 0.071435
Epoch 3405
Loss = 1.9959e-02, PNorm = 490.5347, GNorm = 0.4710, lr_0 = 9.9788e-04
Validation binary_cross_entropy = 0.083109
Epoch 3406
Loss = 1.5342e-02, PNorm = 490.6032, GNorm = 0.4945, lr_0 = 9.9788e-04
Validation binary_cross_entropy = 0.085086
Epoch 3407
Loss = 1.1464e-02, PNorm = 490.6529, GNorm = 0.5896, lr_0 = 9.9788e-04
Validation binary_cross_entropy = 0.093911
Epoch 3408
Loss = 1.3388e-01, PNorm = 490.6964, GNorm = 0.1539, lr_0 = 9.9788e-04
Validation binary_cross_entropy = 0.087223
Epoch 3409
Loss = 1.7967e-03, PNorm = 490.7389, GNorm = 0.0421, lr_0 = 9.9788e-04
Loss = 8.9642e-03, PNorm = 490.7857, GNorm = 0.6684, lr_0 = 9.9788e-04
Validation binary_cross_entropy = 0.085365
Epoch 3410
Loss = 3.5855e-02, PNorm = 490.8424, GNorm = 2.9024, lr_0 = 9.9788e-04
Validation binary_cross_entropy = 0.117940
Epoch 3411
Loss = 2.7751e-02, PNorm = 490.9000, GNorm = 2.6157, lr_0 = 9.9788e-04
Validation binary_cross_entropy = 0.096019
Epoch 3412
Loss = 1.0448e-01, PNorm = 490.9560, GNorm = 0.1390, lr_0 = 9.9788e-04
Validation binary_cross_entropy = 0.049274
Epoch 3413
Loss = 4.4211e-02, PNorm = 491.0854, GNorm = 0.6099, lr_0 = 9.9788e-04
Validation binary_cross_entropy = 0.056031
Epoch 3414
Loss = 4.8838e-02, PNorm = 491.2160, GNorm = 0.1315, lr_0 = 9.9788e-04
Validation binary_cross_entropy = 0.058850
Epoch 3415
Loss = 9.4566e-03, PNorm = 491.3064, GNorm = 0.8040, lr_0 = 9.9788e-04
Validation binary_cross_entropy = 0.068753
Epoch 3416
Loss = 2.5427e-02, PNorm = 491.3716, GNorm = 0.9150, lr_0 = 9.9788e-04
Validation binary_cross_entropy = 0.086736
Epoch 3417
Loss = 3.4234e-02, PNorm = 491.4393, GNorm = 0.0605, lr_0 = 9.9787e-04
Validation binary_cross_entropy = 0.070932
Epoch 3418
Loss = 1.1368e-02, PNorm = 491.4976, GNorm = 1.4248, lr_0 = 9.9787e-04
Validation binary_cross_entropy = 0.078929
Epoch 3419
Loss = 8.1421e-04, PNorm = 491.6134, GNorm = 0.0603, lr_0 = 9.9787e-04
Loss = 1.2208e-01, PNorm = 491.7144, GNorm = 0.8497, lr_0 = 9.9787e-04
Validation binary_cross_entropy = 0.052433
Epoch 3420
Loss = 3.4863e-02, PNorm = 491.8498, GNorm = 0.4990, lr_0 = 9.9787e-04
Validation binary_cross_entropy = 0.056593
Epoch 3421
Loss = 1.8772e-02, PNorm = 491.9704, GNorm = 0.3487, lr_0 = 9.9787e-04
Validation binary_cross_entropy = 0.073890
Epoch 3422
Loss = 1.8256e-02, PNorm = 492.0422, GNorm = 0.0425, lr_0 = 9.9787e-04
Validation binary_cross_entropy = 0.074837
Epoch 3423
Loss = 7.0156e-03, PNorm = 492.0989, GNorm = 0.0998, lr_0 = 9.9787e-04
Validation binary_cross_entropy = 0.077868
Epoch 3424
Loss = 1.2614e-02, PNorm = 492.1606, GNorm = 0.0476, lr_0 = 9.9787e-04
Validation binary_cross_entropy = 0.118194
Epoch 3425
Loss = 5.3472e-02, PNorm = 492.2540, GNorm = 1.6010, lr_0 = 9.9787e-04
Validation binary_cross_entropy = 0.068543
Epoch 3426
Loss = 3.4386e-02, PNorm = 492.3391, GNorm = 1.1034, lr_0 = 9.9787e-04
Validation binary_cross_entropy = 0.074030
Epoch 3427
Loss = 2.6698e-02, PNorm = 492.4199, GNorm = 0.4650, lr_0 = 9.9787e-04
Validation binary_cross_entropy = 0.075462
Epoch 3428
Loss = 6.2564e-02, PNorm = 492.4922, GNorm = 1.2216, lr_0 = 9.9787e-04
Validation binary_cross_entropy = 0.062509
Epoch 3429
Loss = 2.2679e-02, PNorm = 492.5593, GNorm = 0.9662, lr_0 = 9.9787e-04
Loss = 2.6091e-02, PNorm = 492.6330, GNorm = 1.3491, lr_0 = 9.9787e-04
Validation binary_cross_entropy = 0.064097
Epoch 3430
Loss = 6.1344e-02, PNorm = 492.7181, GNorm = 0.2957, lr_0 = 9.9786e-04
Validation binary_cross_entropy = 0.060429
Epoch 3431
Loss = 3.1515e-02, PNorm = 492.8031, GNorm = 1.4860, lr_0 = 9.9786e-04
Validation binary_cross_entropy = 0.054747
Epoch 3432
Loss = 4.2215e-02, PNorm = 492.8957, GNorm = 0.7592, lr_0 = 9.9786e-04
Validation binary_cross_entropy = 0.070209
Epoch 3433
Loss = 6.3834e-02, PNorm = 492.9637, GNorm = 2.7393, lr_0 = 9.9786e-04
Validation binary_cross_entropy = 0.050064
Epoch 3434
Loss = 2.3252e-02, PNorm = 493.0420, GNorm = 0.6138, lr_0 = 9.9786e-04
Validation binary_cross_entropy = 0.056662
Epoch 3435
Loss = 4.1730e-02, PNorm = 493.1325, GNorm = 0.0545, lr_0 = 9.9786e-04
Validation binary_cross_entropy = 0.060836
Epoch 3436
Loss = 7.0619e-03, PNorm = 493.2184, GNorm = 0.2230, lr_0 = 9.9786e-04
Validation binary_cross_entropy = 0.088285
Epoch 3437
Loss = 5.4718e-03, PNorm = 493.2938, GNorm = 0.1939, lr_0 = 9.9786e-04
Validation binary_cross_entropy = 0.082110
Epoch 3438
Loss = 1.3493e-03, PNorm = 493.3509, GNorm = 0.4696, lr_0 = 9.9786e-04
Validation binary_cross_entropy = 0.088700
Epoch 3439
Loss = 1.6390e-02, PNorm = 493.4079, GNorm = 1.0348, lr_0 = 9.9786e-04
Loss = 4.1040e-02, PNorm = 493.4602, GNorm = 4.1649, lr_0 = 9.9786e-04
Validation binary_cross_entropy = 0.078585
Epoch 3440
Loss = 1.3902e-02, PNorm = 493.5288, GNorm = 0.3654, lr_0 = 9.9786e-04
Validation binary_cross_entropy = 0.106174
Epoch 3441
Loss = 1.3337e-02, PNorm = 493.5998, GNorm = 0.1460, lr_0 = 9.9786e-04
Validation binary_cross_entropy = 0.108254
Epoch 3442
Loss = 1.8994e-02, PNorm = 493.6678, GNorm = 2.3244, lr_0 = 9.9786e-04
Validation binary_cross_entropy = 0.153516
Epoch 3443
Loss = 1.1150e-01, PNorm = 493.7315, GNorm = 1.5437, lr_0 = 9.9785e-04
Validation binary_cross_entropy = 0.069812
Epoch 3444
Loss = 2.3881e-02, PNorm = 493.8142, GNorm = 0.1598, lr_0 = 9.9785e-04
Validation binary_cross_entropy = 0.094670
Epoch 3445
Loss = 3.3347e-02, PNorm = 493.8989, GNorm = 0.9502, lr_0 = 9.9785e-04
Validation binary_cross_entropy = 0.092246
Epoch 3446
Loss = 2.0895e-02, PNorm = 493.9569, GNorm = 0.5743, lr_0 = 9.9785e-04
Validation binary_cross_entropy = 0.084451
Epoch 3447
Loss = 3.7631e-02, PNorm = 494.0178, GNorm = 1.9314, lr_0 = 9.9785e-04
Validation binary_cross_entropy = 0.072556
Epoch 3448
Loss = 4.5196e-02, PNorm = 494.1173, GNorm = 1.0932, lr_0 = 9.9785e-04
Validation binary_cross_entropy = 0.059361
Epoch 3449
Loss = 2.7256e-02, PNorm = 494.1991, GNorm = 0.5495, lr_0 = 9.9785e-04
Loss = 2.6241e-02, PNorm = 494.2809, GNorm = 2.7043, lr_0 = 9.9785e-04
Validation binary_cross_entropy = 0.066182
Epoch 3450
Loss = 2.5169e-02, PNorm = 494.3703, GNorm = 0.0204, lr_0 = 9.9785e-04
Validation binary_cross_entropy = 0.091256
Epoch 3451
Loss = 6.9002e-02, PNorm = 494.4329, GNorm = 0.6898, lr_0 = 9.9785e-04
Validation binary_cross_entropy = 0.065017
Epoch 3452
Loss = 1.1006e-02, PNorm = 494.4938, GNorm = 0.1419, lr_0 = 9.9785e-04
Validation binary_cross_entropy = 0.071821
Epoch 3453
Loss = 6.7504e-03, PNorm = 494.5613, GNorm = 0.0703, lr_0 = 9.9785e-04
Validation binary_cross_entropy = 0.089160
Epoch 3454
Loss = 2.4528e-02, PNorm = 494.6025, GNorm = 0.0449, lr_0 = 9.9785e-04
Validation binary_cross_entropy = 0.073900
Epoch 3455
Loss = 1.0816e-02, PNorm = 494.6503, GNorm = 0.1345, lr_0 = 9.9785e-04
Validation binary_cross_entropy = 0.068510
Epoch 3456
Loss = 9.2746e-02, PNorm = 494.7132, GNorm = 1.1818, lr_0 = 9.9785e-04
Validation binary_cross_entropy = 0.082736
Epoch 3457
Loss = 5.4756e-02, PNorm = 494.7805, GNorm = 1.8906, lr_0 = 9.9784e-04
Validation binary_cross_entropy = 0.061224
Epoch 3458
Loss = 2.2024e-02, PNorm = 494.8358, GNorm = 1.5031, lr_0 = 9.9784e-04
Validation binary_cross_entropy = 0.067319
Epoch 3459
Loss = 6.3212e-03, PNorm = 494.9091, GNorm = 0.1437, lr_0 = 9.9784e-04
Loss = 2.1724e-02, PNorm = 494.9619, GNorm = 0.9771, lr_0 = 9.9784e-04
Validation binary_cross_entropy = 0.075105
Epoch 3460
Loss = 1.1574e-02, PNorm = 495.0069, GNorm = 0.7246, lr_0 = 9.9784e-04
Validation binary_cross_entropy = 0.069447
Epoch 3461
Loss = 3.6200e-02, PNorm = 495.0587, GNorm = 2.6592, lr_0 = 9.9784e-04
Validation binary_cross_entropy = 0.077948
Epoch 3462
Loss = 2.6307e-02, PNorm = 495.1327, GNorm = 0.2679, lr_0 = 9.9784e-04
Validation binary_cross_entropy = 0.088379
Epoch 3463
Loss = 1.4328e-02, PNorm = 495.1850, GNorm = 1.6867, lr_0 = 9.9784e-04
Validation binary_cross_entropy = 0.062317
Epoch 3464
Loss = 5.6717e-02, PNorm = 495.2476, GNorm = 0.1313, lr_0 = 9.9784e-04
Validation binary_cross_entropy = 0.059672
Epoch 3465
Loss = 2.3617e-02, PNorm = 495.3220, GNorm = 0.0737, lr_0 = 9.9784e-04
Validation binary_cross_entropy = 0.081717
Epoch 3466
Loss = 1.2993e-02, PNorm = 495.3920, GNorm = 0.3065, lr_0 = 9.9784e-04
Validation binary_cross_entropy = 0.090747
Epoch 3467
Loss = 1.0649e-02, PNorm = 495.4417, GNorm = 0.0285, lr_0 = 9.9784e-04
Validation binary_cross_entropy = 0.071639
Epoch 3468
Loss = 4.1152e-03, PNorm = 495.4754, GNorm = 0.2856, lr_0 = 9.9784e-04
Validation binary_cross_entropy = 0.075618
Epoch 3469
Loss = 1.0429e-03, PNorm = 495.5264, GNorm = 0.0324, lr_0 = 9.9784e-04
Loss = 3.1229e-03, PNorm = 495.5692, GNorm = 0.0108, lr_0 = 9.9783e-04
Validation binary_cross_entropy = 0.090149
Epoch 3470
Loss = 1.7830e-02, PNorm = 495.6029, GNorm = 0.8310, lr_0 = 9.9783e-04
Validation binary_cross_entropy = 0.085921
Epoch 3471
Loss = 6.0032e-02, PNorm = 495.6339, GNorm = 0.0237, lr_0 = 9.9783e-04
Validation binary_cross_entropy = 0.076562
Epoch 3472
Loss = 2.4953e-02, PNorm = 495.6787, GNorm = 0.9860, lr_0 = 9.9783e-04
Validation binary_cross_entropy = 0.073279
Epoch 3473
Loss = 1.0609e-01, PNorm = 495.7400, GNorm = 4.2649, lr_0 = 9.9783e-04
Validation binary_cross_entropy = 0.067780
Epoch 3474
Loss = 2.2204e-02, PNorm = 495.7853, GNorm = 0.1908, lr_0 = 9.9783e-04
Validation binary_cross_entropy = 0.063341
Epoch 3475
Loss = 2.5072e-02, PNorm = 495.8487, GNorm = 0.2034, lr_0 = 9.9783e-04
Validation binary_cross_entropy = 0.066971
Epoch 3476
Loss = 2.5458e-03, PNorm = 495.9134, GNorm = 0.0296, lr_0 = 9.9783e-04
Validation binary_cross_entropy = 0.078563
Epoch 3477
Loss = 1.6160e-02, PNorm = 495.9545, GNorm = 0.8398, lr_0 = 9.9783e-04
Validation binary_cross_entropy = 0.079379
Epoch 3478
Loss = 1.3617e-03, PNorm = 495.9858, GNorm = 0.1454, lr_0 = 9.9783e-04
Validation binary_cross_entropy = 0.124628
Epoch 3479
Loss = 1.0624e-03, PNorm = 496.0227, GNorm = 0.1217, lr_0 = 9.9783e-04
Loss = 6.6402e-02, PNorm = 496.0491, GNorm = 0.1200, lr_0 = 9.9783e-04
Validation binary_cross_entropy = 0.084425
Epoch 3480
Loss = 3.9869e-02, PNorm = 496.1059, GNorm = 0.4194, lr_0 = 9.9783e-04
Validation binary_cross_entropy = 0.066653
Epoch 3481
Loss = 2.9116e-02, PNorm = 496.1758, GNorm = 0.0659, lr_0 = 9.9783e-04
Validation binary_cross_entropy = 0.066189
Epoch 3482
Loss = 7.8428e-03, PNorm = 496.2276, GNorm = 0.3014, lr_0 = 9.9783e-04
Validation binary_cross_entropy = 0.073703
Epoch 3483
Loss = 7.8103e-03, PNorm = 496.2742, GNorm = 0.0381, lr_0 = 9.9782e-04
Validation binary_cross_entropy = 0.078267
Epoch 3484
Loss = 3.3750e-02, PNorm = 496.3146, GNorm = 0.7216, lr_0 = 9.9782e-04
Validation binary_cross_entropy = 0.080495
Epoch 3485
Loss = 4.2273e-03, PNorm = 496.3693, GNorm = 0.0651, lr_0 = 9.9782e-04
Validation binary_cross_entropy = 0.083335
Epoch 3486
Loss = 1.0713e-02, PNorm = 496.4388, GNorm = 0.0875, lr_0 = 9.9782e-04
Validation binary_cross_entropy = 0.159418
Epoch 3487
Loss = 3.8565e-02, PNorm = 496.5039, GNorm = 0.3913, lr_0 = 9.9782e-04
Validation binary_cross_entropy = 0.067794
Epoch 3488
Loss = 1.0523e-01, PNorm = 496.5704, GNorm = 1.3606, lr_0 = 9.9782e-04
Validation binary_cross_entropy = 0.143940
Epoch 3489
Loss = 2.8958e-02, PNorm = 496.6651, GNorm = 0.9436, lr_0 = 9.9782e-04
Loss = 3.3310e-02, PNorm = 496.7252, GNorm = 1.0893, lr_0 = 9.9782e-04
Validation binary_cross_entropy = 0.086945
Epoch 3490
Loss = 1.5083e-02, PNorm = 496.7928, GNorm = 1.3600, lr_0 = 9.9782e-04
Validation binary_cross_entropy = 0.084986
Epoch 3491
Loss = 8.5888e-03, PNorm = 496.8615, GNorm = 0.0134, lr_0 = 9.9782e-04
Validation binary_cross_entropy = 0.105998
Epoch 3492
Loss = 1.1651e-02, PNorm = 496.9120, GNorm = 0.0642, lr_0 = 9.9782e-04
Validation binary_cross_entropy = 0.112389
Epoch 3493
Loss = 7.2342e-02, PNorm = 496.9587, GNorm = 0.7007, lr_0 = 9.9782e-04
Validation binary_cross_entropy = 0.085207
Epoch 3494
Loss = 2.1712e-02, PNorm = 497.0207, GNorm = 0.1588, lr_0 = 9.9782e-04
Validation binary_cross_entropy = 0.064191
Epoch 3495
Loss = 3.7881e-02, PNorm = 497.0988, GNorm = 0.8791, lr_0 = 9.9782e-04
Validation binary_cross_entropy = 0.066521
Epoch 3496
Loss = 5.6796e-03, PNorm = 497.1830, GNorm = 0.3821, lr_0 = 9.9781e-04
Validation binary_cross_entropy = 0.069638
Epoch 3497
Loss = 1.6802e-02, PNorm = 497.2413, GNorm = 0.8240, lr_0 = 9.9781e-04
Validation binary_cross_entropy = 0.064271
Epoch 3498
Loss = 7.1829e-03, PNorm = 497.3075, GNorm = 0.0814, lr_0 = 9.9781e-04
Validation binary_cross_entropy = 0.073116
Epoch 3499
Loss = 2.3381e-02, PNorm = 497.3682, GNorm = 2.2536, lr_0 = 9.9781e-04
Loss = 2.6120e-02, PNorm = 497.4376, GNorm = 0.7460, lr_0 = 9.9781e-04
Validation binary_cross_entropy = 0.071211
Epoch 3500
Loss = 1.7018e-02, PNorm = 497.5111, GNorm = 0.0197, lr_0 = 9.9781e-04
Validation binary_cross_entropy = 0.087662
Epoch 3501
Loss = 6.7650e-02, PNorm = 497.5650, GNorm = 0.0598, lr_0 = 9.9781e-04
Validation binary_cross_entropy = 0.068421
Epoch 3502
Loss = 1.8261e-01, PNorm = 497.6382, GNorm = 0.1091, lr_0 = 9.9781e-04
Validation binary_cross_entropy = 0.092883
Epoch 3503
Loss = 5.0453e-02, PNorm = 497.7574, GNorm = 0.6783, lr_0 = 9.9781e-04
Validation binary_cross_entropy = 0.104857
Epoch 3504
Loss = 2.8198e-02, PNorm = 497.8555, GNorm = 0.6365, lr_0 = 9.9781e-04
Validation binary_cross_entropy = 0.110453
Epoch 3505
Loss = 3.9396e-02, PNorm = 497.9388, GNorm = 0.1321, lr_0 = 9.9781e-04
Validation binary_cross_entropy = 0.128430
Epoch 3506
Loss = 5.6422e-02, PNorm = 498.0158, GNorm = 0.7265, lr_0 = 9.9781e-04
Validation binary_cross_entropy = 0.075949
Epoch 3507
Loss = 4.2713e-03, PNorm = 498.1253, GNorm = 0.1334, lr_0 = 9.9781e-04
Validation binary_cross_entropy = 0.090542
Epoch 3508
Loss = 4.0954e-02, PNorm = 498.2285, GNorm = 1.5590, lr_0 = 9.9781e-04
Validation binary_cross_entropy = 0.063606
Epoch 3509
Loss = 2.1692e-02, PNorm = 498.3134, GNorm = 0.6087, lr_0 = 9.9781e-04
Loss = 4.1354e-02, PNorm = 498.4004, GNorm = 2.4656, lr_0 = 9.9780e-04
Validation binary_cross_entropy = 0.084683
Epoch 3510
Loss = 7.4797e-02, PNorm = 498.4792, GNorm = 4.6325, lr_0 = 9.9780e-04
Validation binary_cross_entropy = 0.070368
Epoch 3511
Loss = 2.7952e-02, PNorm = 498.5490, GNorm = 2.1198, lr_0 = 9.9780e-04
Validation binary_cross_entropy = 0.054707
Epoch 3512
Loss = 1.4250e-02, PNorm = 498.6267, GNorm = 0.4709, lr_0 = 9.9780e-04
Validation binary_cross_entropy = 0.065543
Epoch 3513
Loss = 2.9909e-02, PNorm = 498.6833, GNorm = 0.1663, lr_0 = 9.9780e-04
Validation binary_cross_entropy = 0.060727
Epoch 3514
Loss = 5.6254e-02, PNorm = 498.7850, GNorm = 0.5897, lr_0 = 9.9780e-04
Validation binary_cross_entropy = 0.063454
Epoch 3515
Loss = 2.7215e-02, PNorm = 498.9155, GNorm = 0.9911, lr_0 = 9.9780e-04
Validation binary_cross_entropy = 0.081587
Epoch 3516
Loss = 5.6367e-03, PNorm = 498.9966, GNorm = 0.0685, lr_0 = 9.9780e-04
Validation binary_cross_entropy = 0.072032
Epoch 3517
Loss = 6.3669e-03, PNorm = 499.0583, GNorm = 0.5530, lr_0 = 9.9780e-04
Validation binary_cross_entropy = 0.073586
Epoch 3518
Loss = 1.4130e-02, PNorm = 499.1089, GNorm = 0.7567, lr_0 = 9.9780e-04
Validation binary_cross_entropy = 0.076491
Epoch 3519
Loss = 3.0598e-03, PNorm = 499.1565, GNorm = 0.1605, lr_0 = 9.9780e-04
Loss = 1.3878e-02, PNorm = 499.2068, GNorm = 0.0534, lr_0 = 9.9780e-04
Validation binary_cross_entropy = 0.075429
Epoch 3520
Loss = 2.1944e-02, PNorm = 499.2548, GNorm = 0.5260, lr_0 = 9.9780e-04
Validation binary_cross_entropy = 0.074817
Epoch 3521
Loss = 2.9451e-02, PNorm = 499.3022, GNorm = 3.1106, lr_0 = 9.9780e-04
Validation binary_cross_entropy = 0.069775
Epoch 3522
Loss = 1.6053e-02, PNorm = 499.3882, GNorm = 0.0439, lr_0 = 9.9780e-04
Validation binary_cross_entropy = 0.079991
Epoch 3523
Loss = 4.3232e-02, PNorm = 499.4632, GNorm = 9.0961, lr_0 = 9.9779e-04
Validation binary_cross_entropy = 0.074427
Epoch 3524
Loss = 6.7202e-02, PNorm = 499.5185, GNorm = 6.1085, lr_0 = 9.9779e-04
Validation binary_cross_entropy = 0.059176
Epoch 3525
Loss = 4.4458e-02, PNorm = 499.6119, GNorm = 0.2018, lr_0 = 9.9779e-04
Validation binary_cross_entropy = 0.077179
Epoch 3526
Loss = 2.4026e-02, PNorm = 499.7046, GNorm = 0.5590, lr_0 = 9.9779e-04
Validation binary_cross_entropy = 0.062187
Epoch 3527
Loss = 3.7968e-02, PNorm = 499.7836, GNorm = 0.5634, lr_0 = 9.9779e-04
Validation binary_cross_entropy = 0.069309
Epoch 3528
Loss = 1.7766e-03, PNorm = 499.8564, GNorm = 0.0410, lr_0 = 9.9779e-04
Validation binary_cross_entropy = 0.112192
Epoch 3529
Loss = 6.1120e-04, PNorm = 499.9059, GNorm = 0.0210, lr_0 = 9.9779e-04
Loss = 8.9535e-04, PNorm = 499.9336, GNorm = 0.0011, lr_0 = 9.9779e-04
Validation binary_cross_entropy = 0.115219
Epoch 3530
Loss = 1.7054e-02, PNorm = 499.9494, GNorm = 0.0674, lr_0 = 9.9779e-04
Validation binary_cross_entropy = 0.088056
Epoch 3531
Loss = 1.1042e-02, PNorm = 499.9977, GNorm = 0.5782, lr_0 = 9.9779e-04
Validation binary_cross_entropy = 0.126164
Epoch 3532
Loss = 1.9324e-02, PNorm = 500.0542, GNorm = 0.5966, lr_0 = 9.9779e-04
Validation binary_cross_entropy = 0.107720
Epoch 3533
Loss = 5.9987e-02, PNorm = 500.1172, GNorm = 0.4401, lr_0 = 9.9779e-04
Validation binary_cross_entropy = 0.062987
Epoch 3534
Loss = 6.2478e-03, PNorm = 500.2137, GNorm = 0.0559, lr_0 = 9.9779e-04
Validation binary_cross_entropy = 0.071672
Epoch 3535
Loss = 3.3404e-02, PNorm = 500.2911, GNorm = 0.1489, lr_0 = 9.9779e-04
Validation binary_cross_entropy = 0.066316
Epoch 3536
Loss = 1.6843e-02, PNorm = 500.3502, GNorm = 0.1558, lr_0 = 9.9778e-04
Validation binary_cross_entropy = 0.098104
Epoch 3537
Loss = 3.5449e-03, PNorm = 500.4003, GNorm = 0.6633, lr_0 = 9.9778e-04
Validation binary_cross_entropy = 0.161612
Epoch 3538
Loss = 1.4006e-01, PNorm = 500.4374, GNorm = 2.0274, lr_0 = 9.9778e-04
Validation binary_cross_entropy = 0.082072
Epoch 3539
Loss = 3.5945e-02, PNorm = 500.4634, GNorm = 2.4127, lr_0 = 9.9778e-04
Loss = 1.0727e-02, PNorm = 500.5039, GNorm = 0.0635, lr_0 = 9.9778e-04
Validation binary_cross_entropy = 0.075845
Epoch 3540
Loss = 9.9475e-02, PNorm = 500.5673, GNorm = 0.2808, lr_0 = 9.9778e-04
Validation binary_cross_entropy = 0.052441
Epoch 3541
Loss = 7.3916e-02, PNorm = 500.6706, GNorm = 4.7957, lr_0 = 9.9778e-04
Validation binary_cross_entropy = 0.068800
Epoch 3542
Loss = 5.3111e-02, PNorm = 500.7695, GNorm = 1.1384, lr_0 = 9.9778e-04
Validation binary_cross_entropy = 0.055651
Epoch 3543
Loss = 2.0719e-02, PNorm = 500.8640, GNorm = 0.1708, lr_0 = 9.9778e-04
Validation binary_cross_entropy = 0.069055
Epoch 3544
Loss = 1.0702e-01, PNorm = 500.9373, GNorm = 1.7745, lr_0 = 9.9778e-04
Validation binary_cross_entropy = 0.058472
Epoch 3545
Loss = 3.3922e-02, PNorm = 501.0134, GNorm = 1.4086, lr_0 = 9.9778e-04
Validation binary_cross_entropy = 0.057029
Epoch 3546
Loss = 2.3028e-02, PNorm = 501.1115, GNorm = 0.1447, lr_0 = 9.9778e-04
Validation binary_cross_entropy = 0.083534
Epoch 3547
Loss = 5.6287e-03, PNorm = 501.1981, GNorm = 0.3310, lr_0 = 9.9778e-04
Validation binary_cross_entropy = 0.078250
Epoch 3548
Loss = 7.3132e-03, PNorm = 501.2421, GNorm = 0.3510, lr_0 = 9.9778e-04
Validation binary_cross_entropy = 0.059180
Epoch 3549
Loss = 4.6537e-03, PNorm = 501.2911, GNorm = 0.1971, lr_0 = 9.9778e-04
Loss = 3.4444e-02, PNorm = 501.3554, GNorm = 1.3611, lr_0 = 9.9777e-04
Validation binary_cross_entropy = 0.062970
Epoch 3550
Loss = 1.0057e-02, PNorm = 501.4427, GNorm = 0.0525, lr_0 = 9.9777e-04
Validation binary_cross_entropy = 0.081727
Epoch 3551
Loss = 1.6140e-01, PNorm = 501.5177, GNorm = 0.0574, lr_0 = 9.9777e-04
Validation binary_cross_entropy = 0.054053
Epoch 3552
Loss = 9.0398e-02, PNorm = 501.6451, GNorm = 1.2126, lr_0 = 9.9777e-04
Validation binary_cross_entropy = 0.083680
Epoch 3553
Loss = 5.8002e-02, PNorm = 501.7693, GNorm = 0.6604, lr_0 = 9.9777e-04
Validation binary_cross_entropy = 0.077611
Epoch 3554
Loss = 5.4512e-02, PNorm = 501.8673, GNorm = 1.3592, lr_0 = 9.9777e-04
Validation binary_cross_entropy = 0.069239
Epoch 3555
Loss = 1.9666e-02, PNorm = 501.9568, GNorm = 1.3583, lr_0 = 9.9777e-04
Validation binary_cross_entropy = 0.075043
Epoch 3556
Loss = 1.1858e-01, PNorm = 502.0363, GNorm = 0.7094, lr_0 = 9.9777e-04
Validation binary_cross_entropy = 0.061891
Epoch 3557
Loss = 2.2474e-02, PNorm = 502.1041, GNorm = 0.1934, lr_0 = 9.9777e-04
Validation binary_cross_entropy = 0.057474
Epoch 3558
Loss = 3.7024e-02, PNorm = 502.1707, GNorm = 1.7325, lr_0 = 9.9777e-04
Validation binary_cross_entropy = 0.056846
Epoch 3559
Loss = 2.8635e-02, PNorm = 502.2497, GNorm = 0.9670, lr_0 = 9.9777e-04
Loss = 5.5811e-02, PNorm = 502.3158, GNorm = 1.2313, lr_0 = 9.9777e-04
Validation binary_cross_entropy = 0.053802
Epoch 3560
Loss = 1.5646e-02, PNorm = 502.3859, GNorm = 0.2494, lr_0 = 9.9777e-04
Validation binary_cross_entropy = 0.064721
Epoch 3561
Loss = 2.0536e-01, PNorm = 502.4853, GNorm = 2.6952, lr_0 = 9.9777e-04
Validation binary_cross_entropy = 0.051834
Epoch 3562
Loss = 3.0566e-02, PNorm = 502.6229, GNorm = 0.3660, lr_0 = 9.9776e-04
Validation binary_cross_entropy = 0.060792
Epoch 3563
Loss = 3.3341e-02, PNorm = 502.7121, GNorm = 0.9000, lr_0 = 9.9776e-04
Validation binary_cross_entropy = 0.075656
Epoch 3564
Loss = 7.3066e-02, PNorm = 502.7731, GNorm = 2.1172, lr_0 = 9.9776e-04
Validation binary_cross_entropy = 0.076234
Epoch 3565
Loss = 4.7677e-02, PNorm = 502.8382, GNorm = 2.3472, lr_0 = 9.9776e-04
Validation binary_cross_entropy = 0.058767
Epoch 3566
Loss = 4.4574e-02, PNorm = 502.9125, GNorm = 1.0034, lr_0 = 9.9776e-04
Validation binary_cross_entropy = 0.054664
Epoch 3567
Loss = 3.5865e-02, PNorm = 502.9876, GNorm = 2.4575, lr_0 = 9.9776e-04
Validation binary_cross_entropy = 0.088724
Epoch 3568
Loss = 6.6644e-03, PNorm = 503.0626, GNorm = 0.0695, lr_0 = 9.9776e-04
Validation binary_cross_entropy = 0.087389
Epoch 3569
Loss = 2.2595e-02, PNorm = 503.1071, GNorm = 0.8958, lr_0 = 9.9776e-04
Loss = 4.5077e-02, PNorm = 503.1542, GNorm = 0.2991, lr_0 = 9.9776e-04
Validation binary_cross_entropy = 0.059821
Epoch 3570
Loss = 5.7135e-02, PNorm = 503.2360, GNorm = 0.9034, lr_0 = 9.9776e-04
Validation binary_cross_entropy = 0.061153
Epoch 3571
Loss = 6.3751e-02, PNorm = 503.3369, GNorm = 0.4310, lr_0 = 9.9776e-04
Validation binary_cross_entropy = 0.056323
Epoch 3572
Loss = 4.2969e-02, PNorm = 503.4278, GNorm = 0.8636, lr_0 = 9.9776e-04
Validation binary_cross_entropy = 0.064960
Epoch 3573
Loss = 9.8621e-02, PNorm = 503.4952, GNorm = 1.3010, lr_0 = 9.9776e-04
Validation binary_cross_entropy = 0.059874
Epoch 3574
Loss = 2.3264e-02, PNorm = 503.5820, GNorm = 1.1430, lr_0 = 9.9776e-04
Validation binary_cross_entropy = 0.069859
Epoch 3575
Loss = 2.6847e-02, PNorm = 503.6532, GNorm = 2.8992, lr_0 = 9.9776e-04
Validation binary_cross_entropy = 0.066271
Epoch 3576
Loss = 1.0378e-02, PNorm = 503.7043, GNorm = 0.2498, lr_0 = 9.9775e-04
Validation binary_cross_entropy = 0.085627
Epoch 3577
Loss = 5.5013e-02, PNorm = 503.7640, GNorm = 1.0479, lr_0 = 9.9775e-04
Validation binary_cross_entropy = 0.126037
Epoch 3578
Loss = 2.7234e-02, PNorm = 503.8126, GNorm = 0.3377, lr_0 = 9.9775e-04
Validation binary_cross_entropy = 0.056890
Epoch 3579
Loss = 4.0302e-02, PNorm = 503.8792, GNorm = 1.1943, lr_0 = 9.9775e-04
Loss = 2.6946e-02, PNorm = 503.9913, GNorm = 0.4165, lr_0 = 9.9775e-04
Validation binary_cross_entropy = 0.102189
Epoch 3580
Loss = 5.3843e-02, PNorm = 504.0569, GNorm = 0.7944, lr_0 = 9.9775e-04
Validation binary_cross_entropy = 0.081976
Epoch 3581
Loss = 2.7226e-02, PNorm = 504.1055, GNorm = 0.5024, lr_0 = 9.9775e-04
Validation binary_cross_entropy = 0.064890
Epoch 3582
Loss = 7.6709e-02, PNorm = 504.1755, GNorm = 4.9197, lr_0 = 9.9775e-04
Validation binary_cross_entropy = 0.066380
Epoch 3583
Loss = 2.7514e-02, PNorm = 504.2407, GNorm = 0.2850, lr_0 = 9.9775e-04
Validation binary_cross_entropy = 0.052367
Epoch 3584
Loss = 2.5885e-02, PNorm = 504.2995, GNorm = 0.5510, lr_0 = 9.9775e-04
Validation binary_cross_entropy = 0.056192
Epoch 3585
Loss = 3.1587e-02, PNorm = 504.3639, GNorm = 0.2283, lr_0 = 9.9775e-04
Validation binary_cross_entropy = 0.066402
Epoch 3586
Loss = 6.3840e-03, PNorm = 504.4211, GNorm = 0.1366, lr_0 = 9.9775e-04
Validation binary_cross_entropy = 0.076737
Epoch 3587
Loss = 1.0515e-02, PNorm = 504.4715, GNorm = 0.0874, lr_0 = 9.9775e-04
Validation binary_cross_entropy = 0.058357
Epoch 3588
Loss = 3.7454e-02, PNorm = 504.5435, GNorm = 2.4862, lr_0 = 9.9775e-04
Validation binary_cross_entropy = 0.096678
Epoch 3589
Loss = 1.3538e-02, PNorm = 504.6274, GNorm = 0.6032, lr_0 = 9.9774e-04
Loss = 5.4750e-03, PNorm = 504.7013, GNorm = 0.0518, lr_0 = 9.9774e-04
Validation binary_cross_entropy = 0.093883
Epoch 3590
Loss = 2.9356e-02, PNorm = 504.7496, GNorm = 0.1766, lr_0 = 9.9774e-04
Validation binary_cross_entropy = 0.068314
Epoch 3591
Loss = 5.1190e-02, PNorm = 504.8016, GNorm = 0.8780, lr_0 = 9.9774e-04
Validation binary_cross_entropy = 0.082363
Epoch 3592
Loss = 2.3621e-02, PNorm = 504.8617, GNorm = 0.2705, lr_0 = 9.9774e-04
Validation binary_cross_entropy = 0.086493
Epoch 3593
Loss = 3.5278e-02, PNorm = 504.9232, GNorm = 3.6843, lr_0 = 9.9774e-04
Validation binary_cross_entropy = 0.098697
Epoch 3594
Loss = 7.0774e-02, PNorm = 504.9799, GNorm = 2.8705, lr_0 = 9.9774e-04
Validation binary_cross_entropy = 0.069396
Epoch 3595
Loss = 5.3459e-02, PNorm = 505.0565, GNorm = 0.7911, lr_0 = 9.9774e-04
Validation binary_cross_entropy = 0.062090
Epoch 3596
Loss = 1.8171e-02, PNorm = 505.1414, GNorm = 0.2395, lr_0 = 9.9774e-04
Validation binary_cross_entropy = 0.090130
Epoch 3597
Loss = 4.4264e-03, PNorm = 505.2140, GNorm = 0.2548, lr_0 = 9.9774e-04
Validation binary_cross_entropy = 0.067110
Epoch 3598
Loss = 5.3054e-03, PNorm = 505.2635, GNorm = 0.4154, lr_0 = 9.9774e-04
Validation binary_cross_entropy = 0.075659
Epoch 3599
Loss = 5.2440e-03, PNorm = 505.3228, GNorm = 0.2092, lr_0 = 9.9774e-04
Loss = 1.9980e-02, PNorm = 505.3819, GNorm = 0.0103, lr_0 = 9.9774e-04
Validation binary_cross_entropy = 0.102819
Epoch 3600
Loss = 6.4755e-02, PNorm = 505.4124, GNorm = 0.7704, lr_0 = 9.9774e-04
Validation binary_cross_entropy = 0.070103
Epoch 3601
Loss = 2.5402e-02, PNorm = 505.4804, GNorm = 0.9861, lr_0 = 9.9774e-04
Validation binary_cross_entropy = 0.066709
Epoch 3602
Loss = 3.8768e-02, PNorm = 505.5913, GNorm = 0.2666, lr_0 = 9.9773e-04
Validation binary_cross_entropy = 0.089630
Epoch 3603
Loss = 4.3996e-02, PNorm = 505.6647, GNorm = 0.2515, lr_0 = 9.9773e-04
Validation binary_cross_entropy = 0.069875
Epoch 3604
Loss = 2.4692e-02, PNorm = 505.7321, GNorm = 0.3945, lr_0 = 9.9773e-04
Validation binary_cross_entropy = 0.065579
Epoch 3605
Loss = 2.8026e-02, PNorm = 505.7920, GNorm = 0.9612, lr_0 = 9.9773e-04
Validation binary_cross_entropy = 0.065756
Epoch 3606
Loss = 4.7586e-02, PNorm = 505.8374, GNorm = 0.6442, lr_0 = 9.9773e-04
Validation binary_cross_entropy = 0.066725
Epoch 3607
Loss = 2.3241e-02, PNorm = 505.9250, GNorm = 0.3470, lr_0 = 9.9773e-04
Validation binary_cross_entropy = 0.067959
Epoch 3608
Loss = 1.4204e-02, PNorm = 505.9967, GNorm = 0.0921, lr_0 = 9.9773e-04
Validation binary_cross_entropy = 0.078609
Epoch 3609
Loss = 2.6101e-02, PNorm = 506.0816, GNorm = 1.3988, lr_0 = 9.9773e-04
Loss = 1.9323e-02, PNorm = 506.1432, GNorm = 0.2389, lr_0 = 9.9773e-04
Validation binary_cross_entropy = 0.070825
Epoch 3610
Loss = 7.7265e-03, PNorm = 506.1898, GNorm = 0.0221, lr_0 = 9.9773e-04
Validation binary_cross_entropy = 0.078134
Epoch 3611
Loss = 4.6782e-02, PNorm = 506.2401, GNorm = 5.9894, lr_0 = 9.9773e-04
Validation binary_cross_entropy = 0.091340
Epoch 3612
Loss = 1.0519e-01, PNorm = 506.2938, GNorm = 21.7038, lr_0 = 9.9773e-04
Validation binary_cross_entropy = 0.058367
Epoch 3613
Loss = 2.9023e-02, PNorm = 506.3806, GNorm = 3.8254, lr_0 = 9.9773e-04
Validation binary_cross_entropy = 0.057223
Epoch 3614
Loss = 8.5618e-03, PNorm = 506.4709, GNorm = 0.5075, lr_0 = 9.9773e-04
Validation binary_cross_entropy = 0.073593
Epoch 3615
Loss = 1.6374e-02, PNorm = 506.5355, GNorm = 0.0550, lr_0 = 9.9772e-04
Validation binary_cross_entropy = 0.069685
Epoch 3616
Loss = 4.7312e-02, PNorm = 506.5848, GNorm = 0.2057, lr_0 = 9.9772e-04
Validation binary_cross_entropy = 0.056165
Epoch 3617
Loss = 3.6420e-02, PNorm = 506.6424, GNorm = 0.8923, lr_0 = 9.9772e-04
Validation binary_cross_entropy = 0.062286
Epoch 3618
Loss = 6.8937e-03, PNorm = 506.7077, GNorm = 0.4023, lr_0 = 9.9772e-04
Validation binary_cross_entropy = 0.061157
Epoch 3619
Loss = 5.4957e-03, PNorm = 506.7660, GNorm = 0.2221, lr_0 = 9.9772e-04
Loss = 1.6445e-02, PNorm = 506.8159, GNorm = 0.3914, lr_0 = 9.9772e-04
Validation binary_cross_entropy = 0.069142
Epoch 3620
Loss = 1.3516e-02, PNorm = 506.8661, GNorm = 0.5069, lr_0 = 9.9772e-04
Validation binary_cross_entropy = 0.081296
Epoch 3621
Loss = 6.0254e-02, PNorm = 506.9118, GNorm = 0.0306, lr_0 = 9.9772e-04
Validation binary_cross_entropy = 0.058889
Epoch 3622
Loss = 2.1964e-02, PNorm = 506.9892, GNorm = 0.4717, lr_0 = 9.9772e-04
Validation binary_cross_entropy = 0.062477
Epoch 3623
Loss = 4.5754e-02, PNorm = 507.0602, GNorm = 1.2632, lr_0 = 9.9772e-04
Validation binary_cross_entropy = 0.062672
Epoch 3624
Loss = 2.6766e-02, PNorm = 507.1217, GNorm = 0.3382, lr_0 = 9.9772e-04
Validation binary_cross_entropy = 0.056165
Epoch 3625
Loss = 4.1687e-02, PNorm = 507.1915, GNorm = 0.1284, lr_0 = 9.9772e-04
Validation binary_cross_entropy = 0.045546
Epoch 3626
Loss = 2.2153e-02, PNorm = 507.2590, GNorm = 1.3287, lr_0 = 9.9772e-04
Validation binary_cross_entropy = 0.047963
Epoch 3627
Loss = 5.2337e-02, PNorm = 507.3292, GNorm = 0.1768, lr_0 = 9.9772e-04
Validation binary_cross_entropy = 0.051216
Epoch 3628
Loss = 5.7784e-02, PNorm = 507.3835, GNorm = 0.5929, lr_0 = 9.9772e-04
Validation binary_cross_entropy = 0.053555
Epoch 3629
Loss = 3.0347e-02, PNorm = 507.4352, GNorm = 1.0549, lr_0 = 9.9771e-04
Loss = 8.9354e-03, PNorm = 507.4846, GNorm = 0.0627, lr_0 = 9.9771e-04
Validation binary_cross_entropy = 0.060550
Epoch 3630
Loss = 3.7149e-02, PNorm = 507.5276, GNorm = 0.1850, lr_0 = 9.9771e-04
Validation binary_cross_entropy = 0.062982
Epoch 3631
Loss = 1.1542e-02, PNorm = 507.5689, GNorm = 0.0392, lr_0 = 9.9771e-04
Validation binary_cross_entropy = 0.062476
Epoch 3632
Loss = 2.8516e-02, PNorm = 507.6043, GNorm = 0.8905, lr_0 = 9.9771e-04
Validation binary_cross_entropy = 0.062840
Epoch 3633
Loss = 6.6463e-03, PNorm = 507.6378, GNorm = 0.0292, lr_0 = 9.9771e-04
Validation binary_cross_entropy = 0.063418
Epoch 3634
Loss = 2.2641e-02, PNorm = 507.6727, GNorm = 0.0580, lr_0 = 9.9771e-04
Validation binary_cross_entropy = 0.069709
Epoch 3635
Loss = 5.7860e-03, PNorm = 507.7064, GNorm = 1.4248, lr_0 = 9.9771e-04
Validation binary_cross_entropy = 0.074335
Epoch 3636
Loss = 3.3269e-02, PNorm = 507.7413, GNorm = 2.1629, lr_0 = 9.9771e-04
Validation binary_cross_entropy = 0.088963
Epoch 3637
Loss = 4.4911e-02, PNorm = 507.7998, GNorm = 0.0214, lr_0 = 9.9771e-04
Validation binary_cross_entropy = 0.059783
Epoch 3638
Loss = 3.4397e-02, PNorm = 507.8764, GNorm = 2.2114, lr_0 = 9.9771e-04
Validation binary_cross_entropy = 0.088146
Epoch 3639
Loss = 1.4009e-03, PNorm = 507.9714, GNorm = 0.0335, lr_0 = 9.9771e-04
Loss = 9.4766e-03, PNorm = 508.0263, GNorm = 0.0601, lr_0 = 9.9771e-04
Validation binary_cross_entropy = 0.106249
Epoch 3640
Loss = 9.4248e-03, PNorm = 508.0640, GNorm = 0.0602, lr_0 = 9.9771e-04
Validation binary_cross_entropy = 0.132814
Epoch 3641
Loss = 1.3782e-02, PNorm = 508.0996, GNorm = 0.0657, lr_0 = 9.9771e-04
Validation binary_cross_entropy = 0.163726
Epoch 3642
Loss = 3.9275e-03, PNorm = 508.1400, GNorm = 0.4898, lr_0 = 9.9770e-04
Validation binary_cross_entropy = 0.126083
Epoch 3643
Loss = 4.8010e-02, PNorm = 508.1950, GNorm = 0.0338, lr_0 = 9.9770e-04
Validation binary_cross_entropy = 0.083110
Epoch 3644
Loss = 7.5671e-03, PNorm = 508.2625, GNorm = 0.1603, lr_0 = 9.9770e-04
Validation binary_cross_entropy = 0.102791
Epoch 3645
Loss = 1.5528e-03, PNorm = 508.3188, GNorm = 0.0115, lr_0 = 9.9770e-04
Validation binary_cross_entropy = 0.092874
Epoch 3646
Loss = 8.0302e-03, PNorm = 508.3510, GNorm = 0.5179, lr_0 = 9.9770e-04
Validation binary_cross_entropy = 0.058549
Epoch 3647
Loss = 3.2232e-02, PNorm = 508.4099, GNorm = 1.1888, lr_0 = 9.9770e-04
Validation binary_cross_entropy = 0.048860
Epoch 3648
Loss = 7.6017e-03, PNorm = 508.4821, GNorm = 0.1178, lr_0 = 9.9770e-04
Validation binary_cross_entropy = 0.047829
Epoch 3649
Loss = 1.7944e-02, PNorm = 508.5561, GNorm = 0.6744, lr_0 = 9.9770e-04
Loss = 2.6126e-02, PNorm = 508.6234, GNorm = 0.3031, lr_0 = 9.9770e-04
Validation binary_cross_entropy = 0.050406
Epoch 3650
Loss = 2.8459e-02, PNorm = 508.6921, GNorm = 0.3323, lr_0 = 9.9770e-04
Validation binary_cross_entropy = 0.059034
Epoch 3651
Loss = 1.9567e-02, PNorm = 508.7445, GNorm = 0.5373, lr_0 = 9.9770e-04
Validation binary_cross_entropy = 0.061276
Epoch 3652
Loss = 2.0175e-02, PNorm = 508.7850, GNorm = 0.1183, lr_0 = 9.9770e-04
Validation binary_cross_entropy = 0.059508
Epoch 3653
Loss = 3.2102e-02, PNorm = 508.8201, GNorm = 1.0324, lr_0 = 9.9770e-04
Validation binary_cross_entropy = 0.056548
Epoch 3654
Loss = 3.8422e-02, PNorm = 508.8653, GNorm = 0.4416, lr_0 = 9.9770e-04
Validation binary_cross_entropy = 0.058354
Epoch 3655
Loss = 2.1402e-02, PNorm = 508.9088, GNorm = 0.9070, lr_0 = 9.9769e-04
Validation binary_cross_entropy = 0.064472
Epoch 3656
Loss = 1.6463e-01, PNorm = 508.9542, GNorm = 0.2627, lr_0 = 9.9769e-04
Validation binary_cross_entropy = 0.051823
Epoch 3657
Loss = 9.8067e-03, PNorm = 509.0479, GNorm = 0.6648, lr_0 = 9.9769e-04
Validation binary_cross_entropy = 0.058660
Epoch 3658
Loss = 3.9184e-02, PNorm = 509.1278, GNorm = 0.2938, lr_0 = 9.9769e-04
Validation binary_cross_entropy = 0.052279
Epoch 3659
Loss = 4.2664e-03, PNorm = 509.2035, GNorm = 0.1186, lr_0 = 9.9769e-04
Loss = 1.8504e-02, PNorm = 509.2731, GNorm = 0.1699, lr_0 = 9.9769e-04
Validation binary_cross_entropy = 0.060126
Epoch 3660
Loss = 2.2009e-02, PNorm = 509.3275, GNorm = 1.9096, lr_0 = 9.9769e-04
Validation binary_cross_entropy = 0.066522
Epoch 3661
Loss = 1.9507e-02, PNorm = 509.3917, GNorm = 0.0288, lr_0 = 9.9769e-04
Validation binary_cross_entropy = 0.072587
Epoch 3662
Loss = 4.0975e-02, PNorm = 509.4512, GNorm = 0.4186, lr_0 = 9.9769e-04
Validation binary_cross_entropy = 0.070212
Epoch 3663
Loss = 1.3992e-01, PNorm = 509.4966, GNorm = 0.0393, lr_0 = 9.9769e-04
Validation binary_cross_entropy = 0.065230
Epoch 3664
Loss = 3.8074e-02, PNorm = 509.5689, GNorm = 3.2203, lr_0 = 9.9769e-04
Validation binary_cross_entropy = 0.068840
Epoch 3665
Loss = 4.2123e-02, PNorm = 509.6762, GNorm = 0.2962, lr_0 = 9.9769e-04
Validation binary_cross_entropy = 0.071438
Epoch 3666
Loss = 1.1321e-02, PNorm = 509.7739, GNorm = 1.4087, lr_0 = 9.9769e-04
Validation binary_cross_entropy = 0.067516
Epoch 3667
Loss = 5.2269e-02, PNorm = 509.8433, GNorm = 2.7242, lr_0 = 9.9769e-04
Validation binary_cross_entropy = 0.079186
Epoch 3668
Loss = 3.5571e-03, PNorm = 509.9160, GNorm = 0.4146, lr_0 = 9.9769e-04
Validation binary_cross_entropy = 0.074678
Epoch 3669
Loss = 3.7766e-03, PNorm = 509.9659, GNorm = 0.1016, lr_0 = 9.9768e-04
Loss = 5.1359e-02, PNorm = 510.0206, GNorm = 0.3901, lr_0 = 9.9768e-04
Validation binary_cross_entropy = 0.064912
Epoch 3670
Loss = 3.7088e-02, PNorm = 510.0849, GNorm = 0.6411, lr_0 = 9.9768e-04
Validation binary_cross_entropy = 0.062375
Epoch 3671
Loss = 2.8359e-02, PNorm = 510.1505, GNorm = 1.5580, lr_0 = 9.9768e-04
Validation binary_cross_entropy = 0.071551
Epoch 3672
Loss = 1.4596e-02, PNorm = 510.2170, GNorm = 0.0338, lr_0 = 9.9768e-04
Validation binary_cross_entropy = 0.091308
Epoch 3673
Loss = 3.5197e-02, PNorm = 510.2669, GNorm = 0.2303, lr_0 = 9.9768e-04
Validation binary_cross_entropy = 0.068319
Epoch 3674
Loss = 1.6003e-02, PNorm = 510.3302, GNorm = 0.6512, lr_0 = 9.9768e-04
Validation binary_cross_entropy = 0.085767
Epoch 3675
Loss = 2.2586e-02, PNorm = 510.4018, GNorm = 0.0318, lr_0 = 9.9768e-04
Validation binary_cross_entropy = 0.098541
Epoch 3676
Loss = 3.5125e-02, PNorm = 510.4558, GNorm = 0.2997, lr_0 = 9.9768e-04
Validation binary_cross_entropy = 0.081509
Epoch 3677
Loss = 2.9855e-02, PNorm = 510.4987, GNorm = 0.9974, lr_0 = 9.9768e-04
Validation binary_cross_entropy = 0.099443
Epoch 3678
Loss = 2.4641e-03, PNorm = 510.5652, GNorm = 0.2200, lr_0 = 9.9768e-04
Validation binary_cross_entropy = 0.118109
Epoch 3679
Loss = 5.0012e-02, PNorm = 510.6289, GNorm = 1.3691, lr_0 = 9.9768e-04
Loss = 5.0857e-02, PNorm = 510.6806, GNorm = 0.0341, lr_0 = 9.9768e-04
Validation binary_cross_entropy = 0.098401
Epoch 3680
Loss = 3.6224e-02, PNorm = 510.7215, GNorm = 0.4827, lr_0 = 9.9768e-04
Validation binary_cross_entropy = 0.081770
Epoch 3681
Loss = 2.6719e-02, PNorm = 510.7619, GNorm = 0.1050, lr_0 = 9.9767e-04
Validation binary_cross_entropy = 0.063255
Epoch 3682
Loss = 6.0958e-03, PNorm = 510.8189, GNorm = 0.0470, lr_0 = 9.9767e-04
Validation binary_cross_entropy = 0.078253
Epoch 3683
Loss = 3.9077e-02, PNorm = 510.8790, GNorm = 0.0364, lr_0 = 9.9767e-04
Validation binary_cross_entropy = 0.061724
Epoch 3684
Loss = 5.0100e-03, PNorm = 510.9380, GNorm = 0.1095, lr_0 = 9.9767e-04
Validation binary_cross_entropy = 0.067894
Epoch 3685
Loss = 8.6547e-02, PNorm = 510.9869, GNorm = 18.3547, lr_0 = 9.9767e-04
Validation binary_cross_entropy = 0.048320
Epoch 3686
Loss = 1.2478e-02, PNorm = 511.1718, GNorm = 0.0305, lr_0 = 9.9767e-04
Validation binary_cross_entropy = 0.058596
Epoch 3687
Loss = 1.3546e-01, PNorm = 511.3261, GNorm = 4.5951, lr_0 = 9.9767e-04
Validation binary_cross_entropy = 0.052847
Epoch 3688
Loss = 2.6329e-02, PNorm = 511.4461, GNorm = 0.3439, lr_0 = 9.9767e-04
Validation binary_cross_entropy = 0.063680
Epoch 3689
Loss = 7.4655e-04, PNorm = 511.5461, GNorm = 0.0272, lr_0 = 9.9767e-04
Loss = 1.2965e-02, PNorm = 511.6252, GNorm = 0.2980, lr_0 = 9.9767e-04
Validation binary_cross_entropy = 0.079173
Epoch 3690
Loss = 1.5966e-02, PNorm = 511.6785, GNorm = 0.2244, lr_0 = 9.9767e-04
Validation binary_cross_entropy = 0.086305
Epoch 3691
Loss = 5.1356e-02, PNorm = 511.7151, GNorm = 2.2560, lr_0 = 9.9767e-04
Validation binary_cross_entropy = 0.062756
Epoch 3692
Loss = 2.8704e-02, PNorm = 511.7741, GNorm = 0.4398, lr_0 = 9.9767e-04
Validation binary_cross_entropy = 0.061278
Epoch 3693
Loss = 2.4382e-02, PNorm = 511.8545, GNorm = 0.6016, lr_0 = 9.9767e-04
Validation binary_cross_entropy = 0.064523
Epoch 3694
Loss = 2.1984e-02, PNorm = 511.9628, GNorm = 2.7329, lr_0 = 9.9767e-04
Validation binary_cross_entropy = 0.091590
Epoch 3695
Loss = 1.4443e-02, PNorm = 512.0277, GNorm = 0.8795, lr_0 = 9.9766e-04
Validation binary_cross_entropy = 0.071636
Epoch 3696
Loss = 2.6510e-02, PNorm = 512.0825, GNorm = 0.3669, lr_0 = 9.9766e-04
Validation binary_cross_entropy = 0.069515
Epoch 3697
Loss = 4.0464e-02, PNorm = 512.1422, GNorm = 1.0022, lr_0 = 9.9766e-04
Validation binary_cross_entropy = 0.083666
Epoch 3698
Loss = 1.2266e-03, PNorm = 512.2201, GNorm = 0.1103, lr_0 = 9.9766e-04
Validation binary_cross_entropy = 0.087202
Epoch 3699
Loss = 1.8453e-03, PNorm = 512.2594, GNorm = 0.1260, lr_0 = 9.9766e-04
Loss = 1.0693e-02, PNorm = 512.2813, GNorm = 0.2208, lr_0 = 9.9766e-04
Validation binary_cross_entropy = 0.076201
Epoch 3700
Loss = 6.0659e-02, PNorm = 512.3189, GNorm = 1.0087, lr_0 = 9.9766e-04
Validation binary_cross_entropy = 0.063624
Epoch 3701
Loss = 1.5026e-02, PNorm = 512.3702, GNorm = 0.5588, lr_0 = 9.9766e-04
Validation binary_cross_entropy = 0.061714
Epoch 3702
Loss = 1.8894e-02, PNorm = 512.4143, GNorm = 0.1085, lr_0 = 9.9766e-04
Validation binary_cross_entropy = 0.062443
Epoch 3703
Loss = 1.2521e-02, PNorm = 512.4600, GNorm = 0.2209, lr_0 = 9.9766e-04
Validation binary_cross_entropy = 0.070864
Epoch 3704
Loss = 2.7236e-02, PNorm = 512.5013, GNorm = 0.0581, lr_0 = 9.9766e-04
Validation binary_cross_entropy = 0.079213
Epoch 3705
Loss = 5.4375e-02, PNorm = 512.5317, GNorm = 0.2362, lr_0 = 9.9766e-04
Validation binary_cross_entropy = 0.066783
Epoch 3706
Loss = 1.7606e-02, PNorm = 512.5631, GNorm = 1.2269, lr_0 = 9.9766e-04
Validation binary_cross_entropy = 0.071278
Epoch 3707
Loss = 4.3445e-02, PNorm = 512.6161, GNorm = 1.3737, lr_0 = 9.9766e-04
Validation binary_cross_entropy = 0.072220
Epoch 3708
Loss = 3.5272e-03, PNorm = 512.6752, GNorm = 0.1402, lr_0 = 9.9765e-04
Validation binary_cross_entropy = 0.068057
Epoch 3709
Loss = 7.3719e-03, PNorm = 512.7415, GNorm = 0.1381, lr_0 = 9.9765e-04
Loss = 5.8673e-03, PNorm = 512.8039, GNorm = 0.8120, lr_0 = 9.9765e-04
Validation binary_cross_entropy = 0.075333
Epoch 3710
Loss = 4.1229e-02, PNorm = 512.8620, GNorm = 0.6132, lr_0 = 9.9765e-04
Validation binary_cross_entropy = 0.079510
Epoch 3711
Loss = 5.9211e-02, PNorm = 512.9124, GNorm = 2.7939, lr_0 = 9.9765e-04
Validation binary_cross_entropy = 0.064779
Epoch 3712
Loss = 3.9802e-02, PNorm = 512.9920, GNorm = 0.2089, lr_0 = 9.9765e-04
Validation binary_cross_entropy = 0.061858
Epoch 3713
Loss = 3.1810e-02, PNorm = 513.0733, GNorm = 0.1484, lr_0 = 9.9765e-04
Validation binary_cross_entropy = 0.063295
Epoch 3714
Loss = 1.9904e-02, PNorm = 513.1476, GNorm = 0.2234, lr_0 = 9.9765e-04
Validation binary_cross_entropy = 0.065288
Epoch 3715
Loss = 4.0295e-02, PNorm = 513.2089, GNorm = 0.0484, lr_0 = 9.9765e-04
Validation binary_cross_entropy = 0.065124
Epoch 3716
Loss = 7.4266e-03, PNorm = 513.2527, GNorm = 0.0298, lr_0 = 9.9765e-04
Validation binary_cross_entropy = 0.067137
Epoch 3717
Loss = 4.6100e-02, PNorm = 513.2875, GNorm = 1.7359, lr_0 = 9.9765e-04
Validation binary_cross_entropy = 0.060806
Epoch 3718
Loss = 5.5326e-03, PNorm = 513.3310, GNorm = 0.1764, lr_0 = 9.9765e-04
Validation binary_cross_entropy = 0.064496
Epoch 3719
Loss = 2.6303e-03, PNorm = 513.3785, GNorm = 0.0739, lr_0 = 9.9765e-04
Loss = 8.6068e-03, PNorm = 513.4280, GNorm = 2.9069, lr_0 = 9.9765e-04
Validation binary_cross_entropy = 0.081048
Epoch 3720
Loss = 3.0507e-02, PNorm = 513.4754, GNorm = 0.1263, lr_0 = 9.9765e-04
Validation binary_cross_entropy = 0.077578
Epoch 3721
Loss = 8.5598e-02, PNorm = 513.6046, GNorm = 1.0199, lr_0 = 9.9764e-04
Validation binary_cross_entropy = 0.076277
Epoch 3722
Loss = 3.5915e-02, PNorm = 513.7257, GNorm = 1.9031, lr_0 = 9.9764e-04
Validation binary_cross_entropy = 0.070011
Epoch 3723
Loss = 4.6517e-02, PNorm = 513.8170, GNorm = 4.8379, lr_0 = 9.9764e-04
Validation binary_cross_entropy = 0.073096
Epoch 3724
Loss = 5.3341e-02, PNorm = 513.9011, GNorm = 1.1404, lr_0 = 9.9764e-04
Validation binary_cross_entropy = 0.061717
Epoch 3725
Loss = 2.7654e-02, PNorm = 513.9892, GNorm = 0.3600, lr_0 = 9.9764e-04
Validation binary_cross_entropy = 0.067979
Epoch 3726
Loss = 2.7272e-02, PNorm = 514.0540, GNorm = 0.2670, lr_0 = 9.9764e-04
Validation binary_cross_entropy = 0.062379
Epoch 3727
Loss = 7.4681e-03, PNorm = 514.1405, GNorm = 0.2633, lr_0 = 9.9764e-04
Validation binary_cross_entropy = 0.067440
Epoch 3728
Loss = 7.5428e-03, PNorm = 514.2087, GNorm = 0.1375, lr_0 = 9.9764e-04
Validation binary_cross_entropy = 0.061276
Epoch 3729
Loss = 3.5736e-02, PNorm = 514.2554, GNorm = 1.0531, lr_0 = 9.9764e-04
Loss = 2.7263e-02, PNorm = 514.3042, GNorm = 0.8206, lr_0 = 9.9764e-04
Validation binary_cross_entropy = 0.060709
Epoch 3730
Loss = 1.0372e-01, PNorm = 514.4127, GNorm = 0.5605, lr_0 = 9.9764e-04
Validation binary_cross_entropy = 0.051416
Epoch 3731
Loss = 3.0050e-02, PNorm = 514.5230, GNorm = 0.4039, lr_0 = 9.9764e-04
Validation binary_cross_entropy = 0.057130
Epoch 3732
Loss = 1.5552e-02, PNorm = 514.5923, GNorm = 0.0577, lr_0 = 9.9764e-04
Validation binary_cross_entropy = 0.064887
Epoch 3733
Loss = 1.7683e-02, PNorm = 514.6684, GNorm = 0.0053, lr_0 = 9.9764e-04
Validation binary_cross_entropy = 0.099560
Epoch 3734
Loss = 9.6691e-03, PNorm = 514.7173, GNorm = 0.0059, lr_0 = 9.9763e-04
Validation binary_cross_entropy = 0.088624
Epoch 3735
Loss = 1.5994e-02, PNorm = 514.7774, GNorm = 0.7939, lr_0 = 9.9763e-04
Validation binary_cross_entropy = 0.082334
Epoch 3736
Loss = 4.8287e-02, PNorm = 514.8363, GNorm = 1.1543, lr_0 = 9.9763e-04
Validation binary_cross_entropy = 0.097951
Epoch 3737
Loss = 1.1471e-02, PNorm = 514.8849, GNorm = 0.4619, lr_0 = 9.9763e-04
Validation binary_cross_entropy = 0.074455
Epoch 3738
Loss = 4.7051e-02, PNorm = 514.9202, GNorm = 0.0757, lr_0 = 9.9763e-04
Validation binary_cross_entropy = 0.055294
Epoch 3739
Loss = 2.8716e-03, PNorm = 514.9717, GNorm = 0.2515, lr_0 = 9.9763e-04
Loss = 3.2664e-02, PNorm = 515.0142, GNorm = 2.2267, lr_0 = 9.9763e-04
Validation binary_cross_entropy = 0.052393
Epoch 3740
Loss = 2.6731e-02, PNorm = 515.0694, GNorm = 0.1865, lr_0 = 9.9763e-04
Validation binary_cross_entropy = 0.060813
Epoch 3741
Loss = 1.8650e-02, PNorm = 515.1130, GNorm = 0.0222, lr_0 = 9.9763e-04
Validation binary_cross_entropy = 0.056994
Epoch 3742
Loss = 3.0108e-02, PNorm = 515.1606, GNorm = 0.0202, lr_0 = 9.9763e-04
Validation binary_cross_entropy = 0.056795
Epoch 3743
Loss = 3.8386e-02, PNorm = 515.2272, GNorm = 1.7432, lr_0 = 9.9763e-04
Validation binary_cross_entropy = 0.062237
Epoch 3744
Loss = 7.9362e-03, PNorm = 515.2792, GNorm = 0.0277, lr_0 = 9.9763e-04
Validation binary_cross_entropy = 0.065416
Epoch 3745
Loss = 1.4036e-02, PNorm = 515.3143, GNorm = 0.0599, lr_0 = 9.9763e-04
Validation binary_cross_entropy = 0.076016
Epoch 3746
Loss = 2.3610e-02, PNorm = 515.3453, GNorm = 0.5415, lr_0 = 9.9763e-04
Validation binary_cross_entropy = 0.071632
Epoch 3747
Loss = 6.1007e-04, PNorm = 515.3813, GNorm = 0.0162, lr_0 = 9.9763e-04
Validation binary_cross_entropy = 0.075039
Epoch 3748
Loss = 1.9617e-03, PNorm = 515.4420, GNorm = 0.2306, lr_0 = 9.9762e-04
Validation binary_cross_entropy = 0.059615
Epoch 3749
Loss = 5.5412e-03, PNorm = 515.5283, GNorm = 0.1216, lr_0 = 9.9762e-04
Loss = 3.3049e-02, PNorm = 515.6095, GNorm = 1.4954, lr_0 = 9.9762e-04
Validation binary_cross_entropy = 0.059607
Epoch 3750
Loss = 4.2076e-02, PNorm = 515.6826, GNorm = 2.5809, lr_0 = 9.9762e-04
Validation binary_cross_entropy = 0.068817
Epoch 3751
Loss = 4.0258e-02, PNorm = 515.7489, GNorm = 1.2527, lr_0 = 9.9762e-04
Validation binary_cross_entropy = 0.070578
Epoch 3752
Loss = 3.7491e-02, PNorm = 515.8006, GNorm = 1.8135, lr_0 = 9.9762e-04
Validation binary_cross_entropy = 0.061097
Epoch 3753
Loss = 1.4496e-02, PNorm = 515.8581, GNorm = 1.7419, lr_0 = 9.9762e-04
Validation binary_cross_entropy = 0.075997
Epoch 3754
Loss = 3.7715e-02, PNorm = 515.9080, GNorm = 0.1739, lr_0 = 9.9762e-04
Validation binary_cross_entropy = 0.079271
Epoch 3755
Loss = 4.5466e-02, PNorm = 515.9463, GNorm = 0.1011, lr_0 = 9.9762e-04
Validation binary_cross_entropy = 0.053755
Epoch 3756
Loss = 2.0002e-02, PNorm = 515.9841, GNorm = 0.2068, lr_0 = 9.9762e-04
Validation binary_cross_entropy = 0.057069
Epoch 3757
Loss = 9.1628e-02, PNorm = 516.0484, GNorm = 7.9991, lr_0 = 9.9762e-04
Validation binary_cross_entropy = 0.058594
Epoch 3758
Loss = 9.9717e-03, PNorm = 516.1115, GNorm = 0.2833, lr_0 = 9.9762e-04
Validation binary_cross_entropy = 0.061789
Epoch 3759
Loss = 1.9428e-02, PNorm = 516.1639, GNorm = 1.1329, lr_0 = 9.9762e-04
Loss = 4.9122e-03, PNorm = 516.2124, GNorm = 0.0777, lr_0 = 9.9762e-04
Validation binary_cross_entropy = 0.086032
Epoch 3760
Loss = 5.1024e-02, PNorm = 516.2515, GNorm = 1.8946, lr_0 = 9.9762e-04
Validation binary_cross_entropy = 0.067558
Epoch 3761
Loss = 2.6311e-02, PNorm = 516.3558, GNorm = 0.8989, lr_0 = 9.9761e-04
Validation binary_cross_entropy = 0.075105
Epoch 3762
Loss = 1.2230e-02, PNorm = 516.4401, GNorm = 0.0051, lr_0 = 9.9761e-04
Validation binary_cross_entropy = 0.088846
Epoch 3763
Loss = 3.3273e-02, PNorm = 516.4907, GNorm = 0.4189, lr_0 = 9.9761e-04
Validation binary_cross_entropy = 0.070294
Epoch 3764
Loss = 2.0853e-02, PNorm = 516.5411, GNorm = 1.3231, lr_0 = 9.9761e-04
Validation binary_cross_entropy = 0.069875
Epoch 3765
Loss = 3.4677e-03, PNorm = 516.6346, GNorm = 0.2355, lr_0 = 9.9761e-04
Validation binary_cross_entropy = 0.121839
Epoch 3766
Loss = 8.0991e-02, PNorm = 516.7059, GNorm = 0.5706, lr_0 = 9.9761e-04
Validation binary_cross_entropy = 0.065697
Epoch 3767
Loss = 8.9338e-02, PNorm = 516.7830, GNorm = 2.1098, lr_0 = 9.9761e-04
Validation binary_cross_entropy = 0.080324
Epoch 3768
Loss = 5.5299e-02, PNorm = 516.8591, GNorm = 1.2243, lr_0 = 9.9761e-04
Validation binary_cross_entropy = 0.060759
Epoch 3769
Loss = 1.5761e-02, PNorm = 516.9190, GNorm = 0.4150, lr_0 = 9.9761e-04
Loss = 2.1685e-02, PNorm = 516.9766, GNorm = 0.0761, lr_0 = 9.9761e-04
Validation binary_cross_entropy = 0.060484
Epoch 3770
Loss = 4.9983e-02, PNorm = 517.0251, GNorm = 0.3758, lr_0 = 9.9761e-04
Validation binary_cross_entropy = 0.059411
Epoch 3771
Loss = 2.9316e-02, PNorm = 517.0622, GNorm = 1.3647, lr_0 = 9.9761e-04
Validation binary_cross_entropy = 0.056061
Epoch 3772
Loss = 3.1707e-02, PNorm = 517.1030, GNorm = 0.6062, lr_0 = 9.9761e-04
Validation binary_cross_entropy = 0.056155
Epoch 3773
Loss = 2.1160e-02, PNorm = 517.1553, GNorm = 0.3874, lr_0 = 9.9761e-04
Validation binary_cross_entropy = 0.061463
Epoch 3774
Loss = 1.2650e-02, PNorm = 517.1862, GNorm = 0.0175, lr_0 = 9.9760e-04
Validation binary_cross_entropy = 0.061966
Epoch 3775
Loss = 3.4951e-02, PNorm = 517.2124, GNorm = 0.3167, lr_0 = 9.9760e-04
Validation binary_cross_entropy = 0.059578
Epoch 3776
Loss = 3.5875e-02, PNorm = 517.2432, GNorm = 0.3141, lr_0 = 9.9760e-04
Validation binary_cross_entropy = 0.059161
Epoch 3777
Loss = 5.0455e-02, PNorm = 517.2806, GNorm = 1.0279, lr_0 = 9.9760e-04
Validation binary_cross_entropy = 0.057735
Epoch 3778
Loss = 4.5965e-03, PNorm = 517.3131, GNorm = 0.0952, lr_0 = 9.9760e-04
Validation binary_cross_entropy = 0.057144
Epoch 3779
Loss = 2.7490e-03, PNorm = 517.3563, GNorm = 0.1208, lr_0 = 9.9760e-04
Loss = 3.1838e-02, PNorm = 517.4292, GNorm = 0.1262, lr_0 = 9.9760e-04
Validation binary_cross_entropy = 0.060166
Epoch 3780
Loss = 5.7404e-03, PNorm = 517.4828, GNorm = 0.0203, lr_0 = 9.9760e-04
Validation binary_cross_entropy = 0.063051
Epoch 3781
Loss = 2.3346e-02, PNorm = 517.5176, GNorm = 0.1003, lr_0 = 9.9760e-04
Validation binary_cross_entropy = 0.062221
Epoch 3782
Loss = 1.7438e-02, PNorm = 517.5591, GNorm = 0.2342, lr_0 = 9.9760e-04
Validation binary_cross_entropy = 0.075653
Epoch 3783
Loss = 4.5294e-02, PNorm = 517.5923, GNorm = 3.0865, lr_0 = 9.9760e-04
Validation binary_cross_entropy = 0.062568
Epoch 3784
Loss = 6.6722e-03, PNorm = 517.6315, GNorm = 0.1205, lr_0 = 9.9760e-04
Validation binary_cross_entropy = 0.059670
Epoch 3785
Loss = 2.3057e-02, PNorm = 517.6657, GNorm = 0.0998, lr_0 = 9.9760e-04
Validation binary_cross_entropy = 0.059991
Epoch 3786
Loss = 1.1936e-02, PNorm = 517.6967, GNorm = 0.8816, lr_0 = 9.9760e-04
Validation binary_cross_entropy = 0.063601
Epoch 3787
Loss = 3.7668e-02, PNorm = 517.7350, GNorm = 0.1518, lr_0 = 9.9760e-04
Validation binary_cross_entropy = 0.068067
Epoch 3788
Loss = 1.2948e-02, PNorm = 517.7649, GNorm = 0.0991, lr_0 = 9.9759e-04
Validation binary_cross_entropy = 0.079316
Epoch 3789
Loss = 2.5023e-03, PNorm = 517.7904, GNorm = 0.1832, lr_0 = 9.9759e-04
Loss = 7.6161e-03, PNorm = 517.8086, GNorm = 0.0603, lr_0 = 9.9759e-04
Validation binary_cross_entropy = 0.078682
Epoch 3790
Loss = 2.4004e-02, PNorm = 517.8366, GNorm = 0.8436, lr_0 = 9.9759e-04
Validation binary_cross_entropy = 0.075253
Epoch 3791
Loss = 4.9212e-02, PNorm = 517.8897, GNorm = 0.0817, lr_0 = 9.9759e-04
Validation binary_cross_entropy = 0.070654
Epoch 3792
Loss = 2.2017e-02, PNorm = 517.9346, GNorm = 0.4557, lr_0 = 9.9759e-04
Validation binary_cross_entropy = 0.066368
Epoch 3793
Loss = 2.4265e-02, PNorm = 517.9632, GNorm = 0.2143, lr_0 = 9.9759e-04
Validation binary_cross_entropy = 0.057782
Epoch 3794
Loss = 1.8312e-02, PNorm = 518.0193, GNorm = 0.1058, lr_0 = 9.9759e-04
Validation binary_cross_entropy = 0.071739
Epoch 3795
Loss = 3.7453e-02, PNorm = 518.0813, GNorm = 0.0790, lr_0 = 9.9759e-04
Validation binary_cross_entropy = 0.076633
Epoch 3796
Loss = 5.8713e-03, PNorm = 518.1264, GNorm = 0.0084, lr_0 = 9.9759e-04
Validation binary_cross_entropy = 0.060955
Epoch 3797
Loss = 2.1823e-02, PNorm = 518.1678, GNorm = 0.2591, lr_0 = 9.9759e-04
Validation binary_cross_entropy = 0.072698
Epoch 3798
Loss = 6.1755e-03, PNorm = 518.2119, GNorm = 0.1615, lr_0 = 9.9759e-04
Validation binary_cross_entropy = 0.065420
Epoch 3799
Loss = 6.8451e-04, PNorm = 518.2366, GNorm = 0.0180, lr_0 = 9.9759e-04
Loss = 3.0067e-02, PNorm = 518.2700, GNorm = 0.2591, lr_0 = 9.9759e-04
Validation binary_cross_entropy = 0.059176
Epoch 3800
Loss = 1.6172e-02, PNorm = 518.3249, GNorm = 0.4193, lr_0 = 9.9758e-04
Validation binary_cross_entropy = 0.061277
Epoch 3801
Loss = 4.2970e-03, PNorm = 518.3829, GNorm = 0.1127, lr_0 = 9.9758e-04
Validation binary_cross_entropy = 0.078273
Epoch 3802
Loss = 8.1502e-02, PNorm = 518.4197, GNorm = 0.8637, lr_0 = 9.9758e-04
Validation binary_cross_entropy = 0.075832
Epoch 3803
Loss = 5.1692e-02, PNorm = 518.4504, GNorm = 0.5767, lr_0 = 9.9758e-04
Validation binary_cross_entropy = 0.065418
Epoch 3804
Loss = 2.1311e-02, PNorm = 518.4875, GNorm = 0.4659, lr_0 = 9.9758e-04
Validation binary_cross_entropy = 0.061836
Epoch 3805
Loss = 1.3201e-02, PNorm = 518.5309, GNorm = 0.3993, lr_0 = 9.9758e-04
Validation binary_cross_entropy = 0.064971
Epoch 3806
Loss = 6.2062e-02, PNorm = 518.5718, GNorm = 0.1063, lr_0 = 9.9758e-04
Validation binary_cross_entropy = 0.063998
Epoch 3807
Loss = 3.1346e-02, PNorm = 518.6303, GNorm = 0.6627, lr_0 = 9.9758e-04
Validation binary_cross_entropy = 0.064977
Epoch 3808
Loss = 1.9095e-03, PNorm = 518.6807, GNorm = 0.0859, lr_0 = 9.9758e-04
Validation binary_cross_entropy = 0.083703
Epoch 3809
Loss = 1.3250e-02, PNorm = 518.7308, GNorm = 0.6526, lr_0 = 9.9758e-04
Loss = 1.0825e-02, PNorm = 518.7742, GNorm = 0.0239, lr_0 = 9.9758e-04
Validation binary_cross_entropy = 0.152267
Epoch 3810
Loss = 1.1969e-03, PNorm = 518.8032, GNorm = 0.0582, lr_0 = 9.9758e-04
Validation binary_cross_entropy = 0.225338
Epoch 3811
Loss = 3.2706e-03, PNorm = 518.8207, GNorm = 0.0172, lr_0 = 9.9758e-04
Validation binary_cross_entropy = 0.311133
Epoch 3812
Loss = 4.0762e-02, PNorm = 518.8286, GNorm = 3.6273, lr_0 = 9.9758e-04
Validation binary_cross_entropy = 0.088030
Epoch 3813
Loss = 1.2976e-01, PNorm = 518.8988, GNorm = 5.8207, lr_0 = 9.9758e-04
Validation binary_cross_entropy = 0.057557
Epoch 3814
Loss = 1.8534e-02, PNorm = 518.9987, GNorm = 0.6759, lr_0 = 9.9757e-04
Validation binary_cross_entropy = 0.056556
Epoch 3815
Loss = 3.7804e-02, PNorm = 519.0615, GNorm = 2.2927, lr_0 = 9.9757e-04
Validation binary_cross_entropy = 0.052166
Epoch 3816
Loss = 3.2405e-02, PNorm = 519.1127, GNorm = 0.3269, lr_0 = 9.9757e-04
Validation binary_cross_entropy = 0.050900
Epoch 3817
Loss = 1.4856e-02, PNorm = 519.1890, GNorm = 0.2019, lr_0 = 9.9757e-04
Validation binary_cross_entropy = 0.056768
Epoch 3818
Loss = 5.8543e-03, PNorm = 519.2532, GNorm = 0.3054, lr_0 = 9.9757e-04
Validation binary_cross_entropy = 0.073447
Epoch 3819
Loss = 2.1660e-03, PNorm = 519.3005, GNorm = 0.1576, lr_0 = 9.9757e-04
Loss = 3.1226e-02, PNorm = 519.3558, GNorm = 0.6970, lr_0 = 9.9757e-04
Validation binary_cross_entropy = 0.063054
Epoch 3820
Loss = 4.0606e-02, PNorm = 519.4170, GNorm = 0.2355, lr_0 = 9.9757e-04
Validation binary_cross_entropy = 0.060040
Epoch 3821
Loss = 1.2858e-02, PNorm = 519.4682, GNorm = 0.4457, lr_0 = 9.9757e-04
Validation binary_cross_entropy = 0.053926
Epoch 3822
Loss = 1.2678e-02, PNorm = 519.5311, GNorm = 0.0489, lr_0 = 9.9757e-04
Validation binary_cross_entropy = 0.053155
Epoch 3823
Loss = 3.0938e-02, PNorm = 519.6146, GNorm = 1.2635, lr_0 = 9.9757e-04
Validation binary_cross_entropy = 0.064035
Epoch 3824
Loss = 6.7968e-03, PNorm = 519.6663, GNorm = 0.1872, lr_0 = 9.9757e-04
Validation binary_cross_entropy = 0.060719
Epoch 3825
Loss = 5.8124e-02, PNorm = 519.7025, GNorm = 4.9395, lr_0 = 9.9757e-04
Validation binary_cross_entropy = 0.064356
Epoch 3826
Loss = 2.0878e-02, PNorm = 519.7674, GNorm = 0.1970, lr_0 = 9.9757e-04
Validation binary_cross_entropy = 0.073575
Epoch 3827
Loss = 1.1980e-02, PNorm = 519.8266, GNorm = 0.6688, lr_0 = 9.9756e-04
Validation binary_cross_entropy = 0.087270
Epoch 3828
Loss = 4.5889e-02, PNorm = 519.8689, GNorm = 0.0179, lr_0 = 9.9756e-04
Validation binary_cross_entropy = 0.069521
Epoch 3829
Loss = 1.5528e-03, PNorm = 519.9244, GNorm = 0.0509, lr_0 = 9.9756e-04
Loss = 4.3532e-02, PNorm = 519.9844, GNorm = 1.4057, lr_0 = 9.9756e-04
Validation binary_cross_entropy = 0.061818
Epoch 3830
Loss = 1.4932e-02, PNorm = 520.0416, GNorm = 0.1341, lr_0 = 9.9756e-04
Validation binary_cross_entropy = 0.060891
Epoch 3831
Loss = 4.3915e-02, PNorm = 520.0983, GNorm = 0.2601, lr_0 = 9.9756e-04
Validation binary_cross_entropy = 0.063163
Epoch 3832
Loss = 2.3811e-02, PNorm = 520.1540, GNorm = 2.7231, lr_0 = 9.9756e-04
Validation binary_cross_entropy = 0.072350
Epoch 3833
Loss = 2.3655e-02, PNorm = 520.2144, GNorm = 0.0141, lr_0 = 9.9756e-04
Validation binary_cross_entropy = 0.064890
Epoch 3834
Loss = 2.7999e-02, PNorm = 520.2631, GNorm = 0.7392, lr_0 = 9.9756e-04
Validation binary_cross_entropy = 0.064835
Epoch 3835
Loss = 1.2217e-02, PNorm = 520.3319, GNorm = 1.1855, lr_0 = 9.9756e-04
Validation binary_cross_entropy = 0.080737
Epoch 3836
Loss = 3.0805e-02, PNorm = 520.3817, GNorm = 1.1270, lr_0 = 9.9756e-04
Validation binary_cross_entropy = 0.079585
Epoch 3837
Loss = 4.2604e-03, PNorm = 520.4258, GNorm = 0.0180, lr_0 = 9.9756e-04
Validation binary_cross_entropy = 0.095494
Epoch 3838
Loss = 2.7168e-03, PNorm = 520.4779, GNorm = 0.2444, lr_0 = 9.9756e-04
Validation binary_cross_entropy = 0.095681
Epoch 3839
Loss = 2.3317e-04, PNorm = 520.5300, GNorm = 0.0147, lr_0 = 9.9756e-04
Loss = 1.3962e-02, PNorm = 520.5754, GNorm = 1.3882, lr_0 = 9.9756e-04
Validation binary_cross_entropy = 0.111744
Epoch 3840
Loss = 2.6592e-02, PNorm = 520.6282, GNorm = 4.3386, lr_0 = 9.9755e-04
Validation binary_cross_entropy = 0.135450
Epoch 3841
Loss = 1.0099e-01, PNorm = 520.7077, GNorm = 0.8830, lr_0 = 9.9755e-04
Validation binary_cross_entropy = 0.049406
Epoch 3842
Loss = 5.5789e-02, PNorm = 520.8397, GNorm = 0.7816, lr_0 = 9.9755e-04
Validation binary_cross_entropy = 0.058267
Epoch 3843
Loss = 2.1333e-02, PNorm = 520.9643, GNorm = 0.3881, lr_0 = 9.9755e-04
Validation binary_cross_entropy = 0.072890
Epoch 3844
Loss = 9.8573e-03, PNorm = 521.0362, GNorm = 0.0771, lr_0 = 9.9755e-04
Validation binary_cross_entropy = 0.078181
Epoch 3845
Loss = 5.6198e-03, PNorm = 521.0813, GNorm = 0.4623, lr_0 = 9.9755e-04
Validation binary_cross_entropy = 0.083180
Epoch 3846
Loss = 1.5387e-02, PNorm = 521.1171, GNorm = 1.2589, lr_0 = 9.9755e-04
Validation binary_cross_entropy = 0.085298
Epoch 3847
Loss = 9.1251e-04, PNorm = 521.1530, GNorm = 0.0156, lr_0 = 9.9755e-04
Validation binary_cross_entropy = 0.076070
Epoch 3848
Loss = 6.7032e-02, PNorm = 521.1836, GNorm = 1.7377, lr_0 = 9.9755e-04
Validation binary_cross_entropy = 0.072823
Epoch 3849
Loss = 1.7052e-02, PNorm = 521.2656, GNorm = 0.7964, lr_0 = 9.9755e-04
Loss = 4.4577e-02, PNorm = 521.3392, GNorm = 4.2008, lr_0 = 9.9755e-04
Validation binary_cross_entropy = 0.072850
Epoch 3850
Loss = 1.7390e-02, PNorm = 521.4282, GNorm = 1.5069, lr_0 = 9.9755e-04
Validation binary_cross_entropy = 0.073912
Epoch 3851
Loss = 2.3324e-02, PNorm = 521.5535, GNorm = 0.0195, lr_0 = 9.9755e-04
Validation binary_cross_entropy = 0.091440
Epoch 3852
Loss = 2.7184e-02, PNorm = 521.6296, GNorm = 1.1121, lr_0 = 9.9755e-04
Validation binary_cross_entropy = 0.080722
Epoch 3853
Loss = 8.5606e-02, PNorm = 521.6873, GNorm = 2.3231, lr_0 = 9.9754e-04
Validation binary_cross_entropy = 0.060363
Epoch 3854
Loss = 3.8562e-02, PNorm = 521.7663, GNorm = 1.1503, lr_0 = 9.9754e-04
Validation binary_cross_entropy = 0.059143
Epoch 3855
Loss = 1.9326e-02, PNorm = 521.8522, GNorm = 0.0815, lr_0 = 9.9754e-04
Validation binary_cross_entropy = 0.061591
Epoch 3856
Loss = 2.5141e-02, PNorm = 521.9158, GNorm = 0.2764, lr_0 = 9.9754e-04
Validation binary_cross_entropy = 0.058788
Epoch 3857
Loss = 2.6420e-02, PNorm = 521.9693, GNorm = 0.3569, lr_0 = 9.9754e-04
Validation binary_cross_entropy = 0.062469
Epoch 3858
Loss = 3.8637e-02, PNorm = 522.0253, GNorm = 0.7352, lr_0 = 9.9754e-04
Validation binary_cross_entropy = 0.068288
Epoch 3859
Loss = 1.9030e-02, PNorm = 522.0785, GNorm = 0.5552, lr_0 = 9.9754e-04
Loss = 2.5121e-02, PNorm = 522.1211, GNorm = 0.5590, lr_0 = 9.9754e-04
Validation binary_cross_entropy = 0.074745
Epoch 3860
Loss = 3.8700e-02, PNorm = 522.1484, GNorm = 0.3092, lr_0 = 9.9754e-04
Validation binary_cross_entropy = 0.067704
Epoch 3861
Loss = 5.5030e-02, PNorm = 522.2101, GNorm = 0.9910, lr_0 = 9.9754e-04
Validation binary_cross_entropy = 0.065094
Epoch 3862
Loss = 1.5119e-02, PNorm = 522.2876, GNorm = 0.1399, lr_0 = 9.9754e-04
Validation binary_cross_entropy = 0.069136
Epoch 3863
Loss = 1.5596e-02, PNorm = 522.3463, GNorm = 0.0310, lr_0 = 9.9754e-04
Validation binary_cross_entropy = 0.069243
Epoch 3864
Loss = 2.3487e-02, PNorm = 522.3794, GNorm = 0.3291, lr_0 = 9.9754e-04
Validation binary_cross_entropy = 0.063439
Epoch 3865
Loss = 9.8331e-03, PNorm = 522.4272, GNorm = 0.5661, lr_0 = 9.9754e-04
Validation binary_cross_entropy = 0.068340
Epoch 3866
Loss = 4.6954e-02, PNorm = 522.4753, GNorm = 1.1488, lr_0 = 9.9754e-04
Validation binary_cross_entropy = 0.078887
Epoch 3867
Loss = 1.0558e-02, PNorm = 522.5196, GNorm = 0.7410, lr_0 = 9.9753e-04
Validation binary_cross_entropy = 0.089374
Epoch 3868
Loss = 1.2592e-02, PNorm = 522.5633, GNorm = 0.4141, lr_0 = 9.9753e-04
Validation binary_cross_entropy = 0.091191
Epoch 3869
Loss = 1.3956e-03, PNorm = 522.6020, GNorm = 0.1223, lr_0 = 9.9753e-04
Loss = 1.3084e-02, PNorm = 522.6301, GNorm = 0.3130, lr_0 = 9.9753e-04
Validation binary_cross_entropy = 0.084967
Epoch 3870
Loss = 1.8556e-02, PNorm = 522.6842, GNorm = 1.1387, lr_0 = 9.9753e-04
Validation binary_cross_entropy = 0.115324
Epoch 3871
Loss = 2.2280e-02, PNorm = 522.7518, GNorm = 0.0114, lr_0 = 9.9753e-04
Validation binary_cross_entropy = 0.138660
Epoch 3872
Loss = 1.9661e-02, PNorm = 522.8000, GNorm = 5.8142, lr_0 = 9.9753e-04
Validation binary_cross_entropy = 0.116568
Epoch 3873
Loss = 2.1870e-02, PNorm = 522.8471, GNorm = 0.3570, lr_0 = 9.9753e-04
Validation binary_cross_entropy = 0.086725
Epoch 3874
Loss = 2.8974e-02, PNorm = 522.8847, GNorm = 0.1330, lr_0 = 9.9753e-04
Validation binary_cross_entropy = 0.069393
Epoch 3875
Loss = 1.3113e-02, PNorm = 522.9534, GNorm = 0.3829, lr_0 = 9.9753e-04
Validation binary_cross_entropy = 0.092281
Epoch 3876
Loss = 8.4497e-02, PNorm = 523.0263, GNorm = 2.3696, lr_0 = 9.9753e-04
Validation binary_cross_entropy = 0.071140
Epoch 3877
Loss = 1.7568e-02, PNorm = 523.0939, GNorm = 0.1838, lr_0 = 9.9753e-04
Validation binary_cross_entropy = 0.079637
Epoch 3878
Loss = 1.1245e-02, PNorm = 523.1988, GNorm = 0.2142, lr_0 = 9.9753e-04
Validation binary_cross_entropy = 0.082124
Epoch 3879
Loss = 1.6562e-03, PNorm = 523.2919, GNorm = 0.0488, lr_0 = 9.9753e-04
Loss = 4.4621e-02, PNorm = 523.3672, GNorm = 0.1886, lr_0 = 9.9753e-04
Validation binary_cross_entropy = 0.068756
Epoch 3880
Loss = 5.9017e-02, PNorm = 523.4748, GNorm = 0.6717, lr_0 = 9.9752e-04
Validation binary_cross_entropy = 0.092061
Epoch 3881
Loss = 2.4931e-02, PNorm = 523.5532, GNorm = 0.3127, lr_0 = 9.9752e-04
Validation binary_cross_entropy = 0.072905
Epoch 3882
Loss = 6.1203e-02, PNorm = 523.6573, GNorm = 0.3375, lr_0 = 9.9752e-04
Validation binary_cross_entropy = 0.071221
Epoch 3883
Loss = 1.6669e-02, PNorm = 523.7701, GNorm = 0.4420, lr_0 = 9.9752e-04
Validation binary_cross_entropy = 0.094198
Epoch 3884
Loss = 3.0232e-02, PNorm = 523.8338, GNorm = 1.3275, lr_0 = 9.9752e-04
Validation binary_cross_entropy = 0.072147
Epoch 3885
Loss = 1.5890e-02, PNorm = 523.8744, GNorm = 1.0043, lr_0 = 9.9752e-04
Validation binary_cross_entropy = 0.073998
Epoch 3886
Loss = 1.5990e-02, PNorm = 523.9283, GNorm = 0.8227, lr_0 = 9.9752e-04
Validation binary_cross_entropy = 0.079384
Epoch 3887
Loss = 3.2189e-02, PNorm = 523.9791, GNorm = 0.2690, lr_0 = 9.9752e-04
Validation binary_cross_entropy = 0.117402
Epoch 3888
Loss = 1.1471e-02, PNorm = 524.0315, GNorm = 0.0372, lr_0 = 9.9752e-04
Validation binary_cross_entropy = 0.078805
Epoch 3889
Loss = 6.2042e-03, PNorm = 524.0736, GNorm = 0.2395, lr_0 = 9.9752e-04
Loss = 5.3923e-02, PNorm = 524.1631, GNorm = 2.5825, lr_0 = 9.9752e-04
Validation binary_cross_entropy = 0.063997
Epoch 3890
Loss = 3.4131e-02, PNorm = 524.3115, GNorm = 0.3471, lr_0 = 9.9752e-04
Validation binary_cross_entropy = 0.059778
Epoch 3891
Loss = 4.2921e-02, PNorm = 524.4365, GNorm = 0.3167, lr_0 = 9.9752e-04
Validation binary_cross_entropy = 0.064374
Epoch 3892
Loss = 4.9156e-02, PNorm = 524.5210, GNorm = 2.0501, lr_0 = 9.9752e-04
Validation binary_cross_entropy = 0.057086
Epoch 3893
Loss = 1.8504e-02, PNorm = 524.6087, GNorm = 1.1268, lr_0 = 9.9751e-04
Validation binary_cross_entropy = 0.102002
Epoch 3894
Loss = 4.2345e-02, PNorm = 524.6757, GNorm = 1.8328, lr_0 = 9.9751e-04
Validation binary_cross_entropy = 0.088985
Epoch 3895
Loss = 1.7335e-02, PNorm = 524.7167, GNorm = 2.5129, lr_0 = 9.9751e-04
Validation binary_cross_entropy = 0.069049
Epoch 3896
Loss = 1.3417e-02, PNorm = 524.7653, GNorm = 1.9358, lr_0 = 9.9751e-04
Validation binary_cross_entropy = 0.082534
Epoch 3897
Loss = 6.7556e-02, PNorm = 524.8227, GNorm = 0.8041, lr_0 = 9.9751e-04
Validation binary_cross_entropy = 0.075102
Epoch 3898
Loss = 9.0717e-03, PNorm = 524.8698, GNorm = 0.4135, lr_0 = 9.9751e-04
Validation binary_cross_entropy = 0.075948
Epoch 3899
Loss = 2.6403e-03, PNorm = 524.9082, GNorm = 0.1166, lr_0 = 9.9751e-04
Loss = 1.5792e-02, PNorm = 524.9616, GNorm = 0.2631, lr_0 = 9.9751e-04
Validation binary_cross_entropy = 0.075692
Epoch 3900
Loss = 2.2932e-02, PNorm = 525.0193, GNorm = 0.0642, lr_0 = 9.9751e-04
Validation binary_cross_entropy = 0.089492
Epoch 3901
Loss = 1.0159e-02, PNorm = 525.0605, GNorm = 0.2864, lr_0 = 9.9751e-04
Validation binary_cross_entropy = 0.084608
Epoch 3902
Loss = 1.9585e-03, PNorm = 525.0926, GNorm = 0.0705, lr_0 = 9.9751e-04
Validation binary_cross_entropy = 0.088939
Epoch 3903
Loss = 1.7160e-02, PNorm = 525.1236, GNorm = 0.0183, lr_0 = 9.9751e-04
Validation binary_cross_entropy = 0.099790
Epoch 3904
Loss = 2.7253e-02, PNorm = 525.1743, GNorm = 0.1503, lr_0 = 9.9751e-04
Validation binary_cross_entropy = 0.109233
Epoch 3905
Loss = 2.0291e-02, PNorm = 525.2209, GNorm = 0.7674, lr_0 = 9.9751e-04
Validation binary_cross_entropy = 0.093720
Epoch 3906
Loss = 7.5569e-02, PNorm = 525.2945, GNorm = 2.7230, lr_0 = 9.9751e-04
Validation binary_cross_entropy = 0.219425
Epoch 3907
Loss = 1.4106e-01, PNorm = 525.4026, GNorm = 4.3853, lr_0 = 9.9750e-04
Validation binary_cross_entropy = 0.090987
Epoch 3908
Loss = 1.5598e-02, PNorm = 525.4687, GNorm = 0.2421, lr_0 = 9.9750e-04
Validation binary_cross_entropy = 0.067525
Epoch 3909
Loss = 2.3650e-02, PNorm = 525.5303, GNorm = 1.3360, lr_0 = 9.9750e-04
Loss = 4.5506e-02, PNorm = 525.6085, GNorm = 1.7685, lr_0 = 9.9750e-04
Validation binary_cross_entropy = 0.078472
Epoch 3910
Loss = 3.0439e-02, PNorm = 525.6829, GNorm = 0.0632, lr_0 = 9.9750e-04
Validation binary_cross_entropy = 0.079668
Epoch 3911
Loss = 3.9504e-02, PNorm = 525.7329, GNorm = 0.0374, lr_0 = 9.9750e-04
Validation binary_cross_entropy = 0.076359
Epoch 3912
Loss = 1.8754e-02, PNorm = 525.7767, GNorm = 0.6481, lr_0 = 9.9750e-04
Validation binary_cross_entropy = 0.075048
Epoch 3913
Loss = 7.1658e-03, PNorm = 525.8262, GNorm = 0.0750, lr_0 = 9.9750e-04
Validation binary_cross_entropy = 0.086089
Epoch 3914
Loss = 1.9829e-02, PNorm = 525.8766, GNorm = 0.4965, lr_0 = 9.9750e-04
Validation binary_cross_entropy = 0.074277
Epoch 3915
Loss = 6.7311e-02, PNorm = 525.9434, GNorm = 3.6271, lr_0 = 9.9750e-04
Validation binary_cross_entropy = 0.072132
Epoch 3916
Loss = 6.0164e-03, PNorm = 526.0208, GNorm = 0.0782, lr_0 = 9.9750e-04
Validation binary_cross_entropy = 0.078544
Epoch 3917
Loss = 1.3133e-02, PNorm = 526.0764, GNorm = 0.5000, lr_0 = 9.9750e-04
Validation binary_cross_entropy = 0.068816
Epoch 3918
Loss = 4.0096e-02, PNorm = 526.1225, GNorm = 0.2633, lr_0 = 9.9750e-04
Validation binary_cross_entropy = 0.075976
Epoch 3919
Loss = 9.5632e-03, PNorm = 526.1818, GNorm = 0.5603, lr_0 = 9.9750e-04
Loss = 4.1666e-02, PNorm = 526.2383, GNorm = 0.4019, lr_0 = 9.9749e-04
Validation binary_cross_entropy = 0.064082
Epoch 3920
Loss = 1.4160e-02, PNorm = 526.3204, GNorm = 0.0567, lr_0 = 9.9749e-04
Validation binary_cross_entropy = 0.077286
Epoch 3921
Loss = 2.6055e-02, PNorm = 526.3909, GNorm = 0.9676, lr_0 = 9.9749e-04
Validation binary_cross_entropy = 0.087422
Epoch 3922
Loss = 4.8300e-02, PNorm = 526.4509, GNorm = 0.8075, lr_0 = 9.9749e-04
Validation binary_cross_entropy = 0.079024
Epoch 3923
Loss = 1.5617e-02, PNorm = 526.5170, GNorm = 0.4105, lr_0 = 9.9749e-04
Validation binary_cross_entropy = 0.087300
Epoch 3924
Loss = 7.0638e-03, PNorm = 526.5753, GNorm = 0.3072, lr_0 = 9.9749e-04
Validation binary_cross_entropy = 0.106605
Epoch 3925
Loss = 5.8938e-02, PNorm = 526.6181, GNorm = 1.0181, lr_0 = 9.9749e-04
Validation binary_cross_entropy = 0.083578
Epoch 3926
Loss = 3.5573e-01, PNorm = 526.6842, GNorm = 36.3772, lr_0 = 9.9749e-04
Validation binary_cross_entropy = 0.064026
Epoch 3927
Loss = 1.2296e-02, PNorm = 526.7973, GNorm = 1.0929, lr_0 = 9.9749e-04
Validation binary_cross_entropy = 0.080947
Epoch 3928
Loss = 6.7426e-02, PNorm = 526.9126, GNorm = 0.3335, lr_0 = 9.9749e-04
Validation binary_cross_entropy = 0.078012
Epoch 3929
Loss = 1.6614e-01, PNorm = 527.0107, GNorm = 4.0368, lr_0 = 9.9749e-04
Loss = 3.5397e-02, PNorm = 527.0987, GNorm = 0.3446, lr_0 = 9.9749e-04
Validation binary_cross_entropy = 0.082560
Epoch 3930
Loss = 4.2009e-02, PNorm = 527.1850, GNorm = 0.4655, lr_0 = 9.9749e-04
Validation binary_cross_entropy = 0.066277
Epoch 3931
Loss = 2.7346e-02, PNorm = 527.2699, GNorm = 2.7269, lr_0 = 9.9749e-04
Validation binary_cross_entropy = 0.081414
Epoch 3932
Loss = 2.4580e-02, PNorm = 527.3520, GNorm = 1.0166, lr_0 = 9.9749e-04
Validation binary_cross_entropy = 0.066401
Epoch 3933
Loss = 1.7637e-02, PNorm = 527.4294, GNorm = 0.3792, lr_0 = 9.9748e-04
Validation binary_cross_entropy = 0.075398
Epoch 3934
Loss = 9.1673e-02, PNorm = 527.5016, GNorm = 0.7083, lr_0 = 9.9748e-04
Validation binary_cross_entropy = 0.056833
Epoch 3935
Loss = 1.6724e-01, PNorm = 527.5936, GNorm = 0.7405, lr_0 = 9.9748e-04
Validation binary_cross_entropy = 0.076606
Epoch 3936
Loss = 5.9174e-02, PNorm = 527.7051, GNorm = 0.8275, lr_0 = 9.9748e-04
Validation binary_cross_entropy = 0.051701
Epoch 3937
Loss = 4.1158e-02, PNorm = 527.8034, GNorm = 1.6040, lr_0 = 9.9748e-04
Validation binary_cross_entropy = 0.073471
Epoch 3938
Loss = 1.0026e-02, PNorm = 527.9015, GNorm = 0.6275, lr_0 = 9.9748e-04
Validation binary_cross_entropy = 0.066585
Epoch 3939
Loss = 1.8076e-01, PNorm = 528.0055, GNorm = 3.7373, lr_0 = 9.9748e-04
Loss = 5.2534e-02, PNorm = 528.1269, GNorm = 0.5229, lr_0 = 9.9748e-04
Validation binary_cross_entropy = 0.106434
Epoch 3940
Loss = 3.1827e-02, PNorm = 528.2404, GNorm = 0.3759, lr_0 = 9.9748e-04
Validation binary_cross_entropy = 0.084380
Epoch 3941
Loss = 9.3671e-02, PNorm = 528.3255, GNorm = 1.5900, lr_0 = 9.9748e-04
Validation binary_cross_entropy = 0.057494
Epoch 3942
Loss = 6.2870e-02, PNorm = 528.4165, GNorm = 0.4697, lr_0 = 9.9748e-04
Validation binary_cross_entropy = 0.078145
Epoch 3943
Loss = 1.8842e-02, PNorm = 528.4972, GNorm = 0.5453, lr_0 = 9.9748e-04
Validation binary_cross_entropy = 0.079798
Epoch 3944
Loss = 1.0912e-02, PNorm = 528.5544, GNorm = 0.1148, lr_0 = 9.9748e-04
Validation binary_cross_entropy = 0.079323
Epoch 3945
Loss = 7.9120e-03, PNorm = 528.6101, GNorm = 0.0272, lr_0 = 9.9748e-04
Validation binary_cross_entropy = 0.104208
Epoch 3946
Loss = 4.1005e-02, PNorm = 528.6557, GNorm = 0.8835, lr_0 = 9.9747e-04
Validation binary_cross_entropy = 0.086547
Epoch 3947
Loss = 7.7718e-02, PNorm = 528.6934, GNorm = 0.3314, lr_0 = 9.9747e-04
Validation binary_cross_entropy = 0.069822
Epoch 3948
Loss = 1.4039e-02, PNorm = 528.7501, GNorm = 0.5257, lr_0 = 9.9747e-04
Validation binary_cross_entropy = 0.072792
Epoch 3949
Loss = 6.9359e-03, PNorm = 528.8144, GNorm = 0.2112, lr_0 = 9.9747e-04
Loss = 1.1282e-02, PNorm = 528.8704, GNorm = 0.9024, lr_0 = 9.9747e-04
Validation binary_cross_entropy = 0.072254
Epoch 3950
Loss = 1.8062e-02, PNorm = 528.9232, GNorm = 0.0486, lr_0 = 9.9747e-04
Validation binary_cross_entropy = 0.085406
Epoch 3951
Loss = 4.2171e-02, PNorm = 528.9819, GNorm = 0.5096, lr_0 = 9.9747e-04
Validation binary_cross_entropy = 0.066940
Epoch 3952
Loss = 2.2954e-02, PNorm = 529.0644, GNorm = 3.0551, lr_0 = 9.9747e-04
Validation binary_cross_entropy = 0.073330
Epoch 3953
Loss = 2.0602e-02, PNorm = 529.1275, GNorm = 0.4586, lr_0 = 9.9747e-04
Validation binary_cross_entropy = 0.072692
Epoch 3954
Loss = 8.3775e-03, PNorm = 529.1874, GNorm = 0.1940, lr_0 = 9.9747e-04
Validation binary_cross_entropy = 0.077794
Epoch 3955
Loss = 2.5300e-02, PNorm = 529.2442, GNorm = 1.8016, lr_0 = 9.9747e-04
Validation binary_cross_entropy = 0.085936
Epoch 3956
Loss = 1.5701e-02, PNorm = 529.2965, GNorm = 0.0430, lr_0 = 9.9747e-04
Validation binary_cross_entropy = 0.097556
Epoch 3957
Loss = 8.4222e-03, PNorm = 529.3476, GNorm = 1.0651, lr_0 = 9.9747e-04
Validation binary_cross_entropy = 0.085167
Epoch 3958
Loss = 4.9254e-02, PNorm = 529.3967, GNorm = 2.7309, lr_0 = 9.9747e-04
Validation binary_cross_entropy = 0.075820
Epoch 3959
Loss = 6.9985e-03, PNorm = 529.4379, GNorm = 0.2993, lr_0 = 9.9747e-04
Loss = 3.7823e-02, PNorm = 529.5043, GNorm = 1.4957, lr_0 = 9.9746e-04
Validation binary_cross_entropy = 0.088308
Epoch 3960
Loss = 4.3438e-02, PNorm = 529.5672, GNorm = 0.7847, lr_0 = 9.9746e-04
Validation binary_cross_entropy = 0.067794
Epoch 3961
Loss = 3.2033e-02, PNorm = 529.6515, GNorm = 0.0489, lr_0 = 9.9746e-04
Validation binary_cross_entropy = 0.080680
Epoch 3962
Loss = 2.6792e-02, PNorm = 529.7234, GNorm = 0.3487, lr_0 = 9.9746e-04
Validation binary_cross_entropy = 0.068153
Epoch 3963
Loss = 3.9380e-02, PNorm = 529.7837, GNorm = 0.7991, lr_0 = 9.9746e-04
Validation binary_cross_entropy = 0.069813
Epoch 3964
Loss = 9.2288e-02, PNorm = 529.8364, GNorm = 0.4357, lr_0 = 9.9746e-04
Validation binary_cross_entropy = 0.057153
Epoch 3965
Loss = 4.5574e-02, PNorm = 529.9242, GNorm = 0.7391, lr_0 = 9.9746e-04
Validation binary_cross_entropy = 0.069323
Epoch 3966
Loss = 7.0192e-02, PNorm = 530.0062, GNorm = 1.8487, lr_0 = 9.9746e-04
Validation binary_cross_entropy = 0.066842
Epoch 3967
Loss = 6.2389e-03, PNorm = 530.0848, GNorm = 0.8472, lr_0 = 9.9746e-04
Validation binary_cross_entropy = 0.069382
Epoch 3968
Loss = 4.5013e-02, PNorm = 530.1416, GNorm = 1.0943, lr_0 = 9.9746e-04
Validation binary_cross_entropy = 0.079186
Epoch 3969
Loss = 2.7555e-03, PNorm = 530.1912, GNorm = 0.2010, lr_0 = 9.9746e-04
Loss = 2.0646e-02, PNorm = 530.2371, GNorm = 0.5597, lr_0 = 9.9746e-04
Validation binary_cross_entropy = 0.076690
Epoch 3970
Loss = 8.0509e-02, PNorm = 530.2905, GNorm = 9.0711, lr_0 = 9.9746e-04
Validation binary_cross_entropy = 0.087922
Epoch 3971
Loss = 7.3170e-02, PNorm = 530.3839, GNorm = 1.9132, lr_0 = 9.9746e-04
Validation binary_cross_entropy = 0.076032
Epoch 3972
Loss = 8.7148e-02, PNorm = 530.4740, GNorm = 1.6727, lr_0 = 9.9746e-04
Validation binary_cross_entropy = 0.057052
Epoch 3973
Loss = 1.9795e-02, PNorm = 530.5785, GNorm = 0.1006, lr_0 = 9.9745e-04
Validation binary_cross_entropy = 0.089798
Epoch 3974
Loss = 5.9232e-02, PNorm = 530.6432, GNorm = 0.0212, lr_0 = 9.9745e-04
Validation binary_cross_entropy = 0.070344
Epoch 3975
Loss = 4.5326e-02, PNorm = 530.6926, GNorm = 1.3764, lr_0 = 9.9745e-04
Validation binary_cross_entropy = 0.066870
Epoch 3976
Loss = 1.3845e-01, PNorm = 530.7519, GNorm = 3.5967, lr_0 = 9.9745e-04
Validation binary_cross_entropy = 0.062186
Epoch 3977
Loss = 2.0645e-02, PNorm = 530.8484, GNorm = 0.2286, lr_0 = 9.9745e-04
Validation binary_cross_entropy = 0.059538
Epoch 3978
Loss = 6.8712e-02, PNorm = 530.9206, GNorm = 0.4612, lr_0 = 9.9745e-04
Validation binary_cross_entropy = 0.051130
Epoch 3979
Loss = 5.4116e-03, PNorm = 530.9980, GNorm = 0.1442, lr_0 = 9.9745e-04
Loss = 2.4512e-02, PNorm = 531.0686, GNorm = 2.4133, lr_0 = 9.9745e-04
Validation binary_cross_entropy = 0.075819
Epoch 3980
Loss = 1.5943e-02, PNorm = 531.1102, GNorm = 2.0829, lr_0 = 9.9745e-04
Validation binary_cross_entropy = 0.065202
Epoch 3981
Loss = 1.1335e-02, PNorm = 531.1567, GNorm = 1.9639, lr_0 = 9.9745e-04
Validation binary_cross_entropy = 0.069082
Epoch 3982
Loss = 1.0040e-02, PNorm = 531.1966, GNorm = 1.4954, lr_0 = 9.9745e-04
Validation binary_cross_entropy = 0.076500
Epoch 3983
Loss = 4.2506e-02, PNorm = 531.2394, GNorm = 0.2796, lr_0 = 9.9745e-04
Validation binary_cross_entropy = 0.066912
Epoch 3984
Loss = 2.2711e-02, PNorm = 531.3134, GNorm = 2.8278, lr_0 = 9.9745e-04
Validation binary_cross_entropy = 0.086033
Epoch 3985
Loss = 5.4970e-02, PNorm = 531.3825, GNorm = 1.2837, lr_0 = 9.9745e-04
Validation binary_cross_entropy = 0.068300
Epoch 3986
Loss = 1.4998e-02, PNorm = 531.4410, GNorm = 0.1389, lr_0 = 9.9744e-04
Validation binary_cross_entropy = 0.073368
Epoch 3987
Loss = 2.8726e-03, PNorm = 531.5022, GNorm = 0.0356, lr_0 = 9.9744e-04
Validation binary_cross_entropy = 0.096870
Epoch 3988
Loss = 2.6397e-02, PNorm = 531.5565, GNorm = 0.0182, lr_0 = 9.9744e-04
Validation binary_cross_entropy = 0.089266
Epoch 3989
Loss = 6.2670e-02, PNorm = 531.5995, GNorm = 1.9170, lr_0 = 9.9744e-04
Loss = 5.5306e-02, PNorm = 531.6797, GNorm = 3.4491, lr_0 = 9.9744e-04
Validation binary_cross_entropy = 0.053746
Epoch 3990
Loss = 2.5104e-02, PNorm = 531.8155, GNorm = 1.0432, lr_0 = 9.9744e-04
Validation binary_cross_entropy = 0.080084
Epoch 3991
Loss = 4.1814e-02, PNorm = 531.9020, GNorm = 0.1982, lr_0 = 9.9744e-04
Validation binary_cross_entropy = 0.065486
Epoch 3992
Loss = 1.8959e-02, PNorm = 531.9655, GNorm = 1.3717, lr_0 = 9.9744e-04
Validation binary_cross_entropy = 0.066481
Epoch 3993
Loss = 1.0734e-02, PNorm = 532.0329, GNorm = 0.9942, lr_0 = 9.9744e-04
Validation binary_cross_entropy = 0.072419
Epoch 3994
Loss = 4.1747e-02, PNorm = 532.0873, GNorm = 0.6906, lr_0 = 9.9744e-04
Validation binary_cross_entropy = 0.078325
Epoch 3995
Loss = 4.6533e-03, PNorm = 532.1265, GNorm = 0.2643, lr_0 = 9.9744e-04
Validation binary_cross_entropy = 0.074979
Epoch 3996
Loss = 4.8249e-03, PNorm = 532.1617, GNorm = 0.1753, lr_0 = 9.9744e-04
Validation binary_cross_entropy = 0.085072
Epoch 3997
Loss = 2.8846e-02, PNorm = 532.1895, GNorm = 0.3080, lr_0 = 9.9744e-04
Validation binary_cross_entropy = 0.097434
Epoch 3998
Loss = 1.3169e-02, PNorm = 532.2181, GNorm = 0.1216, lr_0 = 9.9744e-04
Validation binary_cross_entropy = 0.094204
Epoch 3999
Loss = 3.7618e-03, PNorm = 532.2474, GNorm = 0.1228, lr_0 = 9.9744e-04
Loss = 1.8771e-01, PNorm = 532.2822, GNorm = 2.0990, lr_0 = 9.9743e-04
Validation binary_cross_entropy = 0.073861
Epoch 4000
Loss = 1.7366e-02, PNorm = 532.3357, GNorm = 0.5265, lr_0 = 9.9743e-04
Validation binary_cross_entropy = 0.054750
Epoch 4001
Loss = 2.3006e-02, PNorm = 532.3959, GNorm = 0.2229, lr_0 = 9.9743e-04
Validation binary_cross_entropy = 0.060466
Epoch 4002
Loss = 3.8041e-02, PNorm = 532.4546, GNorm = 0.9172, lr_0 = 9.9743e-04
Validation binary_cross_entropy = 0.077511
Epoch 4003
Loss = 6.5563e-03, PNorm = 532.5012, GNorm = 0.4082, lr_0 = 9.9743e-04
Validation binary_cross_entropy = 0.082119
Epoch 4004
Loss = 4.1929e-02, PNorm = 532.5367, GNorm = 1.2361, lr_0 = 9.9743e-04
Validation binary_cross_entropy = 0.061976
Epoch 4005
Loss = 3.3128e-02, PNorm = 532.7193, GNorm = 0.7741, lr_0 = 9.9743e-04
Validation binary_cross_entropy = 0.084681
Epoch 4006
Loss = 8.1565e-02, PNorm = 532.9298, GNorm = 3.9388, lr_0 = 9.9743e-04
Validation binary_cross_entropy = 0.073164
Epoch 4007
Loss = 1.1728e-01, PNorm = 533.0798, GNorm = 0.6672, lr_0 = 9.9743e-04
Validation binary_cross_entropy = 0.050201
Epoch 4008
Loss = 3.5303e-02, PNorm = 533.1905, GNorm = 0.9280, lr_0 = 9.9743e-04
Validation binary_cross_entropy = 0.075730
Epoch 4009
Loss = 1.4633e-02, PNorm = 533.2725, GNorm = 0.8941, lr_0 = 9.9743e-04
Loss = 2.8556e-02, PNorm = 533.3343, GNorm = 0.5707, lr_0 = 9.9743e-04
Validation binary_cross_entropy = 0.050001
Epoch 4010
Loss = 4.9370e-02, PNorm = 533.3904, GNorm = 0.3068, lr_0 = 9.9743e-04
Validation binary_cross_entropy = 0.054751
Epoch 4011
Loss = 4.3092e-02, PNorm = 533.4453, GNorm = 1.8904, lr_0 = 9.9743e-04
Validation binary_cross_entropy = 0.073323
Epoch 4012
Loss = 1.5107e-02, PNorm = 533.4984, GNorm = 0.8200, lr_0 = 9.9742e-04
Validation binary_cross_entropy = 0.079441
Epoch 4013
Loss = 2.0512e-02, PNorm = 533.5405, GNorm = 0.1453, lr_0 = 9.9742e-04
Validation binary_cross_entropy = 0.071651
Epoch 4014
Loss = 3.0149e-02, PNorm = 533.5824, GNorm = 4.2735, lr_0 = 9.9742e-04
Validation binary_cross_entropy = 0.065957
Epoch 4015
Loss = 7.3290e-02, PNorm = 533.6318, GNorm = 1.4749, lr_0 = 9.9742e-04
Validation binary_cross_entropy = 0.067686
Epoch 4016
Loss = 2.4071e-02, PNorm = 533.7024, GNorm = 1.2417, lr_0 = 9.9742e-04
Validation binary_cross_entropy = 0.063600
Epoch 4017
Loss = 5.4277e-02, PNorm = 533.7530, GNorm = 2.2810, lr_0 = 9.9742e-04
Validation binary_cross_entropy = 0.059521
Epoch 4018
Loss = 1.1594e-02, PNorm = 533.7946, GNorm = 0.5251, lr_0 = 9.9742e-04
Validation binary_cross_entropy = 0.058636
Epoch 4019
Loss = 6.3888e-03, PNorm = 533.8416, GNorm = 0.2118, lr_0 = 9.9742e-04
Loss = 3.2217e-02, PNorm = 533.8862, GNorm = 0.0938, lr_0 = 9.9742e-04
Validation binary_cross_entropy = 0.062306
Epoch 4020
Loss = 1.7693e-02, PNorm = 533.9283, GNorm = 0.0725, lr_0 = 9.9742e-04
Validation binary_cross_entropy = 0.067314
Epoch 4021
Loss = 1.4610e-02, PNorm = 533.9683, GNorm = 0.4260, lr_0 = 9.9742e-04
Validation binary_cross_entropy = 0.075811
Epoch 4022
Loss = 2.2142e-02, PNorm = 533.9947, GNorm = 0.0683, lr_0 = 9.9742e-04
Validation binary_cross_entropy = 0.059086
Epoch 4023
Loss = 6.4476e-02, PNorm = 534.0567, GNorm = 0.2516, lr_0 = 9.9742e-04
Validation binary_cross_entropy = 0.083951
Epoch 4024
Loss = 4.9355e-02, PNorm = 534.1236, GNorm = 1.0892, lr_0 = 9.9742e-04
Validation binary_cross_entropy = 0.057686
Epoch 4025
Loss = 7.5880e-03, PNorm = 534.1647, GNorm = 0.2565, lr_0 = 9.9742e-04
Validation binary_cross_entropy = 0.052066
Epoch 4026
Loss = 1.0213e-01, PNorm = 534.2220, GNorm = 3.8184, lr_0 = 9.9741e-04
Validation binary_cross_entropy = 0.065207
Epoch 4027
Loss = 2.1808e-02, PNorm = 534.2955, GNorm = 0.5305, lr_0 = 9.9741e-04
Validation binary_cross_entropy = 0.058560
Epoch 4028
Loss = 6.9831e-03, PNorm = 534.3506, GNorm = 0.2677, lr_0 = 9.9741e-04
Validation binary_cross_entropy = 0.072784
Epoch 4029
Loss = 1.2435e-02, PNorm = 534.4031, GNorm = 1.1520, lr_0 = 9.9741e-04
Loss = 7.7867e-03, PNorm = 534.4394, GNorm = 0.5816, lr_0 = 9.9741e-04
Validation binary_cross_entropy = 0.087651
Epoch 4030
Loss = 4.9157e-03, PNorm = 534.4685, GNorm = 0.0231, lr_0 = 9.9741e-04
Validation binary_cross_entropy = 0.119369
Epoch 4031
Loss = 3.6707e-02, PNorm = 534.4934, GNorm = 1.3655, lr_0 = 9.9741e-04
Validation binary_cross_entropy = 0.071560
Epoch 4032
Loss = 6.1635e-02, PNorm = 534.5477, GNorm = 2.2113, lr_0 = 9.9741e-04
Validation binary_cross_entropy = 0.079764
Epoch 4033
Loss = 4.5534e-02, PNorm = 534.6339, GNorm = 1.9131, lr_0 = 9.9741e-04
Validation binary_cross_entropy = 0.103921
Epoch 4034
Loss = 4.5602e-02, PNorm = 534.6968, GNorm = 0.1958, lr_0 = 9.9741e-04
Validation binary_cross_entropy = 0.076317
Epoch 4035
Loss = 1.9014e-02, PNorm = 534.7507, GNorm = 1.2479, lr_0 = 9.9741e-04
Validation binary_cross_entropy = 0.079259
Epoch 4036
Loss = 3.4548e-02, PNorm = 534.8041, GNorm = 0.1571, lr_0 = 9.9741e-04
Validation binary_cross_entropy = 0.084809
Epoch 4037
Loss = 7.4977e-03, PNorm = 534.8435, GNorm = 0.3881, lr_0 = 9.9741e-04
Validation binary_cross_entropy = 0.092927
Epoch 4038
Loss = 5.1173e-03, PNorm = 534.8809, GNorm = 0.0069, lr_0 = 9.9741e-04
Validation binary_cross_entropy = 0.105374
Epoch 4039
Loss = 2.5729e-02, PNorm = 534.9108, GNorm = 1.2285, lr_0 = 9.9740e-04
Loss = 3.6549e-02, PNorm = 534.9388, GNorm = 1.4549, lr_0 = 9.9740e-04
Validation binary_cross_entropy = 0.111741
Epoch 4040
Loss = 2.0595e-02, PNorm = 534.9760, GNorm = 3.2751, lr_0 = 9.9740e-04
Validation binary_cross_entropy = 0.111576
Epoch 4041
Loss = 1.4208e-02, PNorm = 535.0181, GNorm = 0.2395, lr_0 = 9.9740e-04
Validation binary_cross_entropy = 0.100333
Epoch 4042
Loss = 4.2832e-02, PNorm = 535.0590, GNorm = 0.4132, lr_0 = 9.9740e-04
Validation binary_cross_entropy = 0.096805
Epoch 4043
Loss = 2.6976e-02, PNorm = 535.0990, GNorm = 0.7790, lr_0 = 9.9740e-04
Validation binary_cross_entropy = 0.089909
Epoch 4044
Loss = 1.4821e-02, PNorm = 535.1353, GNorm = 1.9393, lr_0 = 9.9740e-04
Validation binary_cross_entropy = 0.089725
Epoch 4045
Loss = 1.0925e-02, PNorm = 535.1791, GNorm = 0.0552, lr_0 = 9.9740e-04
Validation binary_cross_entropy = 0.095153
Epoch 4046
Loss = 1.7893e-03, PNorm = 535.2173, GNorm = 0.0603, lr_0 = 9.9740e-04
Validation binary_cross_entropy = 0.092532
Epoch 4047
Loss = 3.6057e-02, PNorm = 535.2545, GNorm = 1.0873, lr_0 = 9.9740e-04
Validation binary_cross_entropy = 0.079390
Epoch 4048
Loss = 6.4267e-02, PNorm = 535.3023, GNorm = 1.2558, lr_0 = 9.9740e-04
Validation binary_cross_entropy = 0.118210
Epoch 4049
Loss = 1.6842e-03, PNorm = 535.3785, GNorm = 0.0574, lr_0 = 9.9740e-04
Loss = 4.1242e-02, PNorm = 535.4403, GNorm = 0.2102, lr_0 = 9.9740e-04
Validation binary_cross_entropy = 0.090525
Epoch 4050
Loss = 5.4928e-02, PNorm = 535.5256, GNorm = 0.0619, lr_0 = 9.9740e-04
Validation binary_cross_entropy = 0.086855
Epoch 4051
Loss = 3.8081e-02, PNorm = 535.6056, GNorm = 0.7954, lr_0 = 9.9740e-04
Validation binary_cross_entropy = 0.064490
Epoch 4052
Loss = 4.0569e-02, PNorm = 535.6777, GNorm = 0.3090, lr_0 = 9.9739e-04
Validation binary_cross_entropy = 0.061001
Epoch 4053
Loss = 2.0704e-02, PNorm = 535.7385, GNorm = 1.3717, lr_0 = 9.9739e-04
Validation binary_cross_entropy = 0.068706
Epoch 4054
Loss = 4.4462e-02, PNorm = 535.7912, GNorm = 0.0330, lr_0 = 9.9739e-04
Validation binary_cross_entropy = 0.070467
Epoch 4055
Loss = 3.8223e-02, PNorm = 535.8348, GNorm = 0.2610, lr_0 = 9.9739e-04
Validation binary_cross_entropy = 0.064057
Epoch 4056
Loss = 2.2111e-02, PNorm = 535.8715, GNorm = 0.7905, lr_0 = 9.9739e-04
Validation binary_cross_entropy = 0.085509
Epoch 4057
Loss = 6.8772e-03, PNorm = 535.9239, GNorm = 0.4798, lr_0 = 9.9739e-04
Validation binary_cross_entropy = 0.102904
Epoch 4058
Loss = 2.0286e-02, PNorm = 535.9646, GNorm = 0.0493, lr_0 = 9.9739e-04
Validation binary_cross_entropy = 0.098991
Epoch 4059
Loss = 1.5096e-02, PNorm = 535.9989, GNorm = 0.8713, lr_0 = 9.9739e-04
Loss = 1.0496e-01, PNorm = 536.0442, GNorm = 5.0576, lr_0 = 9.9739e-04
Validation binary_cross_entropy = 0.067159
Epoch 4060
Loss = 3.8581e-02, PNorm = 536.1268, GNorm = 1.0735, lr_0 = 9.9739e-04
Validation binary_cross_entropy = 0.077038
Epoch 4061
Loss = 1.9650e-02, PNorm = 536.1942, GNorm = 0.2710, lr_0 = 9.9739e-04
Validation binary_cross_entropy = 0.079003
Epoch 4062
Loss = 1.0489e-02, PNorm = 536.2427, GNorm = 0.1356, lr_0 = 9.9739e-04
Validation binary_cross_entropy = 0.089096
Epoch 4063
Loss = 1.5412e-02, PNorm = 536.2889, GNorm = 1.2044, lr_0 = 9.9739e-04
Validation binary_cross_entropy = 0.088046
Epoch 4064
Loss = 2.2740e-02, PNorm = 536.3395, GNorm = 1.1985, lr_0 = 9.9739e-04
Validation binary_cross_entropy = 0.111996
Epoch 4065
Loss = 8.5372e-02, PNorm = 536.4134, GNorm = 3.4873, lr_0 = 9.9738e-04
Validation binary_cross_entropy = 0.102870
Epoch 4066
Loss = 1.1519e-02, PNorm = 536.4892, GNorm = 1.1120, lr_0 = 9.9738e-04
Validation binary_cross_entropy = 0.099644
Epoch 4067
Loss = 8.1787e-02, PNorm = 536.5622, GNorm = 1.3679, lr_0 = 9.9738e-04
Validation binary_cross_entropy = 0.095676
Epoch 4068
Loss = 5.4884e-03, PNorm = 536.6290, GNorm = 0.6208, lr_0 = 9.9738e-04
Validation binary_cross_entropy = 0.078219
Epoch 4069
Loss = 6.4953e-03, PNorm = 536.6970, GNorm = 0.2677, lr_0 = 9.9738e-04
Loss = 2.0432e-02, PNorm = 536.7750, GNorm = 1.3620, lr_0 = 9.9738e-04
Validation binary_cross_entropy = 0.064846
Epoch 4070
Loss = 3.9903e-02, PNorm = 536.8415, GNorm = 1.6777, lr_0 = 9.9738e-04
Validation binary_cross_entropy = 0.068860
Epoch 4071
Loss = 1.9190e-02, PNorm = 536.9095, GNorm = 1.8258, lr_0 = 9.9738e-04
Validation binary_cross_entropy = 0.081280
Epoch 4072
Loss = 4.2774e-03, PNorm = 536.9613, GNorm = 0.0599, lr_0 = 9.9738e-04
Validation binary_cross_entropy = 0.094968
Epoch 4073
Loss = 3.5099e-02, PNorm = 536.9996, GNorm = 2.4008, lr_0 = 9.9738e-04
Validation binary_cross_entropy = 0.103942
Epoch 4074
Loss = 4.7866e-02, PNorm = 537.0284, GNorm = 1.1481, lr_0 = 9.9738e-04
Validation binary_cross_entropy = 0.074718
Epoch 4075
Loss = 1.9425e-02, PNorm = 537.1086, GNorm = 0.2591, lr_0 = 9.9738e-04
Validation binary_cross_entropy = 0.098475
Epoch 4076
Loss = 6.0859e-02, PNorm = 537.1722, GNorm = 0.1598, lr_0 = 9.9738e-04
Validation binary_cross_entropy = 0.082038
Epoch 4077
Loss = 3.4085e-02, PNorm = 537.2574, GNorm = 0.3026, lr_0 = 9.9738e-04
Validation binary_cross_entropy = 0.097209
Epoch 4078
Loss = 2.0714e-02, PNorm = 537.3430, GNorm = 0.0650, lr_0 = 9.9738e-04
Validation binary_cross_entropy = 0.071411
Epoch 4079
Loss = 2.7599e-02, PNorm = 537.4240, GNorm = 0.6598, lr_0 = 9.9737e-04
Loss = 3.0650e-02, PNorm = 537.4946, GNorm = 0.4495, lr_0 = 9.9737e-04
Validation binary_cross_entropy = 0.060793
Epoch 4080
Loss = 2.0792e-02, PNorm = 537.5484, GNorm = 0.4529, lr_0 = 9.9737e-04
Validation binary_cross_entropy = 0.068060
Epoch 4081
Loss = 5.7235e-02, PNorm = 537.5935, GNorm = 1.2975, lr_0 = 9.9737e-04
Validation binary_cross_entropy = 0.062438
Epoch 4082
Loss = 1.4228e-02, PNorm = 537.6445, GNorm = 0.3287, lr_0 = 9.9737e-04
Validation binary_cross_entropy = 0.080068
Epoch 4083
Loss = 4.4668e-02, PNorm = 537.6821, GNorm = 1.5721, lr_0 = 9.9737e-04
Validation binary_cross_entropy = 0.107681
Epoch 4084
Loss = 1.5843e-03, PNorm = 537.7136, GNorm = 0.0341, lr_0 = 9.9737e-04
Validation binary_cross_entropy = 0.123525
Epoch 4085
Loss = 2.0270e-04, PNorm = 537.7353, GNorm = 0.0189, lr_0 = 9.9737e-04
Validation binary_cross_entropy = 0.105460
Epoch 4086
Loss = 7.6626e-02, PNorm = 537.7481, GNorm = 3.5686, lr_0 = 9.9737e-04
Validation binary_cross_entropy = 0.081311
Epoch 4087
Loss = 2.4521e-02, PNorm = 537.8062, GNorm = 2.5790, lr_0 = 9.9737e-04
Validation binary_cross_entropy = 0.100069
Epoch 4088
Loss = 2.7147e-02, PNorm = 537.8609, GNorm = 1.0639, lr_0 = 9.9737e-04
Validation binary_cross_entropy = 0.075832
Epoch 4089
Loss = 2.6619e-02, PNorm = 537.8992, GNorm = 1.1570, lr_0 = 9.9737e-04
Loss = 3.4086e-02, PNorm = 537.9396, GNorm = 2.4956, lr_0 = 9.9737e-04
Validation binary_cross_entropy = 0.067266
Epoch 4090
Loss = 8.2253e-03, PNorm = 537.9822, GNorm = 0.0596, lr_0 = 9.9737e-04
Validation binary_cross_entropy = 0.070150
Epoch 4091
Loss = 1.4860e-02, PNorm = 538.0203, GNorm = 0.2966, lr_0 = 9.9737e-04
Validation binary_cross_entropy = 0.077641
Epoch 4092
Loss = 8.0221e-03, PNorm = 538.0483, GNorm = 0.3063, lr_0 = 9.9736e-04
Validation binary_cross_entropy = 0.078994
Epoch 4093
Loss = 7.7128e-03, PNorm = 538.0730, GNorm = 0.6142, lr_0 = 9.9736e-04
Validation binary_cross_entropy = 0.093101
Epoch 4094
Loss = 1.5769e-02, PNorm = 538.0988, GNorm = 2.6791, lr_0 = 9.9736e-04
Validation binary_cross_entropy = 0.092240
Epoch 4095
Loss = 3.3056e-02, PNorm = 538.1285, GNorm = 1.6080, lr_0 = 9.9736e-04
Validation binary_cross_entropy = 0.132062
Epoch 4096
Loss = 1.6098e-02, PNorm = 538.2189, GNorm = 0.6441, lr_0 = 9.9736e-04
Validation binary_cross_entropy = 0.170401
Epoch 4097
Loss = 4.9704e-02, PNorm = 538.2791, GNorm = 3.2946, lr_0 = 9.9736e-04
Validation binary_cross_entropy = 0.086374
Epoch 4098
Loss = 2.2948e-02, PNorm = 538.3271, GNorm = 0.5387, lr_0 = 9.9736e-04
Validation binary_cross_entropy = 0.094983
Epoch 4099
Loss = 6.0078e-04, PNorm = 538.3836, GNorm = 0.0245, lr_0 = 9.9736e-04
Loss = 4.5819e-02, PNorm = 538.4347, GNorm = 1.5646, lr_0 = 9.9736e-04
Validation binary_cross_entropy = 0.083987
Epoch 4100
Loss = 3.4002e-02, PNorm = 538.4903, GNorm = 0.3250, lr_0 = 9.9736e-04
Validation binary_cross_entropy = 0.081564
Epoch 4101
Loss = 9.3445e-03, PNorm = 538.5399, GNorm = 0.3965, lr_0 = 9.9736e-04
Validation binary_cross_entropy = 0.079508
Epoch 4102
Loss = 2.9609e-02, PNorm = 538.5859, GNorm = 0.0269, lr_0 = 9.9736e-04
Validation binary_cross_entropy = 0.089281
Epoch 4103
Loss = 2.9771e-03, PNorm = 538.6192, GNorm = 0.1038, lr_0 = 9.9736e-04
Validation binary_cross_entropy = 0.089410
Epoch 4104
Loss = 4.1528e-03, PNorm = 538.6430, GNorm = 0.1505, lr_0 = 9.9736e-04
Validation binary_cross_entropy = 0.091302
Epoch 4105
Loss = 4.3495e-03, PNorm = 538.6679, GNorm = 0.4640, lr_0 = 9.9735e-04
Validation binary_cross_entropy = 0.097742
Epoch 4106
Loss = 6.1406e-02, PNorm = 538.7062, GNorm = 0.5349, lr_0 = 9.9735e-04
Validation binary_cross_entropy = 0.102263
Epoch 4107
Loss = 2.0249e-01, PNorm = 538.7589, GNorm = 10.9884, lr_0 = 9.9735e-04
Validation binary_cross_entropy = 0.072344
Epoch 4108
Loss = 1.7825e-02, PNorm = 538.8260, GNorm = 0.6905, lr_0 = 9.9735e-04
Validation binary_cross_entropy = 0.080079
Epoch 4109
Loss = 2.2886e-02, PNorm = 538.8981, GNorm = 0.5181, lr_0 = 9.9735e-04
Loss = 3.9231e-02, PNorm = 538.9555, GNorm = 0.4116, lr_0 = 9.9735e-04
Validation binary_cross_entropy = 0.080345
Epoch 4110
Loss = 2.9060e-02, PNorm = 539.0012, GNorm = 0.1765, lr_0 = 9.9735e-04
Validation binary_cross_entropy = 0.072486
Epoch 4111
Loss = 3.5458e-02, PNorm = 539.0558, GNorm = 0.4877, lr_0 = 9.9735e-04
Validation binary_cross_entropy = 0.078710
Epoch 4112
Loss = 5.6495e-02, PNorm = 539.1149, GNorm = 1.1479, lr_0 = 9.9735e-04
Validation binary_cross_entropy = 0.056417
Epoch 4113
Loss = 3.1431e-02, PNorm = 539.1978, GNorm = 1.6808, lr_0 = 9.9735e-04
Validation binary_cross_entropy = 0.060952
Epoch 4114
Loss = 2.3708e-02, PNorm = 539.2649, GNorm = 0.8081, lr_0 = 9.9735e-04
Validation binary_cross_entropy = 0.100545
Epoch 4115
Loss = 1.3961e-02, PNorm = 539.3236, GNorm = 0.8207, lr_0 = 9.9735e-04
Validation binary_cross_entropy = 0.097506
Epoch 4116
Loss = 5.5534e-03, PNorm = 539.3605, GNorm = 0.5207, lr_0 = 9.9735e-04
Validation binary_cross_entropy = 0.086047
Epoch 4117
Loss = 9.2657e-02, PNorm = 539.3939, GNorm = 0.0172, lr_0 = 9.9735e-04
Validation binary_cross_entropy = 0.104767
Epoch 4118
Loss = 3.3344e-02, PNorm = 539.4558, GNorm = 0.8960, lr_0 = 9.9735e-04
Validation binary_cross_entropy = 0.071017
Epoch 4119
Loss = 6.0537e-02, PNorm = 539.5188, GNorm = 1.5390, lr_0 = 9.9734e-04
Loss = 4.7335e-02, PNorm = 539.5937, GNorm = 1.0713, lr_0 = 9.9734e-04
Validation binary_cross_entropy = 0.075965
Epoch 4120
Loss = 9.0521e-03, PNorm = 539.6625, GNorm = 0.2075, lr_0 = 9.9734e-04
Validation binary_cross_entropy = 0.108176
Epoch 4121
Loss = 3.9549e-02, PNorm = 539.6963, GNorm = 1.6067, lr_0 = 9.9734e-04
Validation binary_cross_entropy = 0.068809
Epoch 4122
Loss = 5.7862e-02, PNorm = 539.7437, GNorm = 3.1349, lr_0 = 9.9734e-04
Validation binary_cross_entropy = 0.061785
Epoch 4123
Loss = 5.1493e-02, PNorm = 539.8048, GNorm = 0.7656, lr_0 = 9.9734e-04
Validation binary_cross_entropy = 0.057606
Epoch 4124
Loss = 4.2433e-02, PNorm = 539.8767, GNorm = 0.8811, lr_0 = 9.9734e-04
Validation binary_cross_entropy = 0.070923
Epoch 4125
Loss = 3.3409e-02, PNorm = 539.9522, GNorm = 3.3716, lr_0 = 9.9734e-04
Validation binary_cross_entropy = 0.061559
Epoch 4126
Loss = 8.8748e-02, PNorm = 540.0080, GNorm = 6.1616, lr_0 = 9.9734e-04
Validation binary_cross_entropy = 0.095858
Epoch 4127
Loss = 1.0626e-02, PNorm = 540.0559, GNorm = 0.6167, lr_0 = 9.9734e-04
Validation binary_cross_entropy = 0.092303
Epoch 4128
Loss = 5.4302e-03, PNorm = 540.0991, GNorm = 0.5433, lr_0 = 9.9734e-04
Validation binary_cross_entropy = 0.084646
Epoch 4129
Loss = 8.9023e-02, PNorm = 540.1397, GNorm = 4.9668, lr_0 = 9.9734e-04
Loss = 5.4344e-02, PNorm = 540.2115, GNorm = 3.4424, lr_0 = 9.9734e-04
Validation binary_cross_entropy = 0.070906
Epoch 4130
Loss = 5.6264e-02, PNorm = 540.2958, GNorm = 2.3732, lr_0 = 9.9734e-04
Validation binary_cross_entropy = 0.073664
Epoch 4131
Loss = 3.8876e-02, PNorm = 540.3728, GNorm = 0.3047, lr_0 = 9.9733e-04
Validation binary_cross_entropy = 0.068948
Epoch 4132
Loss = 4.4388e-02, PNorm = 540.4449, GNorm = 0.4173, lr_0 = 9.9733e-04
Validation binary_cross_entropy = 0.075352
Epoch 4133
Loss = 5.2026e-02, PNorm = 540.4975, GNorm = 0.1899, lr_0 = 9.9733e-04
Validation binary_cross_entropy = 0.064896
Epoch 4134
Loss = 5.2251e-02, PNorm = 540.5680, GNorm = 0.4788, lr_0 = 9.9733e-04
Validation binary_cross_entropy = 0.085220
Epoch 4135
Loss = 9.3788e-02, PNorm = 540.6356, GNorm = 0.9778, lr_0 = 9.9733e-04
Validation binary_cross_entropy = 0.076857
Epoch 4136
Loss = 2.8643e-02, PNorm = 540.6989, GNorm = 0.2310, lr_0 = 9.9733e-04
Validation binary_cross_entropy = 0.079058
Epoch 4137
Loss = 1.5710e-02, PNorm = 540.7617, GNorm = 0.5677, lr_0 = 9.9733e-04
Validation binary_cross_entropy = 0.085575
Epoch 4138
Loss = 7.4638e-03, PNorm = 540.8130, GNorm = 0.6186, lr_0 = 9.9733e-04
Validation binary_cross_entropy = 0.079027
Epoch 4139
Loss = 2.2365e-02, PNorm = 540.8579, GNorm = 1.1820, lr_0 = 9.9733e-04
Loss = 1.3349e-02, PNorm = 540.9025, GNorm = 0.6827, lr_0 = 9.9733e-04
Validation binary_cross_entropy = 0.093139
Epoch 4140
Loss = 6.2576e-02, PNorm = 540.9510, GNorm = 2.4274, lr_0 = 9.9733e-04
Validation binary_cross_entropy = 0.071985
Epoch 4141
Loss = 6.8324e-02, PNorm = 541.0303, GNorm = 1.2806, lr_0 = 9.9733e-04
Validation binary_cross_entropy = 0.111149
Epoch 4142
Loss = 4.6157e-02, PNorm = 541.1113, GNorm = 0.9872, lr_0 = 9.9733e-04
Validation binary_cross_entropy = 0.082956
Epoch 4143
Loss = 1.6264e-02, PNorm = 541.1707, GNorm = 0.1264, lr_0 = 9.9733e-04
Validation binary_cross_entropy = 0.080286
Epoch 4144
Loss = 5.2035e-02, PNorm = 541.2329, GNorm = 0.3561, lr_0 = 9.9733e-04
Validation binary_cross_entropy = 0.099119
Epoch 4145
Loss = 4.2257e-02, PNorm = 541.3208, GNorm = 1.0672, lr_0 = 9.9732e-04
Validation binary_cross_entropy = 0.181417
Epoch 4146
Loss = 2.1871e-02, PNorm = 541.4044, GNorm = 1.3345, lr_0 = 9.9732e-04
Validation binary_cross_entropy = 0.194762
Epoch 4147
Loss = 1.1607e-02, PNorm = 541.4621, GNorm = 0.2906, lr_0 = 9.9732e-04
Validation binary_cross_entropy = 0.214631
Epoch 4148
Loss = 1.6750e-01, PNorm = 541.5119, GNorm = 2.7023, lr_0 = 9.9732e-04
Validation binary_cross_entropy = 0.130169
Epoch 4149
Loss = 6.9135e-02, PNorm = 541.5758, GNorm = 1.1638, lr_0 = 9.9732e-04
Loss = 2.0953e-02, PNorm = 541.6524, GNorm = 0.3678, lr_0 = 9.9732e-04
Validation binary_cross_entropy = 0.132756
Epoch 4150
Loss = 7.4515e-03, PNorm = 541.7121, GNorm = 0.2549, lr_0 = 9.9732e-04
Validation binary_cross_entropy = 0.166570
Epoch 4151
Loss = 3.4303e-03, PNorm = 541.7498, GNorm = 0.0385, lr_0 = 9.9732e-04
Validation binary_cross_entropy = 0.229877
Epoch 4152
Loss = 8.2082e-02, PNorm = 541.7884, GNorm = 3.0535, lr_0 = 9.9732e-04
Validation binary_cross_entropy = 0.099156
Epoch 4153
Loss = 3.1613e-02, PNorm = 541.8604, GNorm = 0.4708, lr_0 = 9.9732e-04
Validation binary_cross_entropy = 0.114550
Epoch 4154
Loss = 8.7019e-03, PNorm = 541.9384, GNorm = 0.0680, lr_0 = 9.9732e-04
Validation binary_cross_entropy = 0.121732
Epoch 4155
Loss = 2.6765e-03, PNorm = 541.9855, GNorm = 0.3074, lr_0 = 9.9732e-04
Validation binary_cross_entropy = 0.101707
Epoch 4156
Loss = 7.2911e-03, PNorm = 542.0196, GNorm = 0.5266, lr_0 = 9.9732e-04
Validation binary_cross_entropy = 0.105961
Epoch 4157
Loss = 2.8913e-02, PNorm = 542.0759, GNorm = 0.3862, lr_0 = 9.9732e-04
Validation binary_cross_entropy = 0.139423
Epoch 4158
Loss = 6.0353e-03, PNorm = 542.1401, GNorm = 0.4332, lr_0 = 9.9731e-04
Validation binary_cross_entropy = 0.127556
Epoch 4159
Loss = 6.4538e-04, PNorm = 542.1975, GNorm = 0.0274, lr_0 = 9.9731e-04
Loss = 6.4083e-02, PNorm = 542.2722, GNorm = 0.7175, lr_0 = 9.9731e-04
Validation binary_cross_entropy = 0.126729
Epoch 4160
Loss = 4.4296e-02, PNorm = 542.3794, GNorm = 0.6615, lr_0 = 9.9731e-04
Validation binary_cross_entropy = 0.121261
Epoch 4161
Loss = 9.0522e-02, PNorm = 542.4776, GNorm = 0.8616, lr_0 = 9.9731e-04
Validation binary_cross_entropy = 0.090009
Epoch 4162
Loss = 2.4303e-02, PNorm = 542.5682, GNorm = 0.5587, lr_0 = 9.9731e-04
Validation binary_cross_entropy = 0.100283
Epoch 4163
Loss = 6.8124e-02, PNorm = 542.6396, GNorm = 0.2379, lr_0 = 9.9731e-04
Validation binary_cross_entropy = 0.085571
Epoch 4164
Loss = 6.8904e-02, PNorm = 542.7183, GNorm = 0.0375, lr_0 = 9.9731e-04
Validation binary_cross_entropy = 0.081343
Epoch 4165
Loss = 3.3081e-02, PNorm = 542.7994, GNorm = 0.7498, lr_0 = 9.9731e-04
Validation binary_cross_entropy = 0.075336
Epoch 4166
Loss = 4.0519e-02, PNorm = 542.8642, GNorm = 0.7406, lr_0 = 9.9731e-04
Validation binary_cross_entropy = 0.078772
Epoch 4167
Loss = 9.9865e-03, PNorm = 542.9128, GNorm = 0.3361, lr_0 = 9.9731e-04
Validation binary_cross_entropy = 0.094600
Epoch 4168
Loss = 7.4945e-02, PNorm = 542.9553, GNorm = 3.7599, lr_0 = 9.9731e-04
Validation binary_cross_entropy = 0.099756
Epoch 4169
Loss = 3.4435e-02, PNorm = 543.0004, GNorm = 2.0853, lr_0 = 9.9731e-04
Loss = 3.4948e-02, PNorm = 543.0403, GNorm = 1.2684, lr_0 = 9.9731e-04
Validation binary_cross_entropy = 0.080335
Epoch 4170
Loss = 7.2149e-02, PNorm = 543.0990, GNorm = 0.1039, lr_0 = 9.9731e-04
Validation binary_cross_entropy = 0.082673
Epoch 4171
Loss = 4.3130e-02, PNorm = 543.1633, GNorm = 0.3085, lr_0 = 9.9730e-04
Validation binary_cross_entropy = 0.066461
Epoch 4172
Loss = 3.0733e-02, PNorm = 543.2306, GNorm = 1.6416, lr_0 = 9.9730e-04
Validation binary_cross_entropy = 0.074560
Epoch 4173
Loss = 3.3019e-02, PNorm = 543.2886, GNorm = 0.2119, lr_0 = 9.9730e-04
Validation binary_cross_entropy = 0.078299
Epoch 4174
Loss = 3.0654e-02, PNorm = 543.3345, GNorm = 0.2793, lr_0 = 9.9730e-04
Validation binary_cross_entropy = 0.080254
Epoch 4175
Loss = 1.3714e-02, PNorm = 543.3666, GNorm = 0.1083, lr_0 = 9.9730e-04
Validation binary_cross_entropy = 0.082001
Epoch 4176
Loss = 3.4852e-03, PNorm = 543.3992, GNorm = 0.0332, lr_0 = 9.9730e-04
Validation binary_cross_entropy = 0.095980
Epoch 4177
Loss = 1.3911e-02, PNorm = 543.4283, GNorm = 0.3438, lr_0 = 9.9730e-04
Validation binary_cross_entropy = 0.078101
Epoch 4178
Loss = 1.3890e-02, PNorm = 543.4549, GNorm = 2.9010, lr_0 = 9.9730e-04
Validation binary_cross_entropy = 0.071584
Epoch 4179
Loss = 6.2054e-02, PNorm = 543.4994, GNorm = 2.5116, lr_0 = 9.9730e-04
Loss = 1.6875e-02, PNorm = 543.5459, GNorm = 0.0410, lr_0 = 9.9730e-04
Validation binary_cross_entropy = 0.088610
Epoch 4180
Loss = 1.4774e-02, PNorm = 543.5761, GNorm = 0.0195, lr_0 = 9.9730e-04
Validation binary_cross_entropy = 0.085359
Epoch 4181
Loss = 1.2797e-02, PNorm = 543.6019, GNorm = 0.3713, lr_0 = 9.9730e-04
Validation binary_cross_entropy = 0.088367
Epoch 4182
Loss = 1.7257e-02, PNorm = 543.6303, GNorm = 1.9098, lr_0 = 9.9730e-04
Validation binary_cross_entropy = 0.114073
Epoch 4183
Loss = 2.9632e-02, PNorm = 543.6605, GNorm = 0.0011, lr_0 = 9.9730e-04
Validation binary_cross_entropy = 0.095598
Epoch 4184
Loss = 7.6023e-03, PNorm = 543.7051, GNorm = 0.8443, lr_0 = 9.9730e-04
Validation binary_cross_entropy = 0.119560
Epoch 4185
Loss = 2.3984e-03, PNorm = 543.7419, GNorm = 0.0452, lr_0 = 9.9729e-04
Validation binary_cross_entropy = 0.109306
Epoch 4186
Loss = 1.2969e-03, PNorm = 543.7639, GNorm = 0.1040, lr_0 = 9.9729e-04
Validation binary_cross_entropy = 0.108006
Epoch 4187
Loss = 1.1317e-02, PNorm = 543.7944, GNorm = 0.0282, lr_0 = 9.9729e-04
Validation binary_cross_entropy = 0.129743
Epoch 4188
Loss = 2.0241e-04, PNorm = 543.8397, GNorm = 0.0023, lr_0 = 9.9729e-04
Validation binary_cross_entropy = 0.124583
Epoch 4189
Loss = 4.5967e-04, PNorm = 543.8708, GNorm = 0.0541, lr_0 = 9.9729e-04
Loss = 7.4311e-03, PNorm = 543.8973, GNorm = 0.0037, lr_0 = 9.9729e-04
Validation binary_cross_entropy = 0.121469
Epoch 4190
Loss = 2.2980e-02, PNorm = 543.9325, GNorm = 0.0419, lr_0 = 9.9729e-04
Validation binary_cross_entropy = 0.122092
Epoch 4191
Loss = 4.0748e-02, PNorm = 543.9721, GNorm = 0.4960, lr_0 = 9.9729e-04
Validation binary_cross_entropy = 0.095298
Epoch 4192
Loss = 9.2583e-02, PNorm = 544.0390, GNorm = 0.3527, lr_0 = 9.9729e-04
Validation binary_cross_entropy = 0.078243
Epoch 4193
Loss = 1.9743e-02, PNorm = 544.1203, GNorm = 0.1267, lr_0 = 9.9729e-04
Validation binary_cross_entropy = 0.114345
Epoch 4194
Loss = 2.8441e-02, PNorm = 544.1855, GNorm = 0.2137, lr_0 = 9.9729e-04
Validation binary_cross_entropy = 0.079825
Epoch 4195
Loss = 2.7759e-02, PNorm = 544.2378, GNorm = 1.2521, lr_0 = 9.9729e-04
Validation binary_cross_entropy = 0.079228
Epoch 4196
Loss = 2.4577e-02, PNorm = 544.2995, GNorm = 0.6163, lr_0 = 9.9729e-04
Validation binary_cross_entropy = 0.094906
Epoch 4197
Loss = 3.3003e-03, PNorm = 544.3595, GNorm = 0.0151, lr_0 = 9.9729e-04
Validation binary_cross_entropy = 0.122974
Epoch 4198
Loss = 2.5890e-03, PNorm = 544.4060, GNorm = 0.3290, lr_0 = 9.9728e-04
Validation binary_cross_entropy = 0.143607
Epoch 4199
Loss = 1.9491e-03, PNorm = 544.4375, GNorm = 0.2985, lr_0 = 9.9728e-04
Loss = 6.9111e-02, PNorm = 544.4577, GNorm = 4.9549, lr_0 = 9.9728e-04
Validation binary_cross_entropy = 0.106023
Epoch 4200
Loss = 2.0364e-02, PNorm = 544.5399, GNorm = 0.2475, lr_0 = 9.9728e-04
Validation binary_cross_entropy = 0.093297
Epoch 4201
Loss = 6.9469e-02, PNorm = 544.6125, GNorm = 0.4735, lr_0 = 9.9728e-04
Validation binary_cross_entropy = 0.063242
Epoch 4202
Loss = 2.2251e-02, PNorm = 544.7029, GNorm = 0.1009, lr_0 = 9.9728e-04
Validation binary_cross_entropy = 0.077920
Epoch 4203
Loss = 1.3156e-02, PNorm = 544.7737, GNorm = 0.1010, lr_0 = 9.9728e-04
Validation binary_cross_entropy = 0.081522
Epoch 4204
Loss = 5.8218e-02, PNorm = 544.8281, GNorm = 2.6843, lr_0 = 9.9728e-04
Validation binary_cross_entropy = 0.071488
Epoch 4205
Loss = 3.3789e-02, PNorm = 544.8948, GNorm = 0.9068, lr_0 = 9.9728e-04
Validation binary_cross_entropy = 0.090566
Epoch 4206
Loss = 1.3827e-02, PNorm = 544.9531, GNorm = 0.0736, lr_0 = 9.9728e-04
Validation binary_cross_entropy = 0.075318
Epoch 4207
Loss = 5.4236e-02, PNorm = 545.0075, GNorm = 0.8928, lr_0 = 9.9728e-04
Validation binary_cross_entropy = 0.079036
Epoch 4208
Loss = 1.0032e-02, PNorm = 545.0850, GNorm = 0.6554, lr_0 = 9.9728e-04
Validation binary_cross_entropy = 0.087705
Epoch 4209
Loss = 2.7443e-03, PNorm = 545.1512, GNorm = 0.3620, lr_0 = 9.9728e-04
Loss = 2.7782e-02, PNorm = 545.1965, GNorm = 0.2709, lr_0 = 9.9728e-04
Validation binary_cross_entropy = 0.082667
Epoch 4210
Loss = 2.3573e-02, PNorm = 545.2393, GNorm = 5.9512, lr_0 = 9.9728e-04
Validation binary_cross_entropy = 0.078794
Epoch 4211
Loss = 3.1922e-02, PNorm = 545.2946, GNorm = 0.2656, lr_0 = 9.9727e-04
Validation binary_cross_entropy = 0.064818
Epoch 4212
Loss = 1.4684e-02, PNorm = 545.3617, GNorm = 0.2885, lr_0 = 9.9727e-04
Validation binary_cross_entropy = 0.085309
Epoch 4213
Loss = 3.0609e-02, PNorm = 545.4246, GNorm = 0.0340, lr_0 = 9.9727e-04
Validation binary_cross_entropy = 0.096268
Epoch 4214
Loss = 1.5926e-02, PNorm = 545.4865, GNorm = 2.2967, lr_0 = 9.9727e-04
Validation binary_cross_entropy = 0.120426
Epoch 4215
Loss = 7.5487e-02, PNorm = 545.5554, GNorm = 1.0340, lr_0 = 9.9727e-04
Validation binary_cross_entropy = 0.198216
Epoch 4216
Loss = 1.0311e-01, PNorm = 545.6036, GNorm = 0.3102, lr_0 = 9.9727e-04
Validation binary_cross_entropy = 0.091382
Epoch 4217
Loss = 1.8471e-02, PNorm = 545.6394, GNorm = 0.6123, lr_0 = 9.9727e-04
Validation binary_cross_entropy = 0.056841
Epoch 4218
Loss = 1.7869e-02, PNorm = 545.7024, GNorm = 0.8873, lr_0 = 9.9727e-04
Validation binary_cross_entropy = 0.072797
Epoch 4219
Loss = 2.0370e-02, PNorm = 545.7767, GNorm = 0.6659, lr_0 = 9.9727e-04
Loss = 1.3917e-02, PNorm = 545.8350, GNorm = 0.6991, lr_0 = 9.9727e-04
Validation binary_cross_entropy = 0.087595
Epoch 4220
Loss = 1.6902e-02, PNorm = 545.8792, GNorm = 0.0149, lr_0 = 9.9727e-04
Validation binary_cross_entropy = 0.083905
Epoch 4221
Loss = 3.9682e-02, PNorm = 545.9145, GNorm = 0.0509, lr_0 = 9.9727e-04
Validation binary_cross_entropy = 0.077519
Epoch 4222
Loss = 1.9956e-02, PNorm = 545.9467, GNorm = 1.6906, lr_0 = 9.9727e-04
Validation binary_cross_entropy = 0.071696
Epoch 4223
Loss = 1.2856e-02, PNorm = 545.9794, GNorm = 0.3666, lr_0 = 9.9727e-04
Validation binary_cross_entropy = 0.072135
Epoch 4224
Loss = 2.8432e-02, PNorm = 546.0267, GNorm = 0.7692, lr_0 = 9.9726e-04
Validation binary_cross_entropy = 0.063302
Epoch 4225
Loss = 6.8915e-02, PNorm = 546.0836, GNorm = 0.4633, lr_0 = 9.9726e-04
Validation binary_cross_entropy = 0.057053
Epoch 4226
Loss = 5.1332e-02, PNorm = 546.1785, GNorm = 1.8504, lr_0 = 9.9726e-04
Validation binary_cross_entropy = 0.083310
Epoch 4227
Loss = 6.5307e-02, PNorm = 546.2463, GNorm = 1.9707, lr_0 = 9.9726e-04
Validation binary_cross_entropy = 0.060649
Epoch 4228
Loss = 2.9998e-02, PNorm = 546.2868, GNorm = 0.2012, lr_0 = 9.9726e-04
Validation binary_cross_entropy = 0.061823
Epoch 4229
Loss = 7.2049e-03, PNorm = 546.3423, GNorm = 0.2202, lr_0 = 9.9726e-04
Loss = 7.6570e-03, PNorm = 546.3893, GNorm = 0.2553, lr_0 = 9.9726e-04
Validation binary_cross_entropy = 0.077424
Epoch 4230
Loss = 1.0159e-02, PNorm = 546.4198, GNorm = 0.2085, lr_0 = 9.9726e-04
Validation binary_cross_entropy = 0.080503
Epoch 4231
Loss = 1.2982e-02, PNorm = 546.4516, GNorm = 0.0720, lr_0 = 9.9726e-04
Validation binary_cross_entropy = 0.080874
Epoch 4232
Loss = 2.9578e-02, PNorm = 546.4780, GNorm = 0.0907, lr_0 = 9.9726e-04
Validation binary_cross_entropy = 0.068833
Epoch 4233
Loss = 2.1389e-02, PNorm = 546.5303, GNorm = 0.7480, lr_0 = 9.9726e-04
Validation binary_cross_entropy = 0.075072
Epoch 4234
Loss = 4.7975e-02, PNorm = 546.5764, GNorm = 1.3145, lr_0 = 9.9726e-04
Validation binary_cross_entropy = 0.058954
Epoch 4235
Loss = 2.0469e-02, PNorm = 546.6288, GNorm = 0.1056, lr_0 = 9.9726e-04
Validation binary_cross_entropy = 0.071509
Epoch 4236
Loss = 3.5425e-02, PNorm = 546.6968, GNorm = 1.1253, lr_0 = 9.9726e-04
Validation binary_cross_entropy = 0.078618
Epoch 4237
Loss = 1.0719e-02, PNorm = 546.7449, GNorm = 0.1660, lr_0 = 9.9726e-04
Validation binary_cross_entropy = 0.080632
Epoch 4238
Loss = 4.2150e-03, PNorm = 546.7923, GNorm = 0.0736, lr_0 = 9.9725e-04
Validation binary_cross_entropy = 0.088729
Epoch 4239
Loss = 2.8717e-03, PNorm = 546.8394, GNorm = 0.0564, lr_0 = 9.9725e-04
Loss = 2.5191e-03, PNorm = 546.8752, GNorm = 0.0324, lr_0 = 9.9725e-04
Validation binary_cross_entropy = 0.097065
Epoch 4240
Loss = 5.9917e-03, PNorm = 546.9011, GNorm = 1.7477, lr_0 = 9.9725e-04
Validation binary_cross_entropy = 0.109203
Epoch 4241
Loss = 6.2863e-03, PNorm = 546.9171, GNorm = 0.0763, lr_0 = 9.9725e-04
Validation binary_cross_entropy = 0.116917
Epoch 4242
Loss = 1.7646e-02, PNorm = 546.9640, GNorm = 0.8190, lr_0 = 9.9725e-04
Validation binary_cross_entropy = 0.122984
Epoch 4243
Loss = 9.9324e-04, PNorm = 547.0083, GNorm = 0.2131, lr_0 = 9.9725e-04
Validation binary_cross_entropy = 0.118067
Epoch 4244
Loss = 7.6536e-03, PNorm = 547.0380, GNorm = 1.6150, lr_0 = 9.9725e-04
Validation binary_cross_entropy = 0.097513
Epoch 4245
Loss = 2.7262e-02, PNorm = 547.0740, GNorm = 0.0728, lr_0 = 9.9725e-04
Validation binary_cross_entropy = 0.098941
Epoch 4246
Loss = 2.3730e-02, PNorm = 547.1482, GNorm = 1.0888, lr_0 = 9.9725e-04
Validation binary_cross_entropy = 0.107360
Epoch 4247
Loss = 5.9915e-03, PNorm = 547.2033, GNorm = 0.8644, lr_0 = 9.9725e-04
Validation binary_cross_entropy = 0.084987
Epoch 4248
Loss = 2.2262e-02, PNorm = 547.2634, GNorm = 0.0909, lr_0 = 9.9725e-04
Validation binary_cross_entropy = 0.080765
Epoch 4249
Loss = 1.7415e-03, PNorm = 547.3388, GNorm = 0.1057, lr_0 = 9.9725e-04
Loss = 1.9861e-02, PNorm = 547.4120, GNorm = 0.4844, lr_0 = 9.9725e-04
Validation binary_cross_entropy = 0.129185
Epoch 4250
Loss = 5.3036e-02, PNorm = 547.4703, GNorm = 0.9051, lr_0 = 9.9724e-04
Validation binary_cross_entropy = 0.086956
Epoch 4251
Loss = 8.7431e-02, PNorm = 547.5504, GNorm = 0.5504, lr_0 = 9.9724e-04
Validation binary_cross_entropy = 0.108129
Epoch 4252
Loss = 4.4404e-02, PNorm = 547.6595, GNorm = 3.0296, lr_0 = 9.9724e-04
Validation binary_cross_entropy = 0.097008
Epoch 4253
Loss = 5.8065e-02, PNorm = 547.7587, GNorm = 0.7267, lr_0 = 9.9724e-04
Validation binary_cross_entropy = 0.114479
Epoch 4254
Loss = 1.3062e-02, PNorm = 547.8531, GNorm = 0.3969, lr_0 = 9.9724e-04
Validation binary_cross_entropy = 0.104397
Epoch 4255
Loss = 1.1794e-02, PNorm = 547.9301, GNorm = 0.1677, lr_0 = 9.9724e-04
Validation binary_cross_entropy = 0.098775
Epoch 4256
Loss = 2.6577e-02, PNorm = 548.0275, GNorm = 0.1991, lr_0 = 9.9724e-04
Validation binary_cross_entropy = 0.073163
Epoch 4257
Loss = 2.5591e-02, PNorm = 548.1001, GNorm = 0.7952, lr_0 = 9.9724e-04
Validation binary_cross_entropy = 0.069902
Epoch 4258
Loss = 2.8806e-02, PNorm = 548.2414, GNorm = 1.1611, lr_0 = 9.9724e-04
Validation binary_cross_entropy = 0.072778
Epoch 4259
Loss = 1.4907e-01, PNorm = 548.3498, GNorm = 2.0436, lr_0 = 9.9724e-04
Loss = 3.8762e-02, PNorm = 548.4520, GNorm = 0.8031, lr_0 = 9.9724e-04
Validation binary_cross_entropy = 0.063786
Epoch 4260
Loss = 2.5534e-02, PNorm = 548.5468, GNorm = 2.8228, lr_0 = 9.9724e-04
Validation binary_cross_entropy = 0.083412
Epoch 4261
Loss = 1.3275e-02, PNorm = 548.6021, GNorm = 1.1290, lr_0 = 9.9724e-04
Validation binary_cross_entropy = 0.091677
Epoch 4262
Loss = 2.9501e-02, PNorm = 548.6485, GNorm = 0.2971, lr_0 = 9.9724e-04
Validation binary_cross_entropy = 0.083933
Epoch 4263
Loss = 2.6283e-02, PNorm = 548.6922, GNorm = 0.6418, lr_0 = 9.9724e-04
Validation binary_cross_entropy = 0.088772
Epoch 4264
Loss = 3.5583e-02, PNorm = 548.7670, GNorm = 0.8984, lr_0 = 9.9723e-04
Validation binary_cross_entropy = 0.102074
Epoch 4265
Loss = 1.6809e-02, PNorm = 548.8198, GNorm = 0.0973, lr_0 = 9.9723e-04
Validation binary_cross_entropy = 0.091647
Epoch 4266
Loss = 2.5447e-02, PNorm = 548.8781, GNorm = 0.5939, lr_0 = 9.9723e-04
Validation binary_cross_entropy = 0.090192
Epoch 4267
Loss = 6.1257e-03, PNorm = 548.9290, GNorm = 1.4327, lr_0 = 9.9723e-04
Validation binary_cross_entropy = 0.095768
Epoch 4268
Loss = 2.0158e-03, PNorm = 548.9728, GNorm = 0.0449, lr_0 = 9.9723e-04
Validation binary_cross_entropy = 0.092185
Epoch 4269
Loss = 1.9734e-01, PNorm = 549.0039, GNorm = 5.0098, lr_0 = 9.9723e-04
Loss = 2.8807e-02, PNorm = 549.0645, GNorm = 0.5253, lr_0 = 9.9723e-04
Validation binary_cross_entropy = 0.097852
Epoch 4270
Loss = 3.4560e-02, PNorm = 549.1181, GNorm = 0.0721, lr_0 = 9.9723e-04
Validation binary_cross_entropy = 0.083558
Epoch 4271
Loss = 1.7674e-02, PNorm = 549.1836, GNorm = 1.5050, lr_0 = 9.9723e-04
Validation binary_cross_entropy = 0.083101
Epoch 4272
Loss = 1.3481e-01, PNorm = 549.2631, GNorm = 0.2812, lr_0 = 9.9723e-04
Validation binary_cross_entropy = 0.065357
Epoch 4273
Loss = 4.0190e-02, PNorm = 549.3623, GNorm = 0.6927, lr_0 = 9.9723e-04
Validation binary_cross_entropy = 0.071209
Epoch 4274
Loss = 2.0851e-02, PNorm = 549.4421, GNorm = 0.2120, lr_0 = 9.9723e-04
Validation binary_cross_entropy = 0.100863
Epoch 4275
Loss = 3.7430e-02, PNorm = 549.5145, GNorm = 0.3852, lr_0 = 9.9723e-04
Validation binary_cross_entropy = 0.077237
Epoch 4276
Loss = 2.7717e-02, PNorm = 549.5763, GNorm = 0.0995, lr_0 = 9.9723e-04
Validation binary_cross_entropy = 0.070188
Epoch 4277
Loss = 7.5325e-03, PNorm = 549.6406, GNorm = 0.2057, lr_0 = 9.9722e-04
Validation binary_cross_entropy = 0.082076
Epoch 4278
Loss = 8.5130e-03, PNorm = 549.7044, GNorm = 0.1652, lr_0 = 9.9722e-04
Validation binary_cross_entropy = 0.084691
Epoch 4279
Loss = 1.5288e-03, PNorm = 549.7629, GNorm = 0.0524, lr_0 = 9.9722e-04
Loss = 1.9484e-02, PNorm = 549.8292, GNorm = 0.6113, lr_0 = 9.9722e-04
Validation binary_cross_entropy = 0.085957
Epoch 4280
Loss = 2.4443e-02, PNorm = 549.8768, GNorm = 0.9583, lr_0 = 9.9722e-04
Validation binary_cross_entropy = 0.085770
Epoch 4281
Loss = 7.2528e-03, PNorm = 549.9198, GNorm = 0.0762, lr_0 = 9.9722e-04
Validation binary_cross_entropy = 0.093826
Epoch 4282
Loss = 4.4776e-02, PNorm = 549.9581, GNorm = 1.0602, lr_0 = 9.9722e-04
Validation binary_cross_entropy = 0.094600
Epoch 4283
Loss = 7.0675e-02, PNorm = 550.0065, GNorm = 2.0512, lr_0 = 9.9722e-04
Validation binary_cross_entropy = 0.091901
Epoch 4284
Loss = 4.4165e-02, PNorm = 550.0771, GNorm = 1.1329, lr_0 = 9.9722e-04
Validation binary_cross_entropy = 0.101966
Epoch 4285
Loss = 2.0180e-02, PNorm = 550.1422, GNorm = 1.3987, lr_0 = 9.9722e-04
Validation binary_cross_entropy = 0.087552
Epoch 4286
Loss = 4.3338e-03, PNorm = 550.1885, GNorm = 0.2601, lr_0 = 9.9722e-04
Validation binary_cross_entropy = 0.078919
Epoch 4287
Loss = 4.0460e-02, PNorm = 550.2428, GNorm = 0.6365, lr_0 = 9.9722e-04
Validation binary_cross_entropy = 0.093330
Epoch 4288
Loss = 2.4545e-02, PNorm = 550.2982, GNorm = 1.0543, lr_0 = 9.9722e-04
Validation binary_cross_entropy = 0.099748
Epoch 4289
Loss = 1.0939e-02, PNorm = 550.3556, GNorm = 0.5712, lr_0 = 9.9722e-04
Loss = 3.4965e-02, PNorm = 550.4001, GNorm = 7.6798, lr_0 = 9.9722e-04
Validation binary_cross_entropy = 0.148848
Epoch 4290
Loss = 4.8449e-02, PNorm = 550.4488, GNorm = 4.7218, lr_0 = 9.9721e-04
Validation binary_cross_entropy = 0.096860
Epoch 4291
Loss = 4.2299e-02, PNorm = 550.5365, GNorm = 0.1446, lr_0 = 9.9721e-04
Validation binary_cross_entropy = 0.076120
Epoch 4292
Loss = 5.9047e-02, PNorm = 550.6355, GNorm = 0.1935, lr_0 = 9.9721e-04
Validation binary_cross_entropy = 0.075551
Epoch 4293
Loss = 1.7342e-02, PNorm = 550.7220, GNorm = 0.5693, lr_0 = 9.9721e-04
Validation binary_cross_entropy = 0.080396
Epoch 4294
Loss = 3.7977e-02, PNorm = 550.8075, GNorm = 0.1072, lr_0 = 9.9721e-04
Validation binary_cross_entropy = 0.114899
Epoch 4295
Loss = 1.6218e-02, PNorm = 550.8745, GNorm = 0.3904, lr_0 = 9.9721e-04
Validation binary_cross_entropy = 0.102866
Epoch 4296
Loss = 8.9996e-03, PNorm = 550.9255, GNorm = 0.1655, lr_0 = 9.9721e-04
Validation binary_cross_entropy = 0.101398
Epoch 4297
Loss = 2.8495e-03, PNorm = 550.9814, GNorm = 0.1443, lr_0 = 9.9721e-04
Validation binary_cross_entropy = 0.105962
Epoch 4298
Loss = 5.5695e-02, PNorm = 551.0446, GNorm = 0.0333, lr_0 = 9.9721e-04
Validation binary_cross_entropy = 0.158208
Epoch 4299
Loss = 4.7879e-04, PNorm = 551.1133, GNorm = 0.0150, lr_0 = 9.9721e-04
Loss = 3.4642e-02, PNorm = 551.1736, GNorm = 0.9574, lr_0 = 9.9721e-04
Validation binary_cross_entropy = 0.136518
Epoch 4300
Loss = 5.7114e-02, PNorm = 551.2746, GNorm = 0.3737, lr_0 = 9.9721e-04
Validation binary_cross_entropy = 0.102685
Epoch 4301
Loss = 3.7858e-02, PNorm = 551.3962, GNorm = 1.7688, lr_0 = 9.9721e-04
Validation binary_cross_entropy = 0.130101
Epoch 4302
Loss = 5.6374e-02, PNorm = 551.5013, GNorm = 0.2702, lr_0 = 9.9721e-04
Validation binary_cross_entropy = 0.075828
Epoch 4303
Loss = 6.2311e-02, PNorm = 551.6172, GNorm = 0.3604, lr_0 = 9.9721e-04
Validation binary_cross_entropy = 0.082822
Epoch 4304
Loss = 4.1153e-02, PNorm = 551.7103, GNorm = 1.6378, lr_0 = 9.9720e-04
Validation binary_cross_entropy = 0.073217
Epoch 4305
Loss = 1.2530e-01, PNorm = 551.7966, GNorm = 3.4229, lr_0 = 9.9720e-04
Validation binary_cross_entropy = 0.083654
Epoch 4306
Loss = 3.6405e-02, PNorm = 551.8861, GNorm = 2.0022, lr_0 = 9.9720e-04
Validation binary_cross_entropy = 0.070542
Epoch 4307
Loss = 3.9479e-02, PNorm = 551.9599, GNorm = 0.3629, lr_0 = 9.9720e-04
Validation binary_cross_entropy = 0.069629
Epoch 4308
Loss = 2.0155e-02, PNorm = 552.0327, GNorm = 0.6259, lr_0 = 9.9720e-04
Validation binary_cross_entropy = 0.083582
Epoch 4309
Loss = 7.1386e-02, PNorm = 552.0913, GNorm = 1.6278, lr_0 = 9.9720e-04
Loss = 1.1348e-02, PNorm = 552.1402, GNorm = 0.1417, lr_0 = 9.9720e-04
Validation binary_cross_entropy = 0.079562
Epoch 4310
Loss = 4.1302e-02, PNorm = 552.1938, GNorm = 3.8252, lr_0 = 9.9720e-04
Validation binary_cross_entropy = 0.073617
Epoch 4311
Loss = 4.8075e-02, PNorm = 552.2861, GNorm = 1.4512, lr_0 = 9.9720e-04
Validation binary_cross_entropy = 0.107779
Epoch 4312
Loss = 3.2270e-02, PNorm = 552.3544, GNorm = 1.8532, lr_0 = 9.9720e-04
Validation binary_cross_entropy = 0.081233
Epoch 4313
Loss = 8.6348e-03, PNorm = 552.4210, GNorm = 0.3708, lr_0 = 9.9720e-04
Validation binary_cross_entropy = 0.109266
Epoch 4314
Loss = 5.0587e-03, PNorm = 552.4688, GNorm = 0.0140, lr_0 = 9.9720e-04
Validation binary_cross_entropy = 0.154979
Epoch 4315
Loss = 1.7971e-02, PNorm = 552.4974, GNorm = 0.3800, lr_0 = 9.9720e-04
Validation binary_cross_entropy = 0.097119
Epoch 4316
Loss = 4.7082e-02, PNorm = 552.5374, GNorm = 0.1665, lr_0 = 9.9720e-04
Validation binary_cross_entropy = 0.087779
Epoch 4317
Loss = 3.1732e-03, PNorm = 552.5916, GNorm = 0.1532, lr_0 = 9.9719e-04
Validation binary_cross_entropy = 0.079918
Epoch 4318
Loss = 2.2652e-02, PNorm = 552.6318, GNorm = 2.2618, lr_0 = 9.9719e-04
Validation binary_cross_entropy = 0.083841
Epoch 4319
Loss = 9.4606e-03, PNorm = 552.6815, GNorm = 0.4214, lr_0 = 9.9719e-04
Loss = 2.7920e-02, PNorm = 552.7374, GNorm = 0.2465, lr_0 = 9.9719e-04
Validation binary_cross_entropy = 0.074559
Epoch 4320
Loss = 1.7253e-02, PNorm = 552.7937, GNorm = 0.8471, lr_0 = 9.9719e-04
Validation binary_cross_entropy = 0.086189
Epoch 4321
Loss = 2.6156e-02, PNorm = 552.8414, GNorm = 0.1096, lr_0 = 9.9719e-04
Validation binary_cross_entropy = 0.099956
Epoch 4322
Loss = 1.1551e-02, PNorm = 552.8871, GNorm = 0.8097, lr_0 = 9.9719e-04
Validation binary_cross_entropy = 0.126619
Epoch 4323
Loss = 6.8576e-03, PNorm = 552.9214, GNorm = 0.0094, lr_0 = 9.9719e-04
Validation binary_cross_entropy = 0.130050
Epoch 4324
Loss = 1.3000e-02, PNorm = 552.9426, GNorm = 0.6542, lr_0 = 9.9719e-04
Validation binary_cross_entropy = 0.148087
Epoch 4325
Loss = 9.0663e-02, PNorm = 552.9823, GNorm = 7.2270, lr_0 = 9.9719e-04
Validation binary_cross_entropy = 0.090083
Epoch 4326
Loss = 4.3144e-02, PNorm = 553.0475, GNorm = 3.1452, lr_0 = 9.9719e-04
Validation binary_cross_entropy = 0.113957
Epoch 4327
Loss = 1.7143e-03, PNorm = 553.1320, GNorm = 0.0017, lr_0 = 9.9719e-04
Validation binary_cross_entropy = 0.114456
Epoch 4328
Loss = 6.1842e-02, PNorm = 553.1787, GNorm = 1.5767, lr_0 = 9.9719e-04
Validation binary_cross_entropy = 0.090474
Epoch 4329
Loss = 2.3554e-02, PNorm = 553.2288, GNorm = 0.9820, lr_0 = 9.9719e-04
Loss = 1.1394e-02, PNorm = 553.2866, GNorm = 0.1550, lr_0 = 9.9719e-04
Validation binary_cross_entropy = 0.093564
Epoch 4330
Loss = 2.9856e-02, PNorm = 553.3385, GNorm = 2.9766, lr_0 = 9.9718e-04
Validation binary_cross_entropy = 0.115450
Epoch 4331
Loss = 8.1784e-03, PNorm = 553.3910, GNorm = 0.2555, lr_0 = 9.9718e-04
Validation binary_cross_entropy = 0.115455
Epoch 4332
Loss = 3.0309e-02, PNorm = 553.4431, GNorm = 3.2990, lr_0 = 9.9718e-04
Validation binary_cross_entropy = 0.113194
Epoch 4333
Loss = 1.3842e-01, PNorm = 553.4967, GNorm = 0.8686, lr_0 = 9.9718e-04
Validation binary_cross_entropy = 0.080095
Epoch 4334
Loss = 5.0623e-02, PNorm = 553.6103, GNorm = 0.1454, lr_0 = 9.9718e-04
Validation binary_cross_entropy = 0.084324
Epoch 4335
Loss = 1.6976e-02, PNorm = 553.6973, GNorm = 0.4754, lr_0 = 9.9718e-04
Validation binary_cross_entropy = 0.075835
Epoch 4336
Loss = 1.9512e-02, PNorm = 553.7493, GNorm = 0.2090, lr_0 = 9.9718e-04
Validation binary_cross_entropy = 0.073339
Epoch 4337
Loss = 4.2805e-02, PNorm = 553.8322, GNorm = 1.8312, lr_0 = 9.9718e-04
Validation binary_cross_entropy = 0.092361
Epoch 4338
Loss = 1.9495e-02, PNorm = 553.9232, GNorm = 0.7427, lr_0 = 9.9718e-04
Validation binary_cross_entropy = 0.088255
Epoch 4339
Loss = 5.8868e-02, PNorm = 553.9945, GNorm = 3.6547, lr_0 = 9.9718e-04
Loss = 1.1822e-02, PNorm = 554.0627, GNorm = 0.5165, lr_0 = 9.9718e-04
Validation binary_cross_entropy = 0.099038
Epoch 4340
Loss = 3.3572e-02, PNorm = 554.1270, GNorm = 0.6001, lr_0 = 9.9718e-04
Validation binary_cross_entropy = 0.102050
Epoch 4341
Loss = 2.2320e-02, PNorm = 554.1937, GNorm = 0.3686, lr_0 = 9.9718e-04
Validation binary_cross_entropy = 0.092961
Epoch 4342
Loss = 3.0476e-02, PNorm = 554.2611, GNorm = 0.0460, lr_0 = 9.9718e-04
Validation binary_cross_entropy = 0.083061
Epoch 4343
Loss = 2.4316e-02, PNorm = 554.3358, GNorm = 0.1962, lr_0 = 9.9717e-04
Validation binary_cross_entropy = 0.090772
Epoch 4344
Loss = 5.3882e-02, PNorm = 554.3887, GNorm = 0.8299, lr_0 = 9.9717e-04
Validation binary_cross_entropy = 0.079602
Epoch 4345
Loss = 1.2000e-02, PNorm = 554.4527, GNorm = 0.1427, lr_0 = 9.9717e-04
Validation binary_cross_entropy = 0.075642
Epoch 4346
Loss = 2.3939e-02, PNorm = 554.5047, GNorm = 1.2612, lr_0 = 9.9717e-04
Validation binary_cross_entropy = 0.116829
Epoch 4347
Loss = 2.4551e-03, PNorm = 554.5579, GNorm = 0.0034, lr_0 = 9.9717e-04
Validation binary_cross_entropy = 0.150468
Epoch 4348
Loss = 8.8352e-04, PNorm = 554.5982, GNorm = 0.1377, lr_0 = 9.9717e-04
Validation binary_cross_entropy = 0.153699
Epoch 4349
Loss = 2.8807e-03, PNorm = 554.6232, GNorm = 0.1214, lr_0 = 9.9717e-04
Loss = 1.1682e-02, PNorm = 554.6580, GNorm = 0.0344, lr_0 = 9.9717e-04
Validation binary_cross_entropy = 0.196519
Epoch 4350
Loss = 2.3357e-02, PNorm = 554.6922, GNorm = 0.5774, lr_0 = 9.9717e-04
Validation binary_cross_entropy = 0.146319
Epoch 4351
Loss = 2.6440e-02, PNorm = 554.7269, GNorm = 7.3109, lr_0 = 9.9717e-04
Validation binary_cross_entropy = 0.147154
Epoch 4352
Loss = 1.9379e-02, PNorm = 554.7617, GNorm = 2.0806, lr_0 = 9.9717e-04
Validation binary_cross_entropy = 0.139579
Epoch 4353
Loss = 1.3067e-02, PNorm = 554.8165, GNorm = 0.9613, lr_0 = 9.9717e-04
Validation binary_cross_entropy = 0.117041
Epoch 4354
Loss = 1.9395e-02, PNorm = 554.9215, GNorm = 1.1060, lr_0 = 9.9717e-04
Validation binary_cross_entropy = 0.137624
Epoch 4355
Loss = 1.7163e-02, PNorm = 555.0112, GNorm = 0.3321, lr_0 = 9.9717e-04
Validation binary_cross_entropy = 0.152071
Epoch 4356
Loss = 3.4318e-02, PNorm = 555.0623, GNorm = 6.0077, lr_0 = 9.9717e-04
Validation binary_cross_entropy = 0.134105
Epoch 4357
Loss = 8.7953e-03, PNorm = 555.0992, GNorm = 0.7825, lr_0 = 9.9716e-04
Validation binary_cross_entropy = 0.123189
Epoch 4358
Loss = 5.8638e-03, PNorm = 555.1406, GNorm = 0.3031, lr_0 = 9.9716e-04
Validation binary_cross_entropy = 0.129273
Epoch 4359
Loss = 2.1012e-02, PNorm = 555.2003, GNorm = 0.9816, lr_0 = 9.9716e-04
Loss = 1.9620e-02, PNorm = 555.2618, GNorm = 0.0777, lr_0 = 9.9716e-04
Validation binary_cross_entropy = 0.159319
Epoch 4360
Loss = 3.0721e-02, PNorm = 555.2925, GNorm = 0.1458, lr_0 = 9.9716e-04
Validation binary_cross_entropy = 0.145835
Epoch 4361
Loss = 1.7294e-02, PNorm = 555.3246, GNorm = 0.5138, lr_0 = 9.9716e-04
Validation binary_cross_entropy = 0.109212
Epoch 4362
Loss = 1.3134e-01, PNorm = 555.5168, GNorm = 2.2514, lr_0 = 9.9716e-04
Validation binary_cross_entropy = 0.159679
Epoch 4363
Loss = 1.1258e-01, PNorm = 555.7182, GNorm = 1.6817, lr_0 = 9.9716e-04
Validation binary_cross_entropy = 0.181622
Epoch 4364
Loss = 1.3171e-01, PNorm = 555.8694, GNorm = 3.2620, lr_0 = 9.9716e-04
Validation binary_cross_entropy = 0.125397
Epoch 4365
Loss = 1.5305e-01, PNorm = 555.9908, GNorm = 1.4638, lr_0 = 9.9716e-04
Validation binary_cross_entropy = 0.081861
Epoch 4366
Loss = 7.7430e-02, PNorm = 556.0859, GNorm = 1.7446, lr_0 = 9.9716e-04
Validation binary_cross_entropy = 0.142403
Epoch 4367
Loss = 5.7845e-02, PNorm = 556.1964, GNorm = 3.1857, lr_0 = 9.9716e-04
Validation binary_cross_entropy = 0.072274
Epoch 4368
Loss = 3.2605e-02, PNorm = 556.2837, GNorm = 1.4954, lr_0 = 9.9716e-04
Validation binary_cross_entropy = 0.085453
Epoch 4369
Loss = 2.5200e-02, PNorm = 556.3876, GNorm = 0.8390, lr_0 = 9.9716e-04
Loss = 3.2677e-02, PNorm = 556.4780, GNorm = 0.9282, lr_0 = 9.9715e-04
Validation binary_cross_entropy = 0.124973
Epoch 4370
Loss = 5.2657e-02, PNorm = 556.5410, GNorm = 1.1368, lr_0 = 9.9715e-04
Validation binary_cross_entropy = 0.115619
Epoch 4371
Loss = 3.1728e-02, PNorm = 556.6108, GNorm = 1.5042, lr_0 = 9.9715e-04
Validation binary_cross_entropy = 0.204408
Epoch 4372
Loss = 2.4230e-02, PNorm = 556.6831, GNorm = 0.1728, lr_0 = 9.9715e-04
Validation binary_cross_entropy = 0.178209
Epoch 4373
Loss = 2.0977e-01, PNorm = 556.7477, GNorm = 2.8465, lr_0 = 9.9715e-04
Validation binary_cross_entropy = 0.112857
Epoch 4374
Loss = 2.0436e-02, PNorm = 556.8431, GNorm = 1.6096, lr_0 = 9.9715e-04
Validation binary_cross_entropy = 0.148634
Epoch 4375
Loss = 4.2254e-02, PNorm = 556.9197, GNorm = 1.5133, lr_0 = 9.9715e-04
Validation binary_cross_entropy = 0.098722
Epoch 4376
Loss = 1.6853e-02, PNorm = 556.9732, GNorm = 0.7255, lr_0 = 9.9715e-04
Validation binary_cross_entropy = 0.113177
Epoch 4377
Loss = 2.5194e-02, PNorm = 557.0456, GNorm = 0.5527, lr_0 = 9.9715e-04
Validation binary_cross_entropy = 0.133459
Epoch 4378
Loss = 4.1725e-02, PNorm = 557.1038, GNorm = 3.4720, lr_0 = 9.9715e-04
Validation binary_cross_entropy = 0.123999
Epoch 4379
Loss = 1.8672e-02, PNorm = 557.1530, GNorm = 2.1841, lr_0 = 9.9715e-04
Loss = 3.7217e-02, PNorm = 557.2125, GNorm = 1.5295, lr_0 = 9.9715e-04
Validation binary_cross_entropy = 0.144219
Epoch 4380
Loss = 2.9746e-03, PNorm = 557.2591, GNorm = 0.1535, lr_0 = 9.9715e-04
Validation binary_cross_entropy = 0.148160
Epoch 4381
Loss = 6.4440e-02, PNorm = 557.2817, GNorm = 2.7533, lr_0 = 9.9715e-04
Validation binary_cross_entropy = 0.093009
Epoch 4382
Loss = 3.0535e-02, PNorm = 557.3191, GNorm = 0.0911, lr_0 = 9.9715e-04
Validation binary_cross_entropy = 0.090465
Epoch 4383
Loss = 2.5442e-02, PNorm = 557.3744, GNorm = 0.2949, lr_0 = 9.9714e-04
Validation binary_cross_entropy = 0.083786
Epoch 4384
Loss = 1.7804e-02, PNorm = 557.4246, GNorm = 1.9961, lr_0 = 9.9714e-04
Validation binary_cross_entropy = 0.096334
Epoch 4385
Loss = 1.9992e-02, PNorm = 557.4695, GNorm = 0.3548, lr_0 = 9.9714e-04
Validation binary_cross_entropy = 0.090720
Epoch 4386
Loss = 3.2091e-02, PNorm = 557.5259, GNorm = 0.1751, lr_0 = 9.9714e-04
Validation binary_cross_entropy = 0.087690
Epoch 4387
Loss = 7.6393e-03, PNorm = 557.5676, GNorm = 0.2586, lr_0 = 9.9714e-04
Validation binary_cross_entropy = 0.074402
Epoch 4388
Loss = 1.2404e-02, PNorm = 557.6263, GNorm = 1.1736, lr_0 = 9.9714e-04
Validation binary_cross_entropy = 0.071878
Epoch 4389
Loss = 1.6226e-02, PNorm = 557.6863, GNorm = 0.6072, lr_0 = 9.9714e-04
Loss = 2.3971e-02, PNorm = 557.7450, GNorm = 2.4631, lr_0 = 9.9714e-04
Validation binary_cross_entropy = 0.090297
Epoch 4390
Loss = 3.2906e-02, PNorm = 557.7984, GNorm = 0.6746, lr_0 = 9.9714e-04
Validation binary_cross_entropy = 0.086495
Epoch 4391
Loss = 1.9991e-02, PNorm = 557.8460, GNorm = 0.3012, lr_0 = 9.9714e-04
Validation binary_cross_entropy = 0.101141
Epoch 4392
Loss = 2.5273e-02, PNorm = 557.9035, GNorm = 0.0473, lr_0 = 9.9714e-04
Validation binary_cross_entropy = 0.137912
Epoch 4393
Loss = 6.0692e-02, PNorm = 557.9469, GNorm = 0.3805, lr_0 = 9.9714e-04
Validation binary_cross_entropy = 0.090848
Epoch 4394
Loss = 3.0824e-02, PNorm = 557.9885, GNorm = 0.6188, lr_0 = 9.9714e-04
Validation binary_cross_entropy = 0.104998
Epoch 4395
Loss = 5.3482e-02, PNorm = 558.0525, GNorm = 0.8051, lr_0 = 9.9714e-04
Validation binary_cross_entropy = 0.089025
Epoch 4396
Loss = 3.0201e-02, PNorm = 558.1170, GNorm = 0.8625, lr_0 = 9.9714e-04
Validation binary_cross_entropy = 0.104699
Epoch 4397
Loss = 2.7994e-02, PNorm = 558.1873, GNorm = 0.0646, lr_0 = 9.9713e-04
Validation binary_cross_entropy = 0.105906
Epoch 4398
Loss = 2.9845e-03, PNorm = 558.2354, GNorm = 0.0383, lr_0 = 9.9713e-04
Validation binary_cross_entropy = 0.091229
Epoch 4399
Loss = 2.3852e-02, PNorm = 558.2816, GNorm = 1.3063, lr_0 = 9.9713e-04
Loss = 2.6331e-02, PNorm = 558.3525, GNorm = 1.0820, lr_0 = 9.9713e-04
Validation binary_cross_entropy = 0.101632
Epoch 4400
Loss = 2.3814e-02, PNorm = 558.4460, GNorm = 0.1795, lr_0 = 9.9713e-04
Validation binary_cross_entropy = 0.120089
Epoch 4401
Loss = 3.7316e-01, PNorm = 558.5352, GNorm = 0.2711, lr_0 = 9.9713e-04
Validation binary_cross_entropy = 0.085849
Epoch 4402
Loss = 5.3990e-02, PNorm = 558.7101, GNorm = 1.9841, lr_0 = 9.9713e-04
Validation binary_cross_entropy = 0.096275
Epoch 4403
Loss = 6.5282e-02, PNorm = 558.8467, GNorm = 2.6319, lr_0 = 9.9713e-04
Validation binary_cross_entropy = 0.104602
Epoch 4404
Loss = 8.8750e-02, PNorm = 558.9369, GNorm = 2.3110, lr_0 = 9.9713e-04
Validation binary_cross_entropy = 0.088054
Epoch 4405
Loss = 4.6623e-02, PNorm = 559.0082, GNorm = 3.3714, lr_0 = 9.9713e-04
Validation binary_cross_entropy = 0.113998
Epoch 4406
Loss = 4.9735e-02, PNorm = 559.0780, GNorm = 0.9846, lr_0 = 9.9713e-04
Validation binary_cross_entropy = 0.101251
Epoch 4407
Loss = 7.1494e-02, PNorm = 559.1301, GNorm = 3.2504, lr_0 = 9.9713e-04
Validation binary_cross_entropy = 0.106945
Epoch 4408
Loss = 2.1614e-02, PNorm = 559.2008, GNorm = 0.7165, lr_0 = 9.9713e-04
Validation binary_cross_entropy = 0.106403
Epoch 4409
Loss = 4.4592e-02, PNorm = 559.2466, GNorm = 1.7539, lr_0 = 9.9713e-04
Loss = 2.2869e-02, PNorm = 559.2851, GNorm = 0.3054, lr_0 = 9.9712e-04
Validation binary_cross_entropy = 0.098547
Epoch 4410
Loss = 2.2956e-02, PNorm = 559.3213, GNorm = 0.0954, lr_0 = 9.9712e-04
Validation binary_cross_entropy = 0.121232
Epoch 4411
Loss = 4.5027e-02, PNorm = 559.3578, GNorm = 2.5697, lr_0 = 9.9712e-04
Validation binary_cross_entropy = 0.102216
Epoch 4412
Loss = 1.3796e-02, PNorm = 559.4085, GNorm = 0.1363, lr_0 = 9.9712e-04
Validation binary_cross_entropy = 0.138591
Epoch 4413
Loss = 4.4447e-02, PNorm = 559.4501, GNorm = 0.9827, lr_0 = 9.9712e-04
Validation binary_cross_entropy = 0.120408
Epoch 4414
Loss = 4.8642e-02, PNorm = 559.4834, GNorm = 0.1569, lr_0 = 9.9712e-04
Validation binary_cross_entropy = 0.088078
Epoch 4415
Loss = 2.9745e-02, PNorm = 559.5253, GNorm = 1.4643, lr_0 = 9.9712e-04
Validation binary_cross_entropy = 0.101633
Epoch 4416
Loss = 2.6966e-02, PNorm = 559.5780, GNorm = 1.1596, lr_0 = 9.9712e-04
Validation binary_cross_entropy = 0.115407
Epoch 4417
Loss = 2.2639e-03, PNorm = 559.6194, GNorm = 0.1318, lr_0 = 9.9712e-04
Validation binary_cross_entropy = 0.113321
Epoch 4418
Loss = 7.2316e-03, PNorm = 559.6434, GNorm = 0.0336, lr_0 = 9.9712e-04
Validation binary_cross_entropy = 0.115095
Epoch 4419
Loss = 1.1933e-02, PNorm = 559.6686, GNorm = 0.4006, lr_0 = 9.9712e-04
Loss = 1.2357e-02, PNorm = 559.7014, GNorm = 0.0468, lr_0 = 9.9712e-04
Validation binary_cross_entropy = 0.126182
Epoch 4420
Loss = 2.1094e-02, PNorm = 559.7400, GNorm = 1.9568, lr_0 = 9.9712e-04
Validation binary_cross_entropy = 0.198594
Epoch 4421
Loss = 2.6244e-02, PNorm = 559.7801, GNorm = 0.0397, lr_0 = 9.9712e-04
Validation binary_cross_entropy = 0.147181
Epoch 4422
Loss = 2.0719e-02, PNorm = 559.8128, GNorm = 0.0656, lr_0 = 9.9712e-04
Validation binary_cross_entropy = 0.123390
Epoch 4423
Loss = 1.2222e-02, PNorm = 559.8377, GNorm = 1.8005, lr_0 = 9.9711e-04
Validation binary_cross_entropy = 0.109165
Epoch 4424
Loss = 2.1607e-02, PNorm = 559.8785, GNorm = 1.2379, lr_0 = 9.9711e-04
Validation binary_cross_entropy = 0.120085
Epoch 4425
Loss = 2.1122e-02, PNorm = 559.9309, GNorm = 0.0646, lr_0 = 9.9711e-04
Validation binary_cross_entropy = 0.137882
Epoch 4426
Loss = 1.3404e-02, PNorm = 559.9710, GNorm = 0.0195, lr_0 = 9.9711e-04
Validation binary_cross_entropy = 0.152519
Epoch 4427
Loss = 1.0482e-02, PNorm = 560.0134, GNorm = 0.1631, lr_0 = 9.9711e-04
Validation binary_cross_entropy = 0.191278
Epoch 4428
Loss = 4.8032e-03, PNorm = 560.0531, GNorm = 0.3422, lr_0 = 9.9711e-04
Validation binary_cross_entropy = 0.114711
Epoch 4429
Loss = 4.6783e-03, PNorm = 560.0850, GNorm = 0.2092, lr_0 = 9.9711e-04
Loss = 1.3934e-02, PNorm = 560.1351, GNorm = 0.4146, lr_0 = 9.9711e-04
Validation binary_cross_entropy = 0.106076
Epoch 4430
Loss = 3.7937e-02, PNorm = 560.1852, GNorm = 0.7217, lr_0 = 9.9711e-04
Validation binary_cross_entropy = 0.110406
Epoch 4431
Loss = 5.9634e-02, PNorm = 560.2584, GNorm = 0.5125, lr_0 = 9.9711e-04
Validation binary_cross_entropy = 0.107390
Epoch 4432
Loss = 4.0255e-02, PNorm = 560.3470, GNorm = 0.0979, lr_0 = 9.9711e-04
Validation binary_cross_entropy = 0.108273
Epoch 4433
Loss = 2.4826e-02, PNorm = 560.4104, GNorm = 0.8227, lr_0 = 9.9711e-04
Validation binary_cross_entropy = 0.090482
Epoch 4434
Loss = 3.0634e-02, PNorm = 560.4623, GNorm = 0.8680, lr_0 = 9.9711e-04
Validation binary_cross_entropy = 0.089915
Epoch 4435
Loss = 3.9756e-02, PNorm = 560.5314, GNorm = 0.6840, lr_0 = 9.9711e-04
Validation binary_cross_entropy = 0.111774
Epoch 4436
Loss = 2.1890e-02, PNorm = 560.5926, GNorm = 0.1532, lr_0 = 9.9710e-04
Validation binary_cross_entropy = 0.096512
Epoch 4437
Loss = 1.4813e-02, PNorm = 560.6359, GNorm = 0.0716, lr_0 = 9.9710e-04
Validation binary_cross_entropy = 0.101145
Epoch 4438
Loss = 1.1574e-02, PNorm = 560.6926, GNorm = 0.6667, lr_0 = 9.9710e-04
Validation binary_cross_entropy = 0.098879
Epoch 4439
Loss = 8.3383e-03, PNorm = 560.7360, GNorm = 0.6447, lr_0 = 9.9710e-04
Loss = 3.4437e-02, PNorm = 560.7733, GNorm = 0.1395, lr_0 = 9.9710e-04
Validation binary_cross_entropy = 0.110871
Epoch 4440
Loss = 3.1623e-02, PNorm = 560.8055, GNorm = 0.2933, lr_0 = 9.9710e-04
Validation binary_cross_entropy = 0.098224
Epoch 4441
Loss = 1.3917e-02, PNorm = 560.8336, GNorm = 2.0839, lr_0 = 9.9710e-04
Validation binary_cross_entropy = 0.091178
Epoch 4442
Loss = 5.0914e-02, PNorm = 560.8778, GNorm = 1.0753, lr_0 = 9.9710e-04
Validation binary_cross_entropy = 0.097944
Epoch 4443
Loss = 2.0073e-02, PNorm = 560.9141, GNorm = 0.2094, lr_0 = 9.9710e-04
Validation binary_cross_entropy = 0.082890
Epoch 4444
Loss = 1.7233e-02, PNorm = 560.9579, GNorm = 0.4803, lr_0 = 9.9710e-04
Validation binary_cross_entropy = 0.072222
Epoch 4445
Loss = 2.2980e-02, PNorm = 561.0277, GNorm = 0.1953, lr_0 = 9.9710e-04
Validation binary_cross_entropy = 0.075382
Epoch 4446
Loss = 6.4935e-02, PNorm = 561.0792, GNorm = 1.7926, lr_0 = 9.9710e-04
Validation binary_cross_entropy = 0.076807
Epoch 4447
Loss = 9.9707e-03, PNorm = 561.1280, GNorm = 0.7202, lr_0 = 9.9710e-04
Validation binary_cross_entropy = 0.080878
Epoch 4448
Loss = 4.6352e-02, PNorm = 561.1687, GNorm = 1.6893, lr_0 = 9.9710e-04
Validation binary_cross_entropy = 0.079398
Epoch 4449
Loss = 7.7862e-03, PNorm = 561.2110, GNorm = 0.2477, lr_0 = 9.9710e-04
Loss = 2.6764e-02, PNorm = 561.2646, GNorm = 2.1823, lr_0 = 9.9709e-04
Validation binary_cross_entropy = 0.103503
Epoch 4450
Loss = 3.0706e-02, PNorm = 561.3158, GNorm = 0.6041, lr_0 = 9.9709e-04
Validation binary_cross_entropy = 0.079661
Epoch 4451
Loss = 1.8797e-02, PNorm = 561.3729, GNorm = 1.2027, lr_0 = 9.9709e-04
Validation binary_cross_entropy = 0.098818
Epoch 4452
Loss = 4.8044e-02, PNorm = 561.4185, GNorm = 0.2144, lr_0 = 9.9709e-04
Validation binary_cross_entropy = 0.087570
Epoch 4453
Loss = 2.3736e-02, PNorm = 561.4737, GNorm = 3.8099, lr_0 = 9.9709e-04
Validation binary_cross_entropy = 0.110254
Epoch 4454
Loss = 6.9358e-02, PNorm = 561.5392, GNorm = 0.7162, lr_0 = 9.9709e-04
Validation binary_cross_entropy = 0.100126
Epoch 4455
Loss = 3.6236e-02, PNorm = 561.6029, GNorm = 1.5888, lr_0 = 9.9709e-04
Validation binary_cross_entropy = 0.117834
Epoch 4456
Loss = 6.6481e-02, PNorm = 561.6593, GNorm = 0.4346, lr_0 = 9.9709e-04
Validation binary_cross_entropy = 0.082394
Epoch 4457
Loss = 2.0350e-02, PNorm = 561.7399, GNorm = 1.1421, lr_0 = 9.9709e-04
Validation binary_cross_entropy = 0.090008
Epoch 4458
Loss = 7.0372e-03, PNorm = 561.8125, GNorm = 0.2087, lr_0 = 9.9709e-04
Validation binary_cross_entropy = 0.101929
Epoch 4459
Loss = 4.1724e-02, PNorm = 561.8683, GNorm = 1.4451, lr_0 = 9.9709e-04
Loss = 1.5563e-02, PNorm = 561.9099, GNorm = 1.9866, lr_0 = 9.9709e-04
Validation binary_cross_entropy = 0.092904
Epoch 4460
Loss = 6.4801e-03, PNorm = 561.9436, GNorm = 0.0432, lr_0 = 9.9709e-04
Validation binary_cross_entropy = 0.097764
Epoch 4461
Loss = 2.3746e-02, PNorm = 561.9821, GNorm = 0.0378, lr_0 = 9.9709e-04
Validation binary_cross_entropy = 0.129741
Epoch 4462
Loss = 2.7285e-02, PNorm = 562.0163, GNorm = 0.0489, lr_0 = 9.9708e-04
Validation binary_cross_entropy = 0.095774
Epoch 4463
Loss = 1.9284e-02, PNorm = 562.0512, GNorm = 0.2463, lr_0 = 9.9708e-04
Validation binary_cross_entropy = 0.096940
Epoch 4464
Loss = 2.5412e-02, PNorm = 562.1232, GNorm = 0.4288, lr_0 = 9.9708e-04
Validation binary_cross_entropy = 0.120866
Epoch 4465
Loss = 5.5332e-02, PNorm = 562.1772, GNorm = 0.1180, lr_0 = 9.9708e-04
Validation binary_cross_entropy = 0.072438
Epoch 4466
Loss = 3.4990e-02, PNorm = 562.2353, GNorm = 0.4874, lr_0 = 9.9708e-04
Validation binary_cross_entropy = 0.074865
Epoch 4467
Loss = 1.6003e-02, PNorm = 562.2909, GNorm = 0.3841, lr_0 = 9.9708e-04
Validation binary_cross_entropy = 0.082694
Epoch 4468
Loss = 1.8586e-02, PNorm = 562.3574, GNorm = 0.0437, lr_0 = 9.9708e-04
Validation binary_cross_entropy = 0.095810
Epoch 4469
Loss = 2.7994e-03, PNorm = 562.4147, GNorm = 0.1235, lr_0 = 9.9708e-04
Loss = 1.3328e-02, PNorm = 562.4503, GNorm = 0.2798, lr_0 = 9.9708e-04
Validation binary_cross_entropy = 0.088142
Epoch 4470
Loss = 1.3770e-02, PNorm = 562.4946, GNorm = 1.2851, lr_0 = 9.9708e-04
Validation binary_cross_entropy = 0.092114
Epoch 4471
Loss = 1.7672e-02, PNorm = 562.5447, GNorm = 0.0958, lr_0 = 9.9708e-04
Validation binary_cross_entropy = 0.130875
Epoch 4472
Loss = 1.2319e-02, PNorm = 562.5927, GNorm = 0.4112, lr_0 = 9.9708e-04
Validation binary_cross_entropy = 0.186186
Epoch 4473
Loss = 1.0921e-02, PNorm = 562.6287, GNorm = 0.1125, lr_0 = 9.9708e-04
Validation binary_cross_entropy = 0.175351
Epoch 4474
Loss = 5.2825e-02, PNorm = 562.6612, GNorm = 0.5514, lr_0 = 9.9708e-04
Validation binary_cross_entropy = 0.147613
Epoch 4475
Loss = 1.0356e-02, PNorm = 562.7487, GNorm = 0.6823, lr_0 = 9.9708e-04
Validation binary_cross_entropy = 0.201609
Epoch 4476
Loss = 8.2162e-03, PNorm = 562.8188, GNorm = 0.2617, lr_0 = 9.9707e-04
Validation binary_cross_entropy = 0.168522
Epoch 4477
Loss = 3.7794e-03, PNorm = 562.8622, GNorm = 0.2729, lr_0 = 9.9707e-04
Validation binary_cross_entropy = 0.134891
Epoch 4478
Loss = 5.6937e-02, PNorm = 562.9285, GNorm = 1.8484, lr_0 = 9.9707e-04
Validation binary_cross_entropy = 0.132175
Epoch 4479
Loss = 5.7483e-03, PNorm = 562.9928, GNorm = 0.1237, lr_0 = 9.9707e-04
Loss = 6.7327e-02, PNorm = 563.0507, GNorm = 5.0952, lr_0 = 9.9707e-04
Validation binary_cross_entropy = 0.095747
Epoch 4480
Loss = 3.4905e-02, PNorm = 563.1093, GNorm = 0.4163, lr_0 = 9.9707e-04
Validation binary_cross_entropy = 0.065320
Epoch 4481
Loss = 4.0080e-02, PNorm = 563.1918, GNorm = 0.4067, lr_0 = 9.9707e-04
Validation binary_cross_entropy = 0.070079
Epoch 4482
Loss = 3.5149e-02, PNorm = 563.2701, GNorm = 0.6683, lr_0 = 9.9707e-04
Validation binary_cross_entropy = 0.078837
Epoch 4483
Loss = 3.6863e-02, PNorm = 563.3344, GNorm = 0.5079, lr_0 = 9.9707e-04
Validation binary_cross_entropy = 0.077214
Epoch 4484
Loss = 5.6455e-02, PNorm = 563.4033, GNorm = 0.4554, lr_0 = 9.9707e-04
Validation binary_cross_entropy = 0.072586
Epoch 4485
Loss = 1.4853e-02, PNorm = 563.4825, GNorm = 0.8703, lr_0 = 9.9707e-04
Validation binary_cross_entropy = 0.081204
Epoch 4486
Loss = 1.5182e-02, PNorm = 563.5482, GNorm = 0.3667, lr_0 = 9.9707e-04
Validation binary_cross_entropy = 0.083233
Epoch 4487
Loss = 9.6706e-02, PNorm = 563.5989, GNorm = 9.2556, lr_0 = 9.9707e-04
Validation binary_cross_entropy = 0.080542
Epoch 4488
Loss = 7.0835e-03, PNorm = 563.6524, GNorm = 0.1323, lr_0 = 9.9707e-04
Validation binary_cross_entropy = 0.075347
Epoch 4489
Loss = 3.0527e-02, PNorm = 563.7049, GNorm = 1.3909, lr_0 = 9.9707e-04
Loss = 2.4899e-02, PNorm = 563.7608, GNorm = 1.1823, lr_0 = 9.9706e-04
Validation binary_cross_entropy = 0.083648
Epoch 4490
Loss = 3.3282e-02, PNorm = 563.8060, GNorm = 0.8256, lr_0 = 9.9706e-04
Validation binary_cross_entropy = 0.075865
Epoch 4491
Loss = 2.2334e-02, PNorm = 563.8537, GNorm = 2.6451, lr_0 = 9.9706e-04
Validation binary_cross_entropy = 0.081016
Epoch 4492
Loss = 5.1427e-03, PNorm = 563.9101, GNorm = 0.0305, lr_0 = 9.9706e-04
Validation binary_cross_entropy = 0.106075
Epoch 4493
Loss = 4.7713e-02, PNorm = 563.9457, GNorm = 1.3507, lr_0 = 9.9706e-04
Validation binary_cross_entropy = 0.090822
Epoch 4494
Loss = 9.2069e-03, PNorm = 563.9842, GNorm = 0.6374, lr_0 = 9.9706e-04
Validation binary_cross_entropy = 0.080535
Epoch 4495
Loss = 4.1344e-02, PNorm = 564.0436, GNorm = 0.3841, lr_0 = 9.9706e-04
Validation binary_cross_entropy = 0.071999
Epoch 4496
Loss = 1.8747e-02, PNorm = 564.0986, GNorm = 1.4225, lr_0 = 9.9706e-04
Validation binary_cross_entropy = 0.070680
Epoch 4497
Loss = 2.2089e-02, PNorm = 564.1648, GNorm = 1.8596, lr_0 = 9.9706e-04
Validation binary_cross_entropy = 0.088404
Epoch 4498
Loss = 2.8441e-02, PNorm = 564.2210, GNorm = 0.0312, lr_0 = 9.9706e-04
Validation binary_cross_entropy = 0.079288
Epoch 4499
Loss = 1.5088e-03, PNorm = 564.2596, GNorm = 0.0457, lr_0 = 9.9706e-04
Loss = 7.2130e-03, PNorm = 564.3134, GNorm = 0.2308, lr_0 = 9.9706e-04
Validation binary_cross_entropy = 0.086372
Epoch 4500
Loss = 1.4224e-02, PNorm = 564.3719, GNorm = 0.0488, lr_0 = 9.9706e-04
Validation binary_cross_entropy = 0.113190
Epoch 4501
Loss = 1.0152e-02, PNorm = 564.4201, GNorm = 0.1422, lr_0 = 9.9706e-04
Validation binary_cross_entropy = 0.117908
Epoch 4502
Loss = 6.2506e-03, PNorm = 564.4495, GNorm = 0.3898, lr_0 = 9.9705e-04
Validation binary_cross_entropy = 0.113183
Epoch 4503
Loss = 3.1510e-02, PNorm = 564.4715, GNorm = 0.2525, lr_0 = 9.9705e-04
Validation binary_cross_entropy = 0.074573
Epoch 4504
Loss = 1.0771e-01, PNorm = 564.6992, GNorm = 1.6717, lr_0 = 9.9705e-04
Validation binary_cross_entropy = 0.211283
Epoch 4505
Loss = 1.2240e-01, PNorm = 564.9710, GNorm = 2.8707, lr_0 = 9.9705e-04
Validation binary_cross_entropy = 0.065637
Epoch 4506
Loss = 3.8081e-02, PNorm = 565.1488, GNorm = 0.6278, lr_0 = 9.9705e-04
Validation binary_cross_entropy = 0.171211
Epoch 4507
Loss = 8.3315e-02, PNorm = 565.2775, GNorm = 0.6086, lr_0 = 9.9705e-04
Validation binary_cross_entropy = 0.082466
Epoch 4508
Loss = 5.7534e-02, PNorm = 565.3826, GNorm = 0.8619, lr_0 = 9.9705e-04
Validation binary_cross_entropy = 0.109098
Epoch 4509
Loss = 8.3809e-01, PNorm = 565.4805, GNorm = 28.5741, lr_0 = 9.9705e-04
Loss = 7.2295e-02, PNorm = 565.6193, GNorm = 3.8370, lr_0 = 9.9705e-04
Validation binary_cross_entropy = 0.119126
Epoch 4510
Loss = 1.4446e-01, PNorm = 565.7533, GNorm = 4.1033, lr_0 = 9.9705e-04
Validation binary_cross_entropy = 0.083880
Epoch 4511
Loss = 1.0637e-01, PNorm = 565.8844, GNorm = 3.5236, lr_0 = 9.9705e-04
Validation binary_cross_entropy = 0.100867
Epoch 4512
Loss = 5.6765e-02, PNorm = 565.9762, GNorm = 1.7687, lr_0 = 9.9705e-04
Validation binary_cross_entropy = 0.082334
Epoch 4513
Loss = 7.9881e-02, PNorm = 566.0587, GNorm = 1.7489, lr_0 = 9.9705e-04
Validation binary_cross_entropy = 0.133324
Epoch 4514
Loss = 3.7547e-02, PNorm = 566.1197, GNorm = 1.9412, lr_0 = 9.9705e-04
Validation binary_cross_entropy = 0.115150
Epoch 4515
Loss = 4.4302e-02, PNorm = 566.1688, GNorm = 1.0606, lr_0 = 9.9705e-04
Validation binary_cross_entropy = 0.123857
Epoch 4516
Loss = 6.8446e-02, PNorm = 566.2412, GNorm = 2.1946, lr_0 = 9.9704e-04
Validation binary_cross_entropy = 0.087600
Epoch 4517
Loss = 5.9631e-02, PNorm = 566.3177, GNorm = 0.5060, lr_0 = 9.9704e-04
Validation binary_cross_entropy = 0.096101
Epoch 4518
Loss = 3.9415e-02, PNorm = 566.3971, GNorm = 1.0193, lr_0 = 9.9704e-04
Validation binary_cross_entropy = 0.090033
Epoch 4519
Loss = 2.2555e-02, PNorm = 566.4630, GNorm = 0.6695, lr_0 = 9.9704e-04
Loss = 6.4414e-02, PNorm = 566.5279, GNorm = 1.7245, lr_0 = 9.9704e-04
Validation binary_cross_entropy = 0.092309
Epoch 4520
Loss = 3.4585e-02, PNorm = 566.5830, GNorm = 3.8563, lr_0 = 9.9704e-04
Validation binary_cross_entropy = 0.066558
Epoch 4521
Loss = 5.0787e-02, PNorm = 566.6460, GNorm = 0.3958, lr_0 = 9.9704e-04
Validation binary_cross_entropy = 0.065263
Epoch 4522
Loss = 2.3356e-02, PNorm = 566.7115, GNorm = 0.4906, lr_0 = 9.9704e-04
Validation binary_cross_entropy = 0.081913
Epoch 4523
Loss = 2.6785e-02, PNorm = 566.7592, GNorm = 0.6064, lr_0 = 9.9704e-04
Validation binary_cross_entropy = 0.082614
Epoch 4524
Loss = 3.7234e-02, PNorm = 566.7955, GNorm = 1.4363, lr_0 = 9.9704e-04
Validation binary_cross_entropy = 0.068532
Epoch 4525
Loss = 1.9325e-02, PNorm = 566.8376, GNorm = 2.1043, lr_0 = 9.9704e-04
Validation binary_cross_entropy = 0.079927
Epoch 4526
Loss = 4.1673e-03, PNorm = 566.8932, GNorm = 0.7089, lr_0 = 9.9704e-04
Validation binary_cross_entropy = 0.089757
Epoch 4527
Loss = 2.0676e-02, PNorm = 566.9304, GNorm = 1.5311, lr_0 = 9.9704e-04
Validation binary_cross_entropy = 0.079991
Epoch 4528
Loss = 5.3890e-03, PNorm = 566.9682, GNorm = 0.3183, lr_0 = 9.9704e-04
Validation binary_cross_entropy = 0.082305
Epoch 4529
Loss = 4.0242e-02, PNorm = 567.0133, GNorm = 1.3904, lr_0 = 9.9703e-04
Loss = 1.2999e-02, PNorm = 567.0648, GNorm = 0.6163, lr_0 = 9.9703e-04
Validation binary_cross_entropy = 0.078162
Epoch 4530
Loss = 2.6919e-02, PNorm = 567.1166, GNorm = 0.1824, lr_0 = 9.9703e-04
Validation binary_cross_entropy = 0.080606
Epoch 4531
Loss = 3.5269e-02, PNorm = 567.1701, GNorm = 0.9624, lr_0 = 9.9703e-04
Validation binary_cross_entropy = 0.094554
Epoch 4532
Loss = 2.4936e-02, PNorm = 567.2147, GNorm = 0.1343, lr_0 = 9.9703e-04
Validation binary_cross_entropy = 0.089315
Epoch 4533
Loss = 9.2019e-03, PNorm = 567.2511, GNorm = 0.8063, lr_0 = 9.9703e-04
Validation binary_cross_entropy = 0.085796
Epoch 4534
Loss = 3.6950e-02, PNorm = 567.2738, GNorm = 0.1759, lr_0 = 9.9703e-04
Validation binary_cross_entropy = 0.075067
Epoch 4535
Loss = 2.9333e-02, PNorm = 567.3222, GNorm = 0.2318, lr_0 = 9.9703e-04
Validation binary_cross_entropy = 0.085156
Epoch 4536
Loss = 2.3732e-02, PNorm = 567.3826, GNorm = 3.5602, lr_0 = 9.9703e-04
Validation binary_cross_entropy = 0.091008
Epoch 4537
Loss = 2.2837e-02, PNorm = 567.4366, GNorm = 0.6926, lr_0 = 9.9703e-04
Validation binary_cross_entropy = 0.093941
Epoch 4538
Loss = 3.3194e-02, PNorm = 567.5026, GNorm = 0.1297, lr_0 = 9.9703e-04
Validation binary_cross_entropy = 0.107760
Epoch 4539
Loss = 1.0197e-02, PNorm = 567.5608, GNorm = 0.9129, lr_0 = 9.9703e-04
Loss = 1.3125e-01, PNorm = 567.6079, GNorm = 3.1531, lr_0 = 9.9703e-04
Validation binary_cross_entropy = 0.075081
Epoch 4540
Loss = 6.0301e-02, PNorm = 567.6855, GNorm = 4.1708, lr_0 = 9.9703e-04
Validation binary_cross_entropy = 0.089948
Epoch 4541
Loss = 3.0717e-02, PNorm = 567.7368, GNorm = 2.2674, lr_0 = 9.9703e-04
Validation binary_cross_entropy = 0.072837
Epoch 4542
Loss = 3.5166e-02, PNorm = 567.7964, GNorm = 0.1782, lr_0 = 9.9702e-04
Validation binary_cross_entropy = 0.078034
Epoch 4543
Loss = 5.8648e-02, PNorm = 567.8464, GNorm = 0.1436, lr_0 = 9.9702e-04
Validation binary_cross_entropy = 0.076821
Epoch 4544
Loss = 1.7829e-02, PNorm = 567.8904, GNorm = 0.9739, lr_0 = 9.9702e-04
Validation binary_cross_entropy = 0.067762
Epoch 4545
Loss = 3.3603e-02, PNorm = 567.9347, GNorm = 0.4969, lr_0 = 9.9702e-04
Validation binary_cross_entropy = 0.077654
Epoch 4546
Loss = 9.2669e-03, PNorm = 567.9970, GNorm = 0.2089, lr_0 = 9.9702e-04
Validation binary_cross_entropy = 0.082840
Epoch 4547
Loss = 3.1666e-03, PNorm = 568.0383, GNorm = 0.4421, lr_0 = 9.9702e-04
Validation binary_cross_entropy = 0.076566
Epoch 4548
Loss = 6.6197e-03, PNorm = 568.0782, GNorm = 0.0742, lr_0 = 9.9702e-04
Validation binary_cross_entropy = 0.079245
Epoch 4549
Loss = 1.8342e-03, PNorm = 568.1244, GNorm = 0.1255, lr_0 = 9.9702e-04
Loss = 1.3955e-02, PNorm = 568.1649, GNorm = 0.6774, lr_0 = 9.9702e-04
Validation binary_cross_entropy = 0.084315
Epoch 4550
Loss = 1.4670e-02, PNorm = 568.1984, GNorm = 0.3573, lr_0 = 9.9702e-04
Validation binary_cross_entropy = 0.100266
Epoch 4551
Loss = 3.4844e-02, PNorm = 568.2319, GNorm = 3.3671, lr_0 = 9.9702e-04
Validation binary_cross_entropy = 0.094943
Epoch 4552
Loss = 2.8401e-03, PNorm = 568.2702, GNorm = 0.2111, lr_0 = 9.9702e-04
Validation binary_cross_entropy = 0.100980
Epoch 4553
Loss = 1.2235e-02, PNorm = 568.2988, GNorm = 0.8544, lr_0 = 9.9702e-04
Validation binary_cross_entropy = 0.096548
Epoch 4554
Loss = 2.1049e-02, PNorm = 568.3329, GNorm = 1.1021, lr_0 = 9.9702e-04
Validation binary_cross_entropy = 0.106916
Epoch 4555
Loss = 1.4372e-02, PNorm = 568.3783, GNorm = 0.0490, lr_0 = 9.9701e-04
Validation binary_cross_entropy = 0.101214
Epoch 4556
Loss = 1.1935e-02, PNorm = 568.4199, GNorm = 1.5867, lr_0 = 9.9701e-04
Validation binary_cross_entropy = 0.090027
Epoch 4557
Loss = 1.7863e-02, PNorm = 568.4630, GNorm = 1.0478, lr_0 = 9.9701e-04
Validation binary_cross_entropy = 0.080885
Epoch 4558
Loss = 3.4126e-03, PNorm = 568.5007, GNorm = 0.1402, lr_0 = 9.9701e-04
Validation binary_cross_entropy = 0.070428
Epoch 4559
Loss = 5.4490e-03, PNorm = 568.5403, GNorm = 0.3325, lr_0 = 9.9701e-04
Loss = 4.1337e-02, PNorm = 568.6073, GNorm = 0.0767, lr_0 = 9.9701e-04
Validation binary_cross_entropy = 0.076506
Epoch 4560
Loss = 2.0225e-02, PNorm = 568.6762, GNorm = 1.0141, lr_0 = 9.9701e-04
Validation binary_cross_entropy = 0.086443
Epoch 4561
Loss = 1.2975e-02, PNorm = 568.7167, GNorm = 1.9093, lr_0 = 9.9701e-04
Validation binary_cross_entropy = 0.082999
Epoch 4562
Loss = 1.3721e-02, PNorm = 568.7456, GNorm = 1.1928, lr_0 = 9.9701e-04
Validation binary_cross_entropy = 0.074416
Epoch 4563
Loss = 7.3936e-02, PNorm = 568.8122, GNorm = 8.7365, lr_0 = 9.9701e-04
Validation binary_cross_entropy = 0.081330
Epoch 4564
Loss = 4.4615e-02, PNorm = 568.8672, GNorm = 0.1381, lr_0 = 9.9701e-04
Validation binary_cross_entropy = 0.064861
Epoch 4565
Loss = 3.8544e-02, PNorm = 568.9192, GNorm = 1.4517, lr_0 = 9.9701e-04
Validation binary_cross_entropy = 0.063028
Epoch 4566
Loss = 2.8780e-02, PNorm = 568.9694, GNorm = 0.1647, lr_0 = 9.9701e-04
Validation binary_cross_entropy = 0.070219
Epoch 4567
Loss = 4.8563e-02, PNorm = 569.0249, GNorm = 0.9812, lr_0 = 9.9701e-04
Validation binary_cross_entropy = 0.062653
Epoch 4568
Loss = 1.2985e-02, PNorm = 569.0654, GNorm = 0.4923, lr_0 = 9.9701e-04
Validation binary_cross_entropy = 0.091894
Epoch 4569
Loss = 6.0459e-03, PNorm = 569.1188, GNorm = 0.3768, lr_0 = 9.9700e-04
Loss = 1.3920e-02, PNorm = 569.1552, GNorm = 0.1866, lr_0 = 9.9700e-04
Validation binary_cross_entropy = 0.109013
Epoch 4570
Loss = 1.9383e-02, PNorm = 569.1865, GNorm = 3.5821, lr_0 = 9.9700e-04
Validation binary_cross_entropy = 0.117823
Epoch 4571
Loss = 2.8432e-02, PNorm = 569.2173, GNorm = 0.6116, lr_0 = 9.9700e-04
Validation binary_cross_entropy = 0.078179
Epoch 4572
Loss = 5.0309e-02, PNorm = 569.2964, GNorm = 0.0553, lr_0 = 9.9700e-04
Validation binary_cross_entropy = 0.094672
Epoch 4573
Loss = 3.1277e-02, PNorm = 569.3547, GNorm = 1.7777, lr_0 = 9.9700e-04
Validation binary_cross_entropy = 0.084700
Epoch 4574
Loss = 1.7473e-02, PNorm = 569.3948, GNorm = 0.3677, lr_0 = 9.9700e-04
Validation binary_cross_entropy = 0.080781
Epoch 4575
Loss = 3.1836e-02, PNorm = 569.4283, GNorm = 0.0578, lr_0 = 9.9700e-04
Validation binary_cross_entropy = 0.086756
Epoch 4576
Loss = 3.7041e-03, PNorm = 569.4623, GNorm = 0.2254, lr_0 = 9.9700e-04
Validation binary_cross_entropy = 0.099871
Epoch 4577
Loss = 3.9839e-02, PNorm = 569.4889, GNorm = 0.0395, lr_0 = 9.9700e-04
Validation binary_cross_entropy = 0.093115
Epoch 4578
Loss = 7.9371e-03, PNorm = 569.5148, GNorm = 0.0347, lr_0 = 9.9700e-04
Validation binary_cross_entropy = 0.088899
Epoch 4579
Loss = 2.3481e-03, PNorm = 569.5388, GNorm = 0.0811, lr_0 = 9.9700e-04
Loss = 4.0796e-02, PNorm = 569.5835, GNorm = 1.2381, lr_0 = 9.9700e-04
Validation binary_cross_entropy = 0.079638
Epoch 4580
Loss = 2.2134e-02, PNorm = 569.6641, GNorm = 0.8891, lr_0 = 9.9700e-04
Validation binary_cross_entropy = 0.094396
Epoch 4581
Loss = 1.4008e-01, PNorm = 569.7211, GNorm = 1.3235, lr_0 = 9.9700e-04
Validation binary_cross_entropy = 0.068341
Epoch 4582
Loss = 2.9833e-02, PNorm = 569.7901, GNorm = 1.9887, lr_0 = 9.9699e-04
Validation binary_cross_entropy = 0.073921
Epoch 4583
Loss = 2.2202e-02, PNorm = 569.8580, GNorm = 0.1273, lr_0 = 9.9699e-04
Validation binary_cross_entropy = 0.067831
Epoch 4584
Loss = 2.4131e-02, PNorm = 569.9087, GNorm = 0.3324, lr_0 = 9.9699e-04
Validation binary_cross_entropy = 0.066094
Epoch 4585
Loss = 3.8930e-02, PNorm = 569.9646, GNorm = 0.6835, lr_0 = 9.9699e-04
Validation binary_cross_entropy = 0.085451
Epoch 4586
Loss = 6.5708e-03, PNorm = 570.0139, GNorm = 0.6836, lr_0 = 9.9699e-04
Validation binary_cross_entropy = 0.084895
Epoch 4587
Loss = 5.9385e-03, PNorm = 570.0446, GNorm = 0.0331, lr_0 = 9.9699e-04
Validation binary_cross_entropy = 0.086533
Epoch 4588
Loss = 2.2898e-03, PNorm = 570.0742, GNorm = 0.0157, lr_0 = 9.9699e-04
Validation binary_cross_entropy = 0.078228
Epoch 4589
Loss = 3.1165e-02, PNorm = 570.1043, GNorm = 1.5758, lr_0 = 9.9699e-04
Loss = 1.4104e-02, PNorm = 570.1549, GNorm = 0.0301, lr_0 = 9.9699e-04
Validation binary_cross_entropy = 0.089972
Epoch 4590
Loss = 1.8303e-02, PNorm = 570.2048, GNorm = 0.0297, lr_0 = 9.9699e-04
Validation binary_cross_entropy = 0.096726
Epoch 4591
Loss = 2.9796e-02, PNorm = 570.2465, GNorm = 1.3761, lr_0 = 9.9699e-04
Validation binary_cross_entropy = 0.116240
Epoch 4592
Loss = 7.3937e-02, PNorm = 570.2799, GNorm = 0.0761, lr_0 = 9.9699e-04
Validation binary_cross_entropy = 0.078994
Epoch 4593
Loss = 6.7089e-02, PNorm = 570.3279, GNorm = 1.6824, lr_0 = 9.9699e-04
Validation binary_cross_entropy = 0.077453
Epoch 4594
Loss = 3.7436e-02, PNorm = 570.3910, GNorm = 0.6826, lr_0 = 9.9699e-04
Validation binary_cross_entropy = 0.072858
Epoch 4595
Loss = 3.3350e-02, PNorm = 570.4321, GNorm = 0.3846, lr_0 = 9.9698e-04
Validation binary_cross_entropy = 0.073086
Epoch 4596
Loss = 7.0570e-02, PNorm = 570.4664, GNorm = 0.5930, lr_0 = 9.9698e-04
Validation binary_cross_entropy = 0.062398
Epoch 4597
Loss = 9.1566e-03, PNorm = 570.4991, GNorm = 0.1856, lr_0 = 9.9698e-04
Validation binary_cross_entropy = 0.073862
Epoch 4598
Loss = 8.5687e-03, PNorm = 570.5616, GNorm = 0.0729, lr_0 = 9.9698e-04
Validation binary_cross_entropy = 0.084019
Epoch 4599
Loss = 1.0192e-03, PNorm = 570.6028, GNorm = 0.0342, lr_0 = 9.9698e-04
Loss = 4.5338e-02, PNorm = 570.6343, GNorm = 0.2620, lr_0 = 9.9698e-04
Validation binary_cross_entropy = 0.065760
Epoch 4600
Loss = 1.2601e-02, PNorm = 570.6850, GNorm = 0.5785, lr_0 = 9.9698e-04
Validation binary_cross_entropy = 0.068192
Epoch 4601
Loss = 9.9574e-03, PNorm = 570.7323, GNorm = 0.1803, lr_0 = 9.9698e-04
Validation binary_cross_entropy = 0.076556
Epoch 4602
Loss = 1.1899e-02, PNorm = 570.7654, GNorm = 0.6850, lr_0 = 9.9698e-04
Validation binary_cross_entropy = 0.087877
Epoch 4603
Loss = 4.6175e-03, PNorm = 570.8029, GNorm = 0.2075, lr_0 = 9.9698e-04
Validation binary_cross_entropy = 0.099159
Epoch 4604
Loss = 1.0215e-02, PNorm = 570.8264, GNorm = 0.0436, lr_0 = 9.9698e-04
Validation binary_cross_entropy = 0.094580
Epoch 4605
Loss = 4.2456e-02, PNorm = 570.8497, GNorm = 0.3076, lr_0 = 9.9698e-04
Validation binary_cross_entropy = 0.088847
Epoch 4606
Loss = 2.2404e-02, PNorm = 570.8845, GNorm = 0.3795, lr_0 = 9.9698e-04
Validation binary_cross_entropy = 0.080068
Epoch 4607
Loss = 1.4186e-02, PNorm = 570.9089, GNorm = 0.6127, lr_0 = 9.9698e-04
Validation binary_cross_entropy = 0.074478
Epoch 4608
Loss = 8.6735e-03, PNorm = 570.9458, GNorm = 0.3901, lr_0 = 9.9698e-04
Validation binary_cross_entropy = 0.086615
Epoch 4609
Loss = 3.3726e-02, PNorm = 570.9944, GNorm = 1.1246, lr_0 = 9.9697e-04
Loss = 4.7322e-02, PNorm = 571.0291, GNorm = 2.3520, lr_0 = 9.9697e-04
Validation binary_cross_entropy = 0.082363
Epoch 4610
Loss = 1.3637e-02, PNorm = 571.0580, GNorm = 0.7655, lr_0 = 9.9697e-04
Validation binary_cross_entropy = 0.076149
Epoch 4611
Loss = 2.4512e-02, PNorm = 571.0957, GNorm = 2.8818, lr_0 = 9.9697e-04
Validation binary_cross_entropy = 0.080375
Epoch 4612
Loss = 4.0277e-02, PNorm = 571.1380, GNorm = 0.8718, lr_0 = 9.9697e-04
Validation binary_cross_entropy = 0.094028
Epoch 4613
Loss = 2.3388e-02, PNorm = 571.1688, GNorm = 2.3236, lr_0 = 9.9697e-04
Validation binary_cross_entropy = 0.086518
Epoch 4614
Loss = 1.0298e-02, PNorm = 571.1950, GNorm = 0.0371, lr_0 = 9.9697e-04
Validation binary_cross_entropy = 0.084710
Epoch 4615
Loss = 7.9802e-03, PNorm = 571.2278, GNorm = 0.0079, lr_0 = 9.9697e-04
Validation binary_cross_entropy = 0.098663
Epoch 4616
Loss = 2.1033e-02, PNorm = 571.2647, GNorm = 2.8559, lr_0 = 9.9697e-04
Validation binary_cross_entropy = 0.077506
Epoch 4617
Loss = 2.9969e-02, PNorm = 571.3050, GNorm = 2.3983, lr_0 = 9.9697e-04
Validation binary_cross_entropy = 0.072331
Epoch 4618
Loss = 3.1257e-02, PNorm = 571.3560, GNorm = 0.8194, lr_0 = 9.9697e-04
Validation binary_cross_entropy = 0.074500
Epoch 4619
Loss = 1.2558e-02, PNorm = 571.4078, GNorm = 0.4711, lr_0 = 9.9697e-04
Loss = 3.0982e-02, PNorm = 571.4658, GNorm = 0.1856, lr_0 = 9.9697e-04
Validation binary_cross_entropy = 0.071936
Epoch 4620
Loss = 3.9043e-02, PNorm = 571.5262, GNorm = 1.0528, lr_0 = 9.9697e-04
Validation binary_cross_entropy = 0.075019
Epoch 4621
Loss = 1.0253e-02, PNorm = 571.5709, GNorm = 0.0580, lr_0 = 9.9696e-04
Validation binary_cross_entropy = 0.073959
Epoch 4622
Loss = 1.4465e-02, PNorm = 571.6035, GNorm = 0.6785, lr_0 = 9.9696e-04
Validation binary_cross_entropy = 0.076869
Epoch 4623
Loss = 9.0007e-02, PNorm = 571.6417, GNorm = 0.1531, lr_0 = 9.9696e-04
Validation binary_cross_entropy = 0.074246
Epoch 4624
Loss = 2.6542e-02, PNorm = 571.6845, GNorm = 1.1947, lr_0 = 9.9696e-04
Validation binary_cross_entropy = 0.061992
Epoch 4625
Loss = 5.3039e-02, PNorm = 571.7134, GNorm = 0.6532, lr_0 = 9.9696e-04
Validation binary_cross_entropy = 0.060760
Epoch 4626
Loss = 4.0494e-02, PNorm = 571.7827, GNorm = 1.5585, lr_0 = 9.9696e-04
Validation binary_cross_entropy = 0.067676
Epoch 4627
Loss = 1.0869e-02, PNorm = 571.8375, GNorm = 0.2733, lr_0 = 9.9696e-04
Validation binary_cross_entropy = 0.070204
Epoch 4628
Loss = 3.0931e-02, PNorm = 571.8813, GNorm = 1.2434, lr_0 = 9.9696e-04
Validation binary_cross_entropy = 0.073873
Epoch 4629
Loss = 1.9804e-03, PNorm = 571.9240, GNorm = 0.0582, lr_0 = 9.9696e-04
Loss = 1.8267e-02, PNorm = 571.9523, GNorm = 0.0416, lr_0 = 9.9696e-04
Validation binary_cross_entropy = 0.076294
Epoch 4630
Loss = 2.9837e-02, PNorm = 571.9810, GNorm = 0.3631, lr_0 = 9.9696e-04
Validation binary_cross_entropy = 0.075121
Epoch 4631
Loss = 1.3741e-02, PNorm = 572.0263, GNorm = 0.1584, lr_0 = 9.9696e-04
Validation binary_cross_entropy = 0.084206
Epoch 4632
Loss = 1.2055e-02, PNorm = 572.0808, GNorm = 1.2052, lr_0 = 9.9696e-04
Validation binary_cross_entropy = 0.109612
Epoch 4633
Loss = 1.7952e-02, PNorm = 572.1168, GNorm = 0.0501, lr_0 = 9.9696e-04
Validation binary_cross_entropy = 0.096682
Epoch 4634
Loss = 2.6520e-02, PNorm = 572.1470, GNorm = 1.5586, lr_0 = 9.9696e-04
Validation binary_cross_entropy = 0.089823
Epoch 4635
Loss = 6.4793e-03, PNorm = 572.1690, GNorm = 1.0413, lr_0 = 9.9695e-04
Validation binary_cross_entropy = 0.089969
Epoch 4636
Loss = 1.4912e-02, PNorm = 572.1963, GNorm = 2.6973, lr_0 = 9.9695e-04
Validation binary_cross_entropy = 0.102671
Epoch 4637
Loss = 1.4957e-02, PNorm = 572.2452, GNorm = 2.5367, lr_0 = 9.9695e-04
Validation binary_cross_entropy = 0.096515
Epoch 4638
Loss = 1.7808e-03, PNorm = 572.2853, GNorm = 0.1046, lr_0 = 9.9695e-04
Validation binary_cross_entropy = 0.096441
Epoch 4639
Loss = 1.0981e-02, PNorm = 572.3179, GNorm = 0.7719, lr_0 = 9.9695e-04
Loss = 1.1236e-02, PNorm = 572.3632, GNorm = 0.1210, lr_0 = 9.9695e-04
Validation binary_cross_entropy = 0.100158
Epoch 4640
Loss = 1.9870e-02, PNorm = 572.4082, GNorm = 2.9620, lr_0 = 9.9695e-04
Validation binary_cross_entropy = 0.104260
Epoch 4641
Loss = 1.4216e-02, PNorm = 572.4530, GNorm = 0.4892, lr_0 = 9.9695e-04
Validation binary_cross_entropy = 0.106516
Epoch 4642
Loss = 1.4012e-03, PNorm = 572.4965, GNorm = 0.2580, lr_0 = 9.9695e-04
Validation binary_cross_entropy = 0.113809
Epoch 4643
Loss = 3.6017e-02, PNorm = 572.5245, GNorm = 0.3991, lr_0 = 9.9695e-04
Validation binary_cross_entropy = 0.099199
Epoch 4644
Loss = 6.3103e-03, PNorm = 572.5731, GNorm = 0.1789, lr_0 = 9.9695e-04
Validation binary_cross_entropy = 0.102900
Epoch 4645
Loss = 3.4335e-02, PNorm = 572.6181, GNorm = 0.0039, lr_0 = 9.9695e-04
Validation binary_cross_entropy = 0.101001
Epoch 4646
Loss = 1.3390e-01, PNorm = 572.6437, GNorm = 0.0500, lr_0 = 9.9695e-04
Validation binary_cross_entropy = 0.088386
Epoch 4647
Loss = 6.1532e-03, PNorm = 572.6815, GNorm = 0.7969, lr_0 = 9.9695e-04
Validation binary_cross_entropy = 0.084973
Epoch 4648
Loss = 1.2755e-02, PNorm = 572.7286, GNorm = 0.0925, lr_0 = 9.9694e-04
Validation binary_cross_entropy = 0.089020
Epoch 4649
Loss = 1.0771e-02, PNorm = 572.7871, GNorm = 0.4592, lr_0 = 9.9694e-04
Loss = 1.8238e-02, PNorm = 572.8359, GNorm = 0.5447, lr_0 = 9.9694e-04
Validation binary_cross_entropy = 0.090408
Epoch 4650
Loss = 7.1553e-03, PNorm = 572.8724, GNorm = 0.0933, lr_0 = 9.9694e-04
Validation binary_cross_entropy = 0.088510
Epoch 4651
Loss = 1.9672e-02, PNorm = 572.9052, GNorm = 2.7061, lr_0 = 9.9694e-04
Validation binary_cross_entropy = 0.085869
Epoch 4652
Loss = 9.8193e-03, PNorm = 572.9455, GNorm = 1.4788, lr_0 = 9.9694e-04
Validation binary_cross_entropy = 0.087582
Epoch 4653
Loss = 1.6537e-02, PNorm = 573.0040, GNorm = 0.2259, lr_0 = 9.9694e-04
Validation binary_cross_entropy = 0.084103
Epoch 4654
Loss = 1.3028e-02, PNorm = 573.0619, GNorm = 2.3546, lr_0 = 9.9694e-04
Validation binary_cross_entropy = 0.081474
Epoch 4655
Loss = 7.3614e-03, PNorm = 573.1066, GNorm = 0.2463, lr_0 = 9.9694e-04
Validation binary_cross_entropy = 0.089415
Epoch 4656
Loss = 5.8083e-03, PNorm = 573.1424, GNorm = 0.0080, lr_0 = 9.9694e-04
Validation binary_cross_entropy = 0.097718
Epoch 4657
Loss = 2.3715e-01, PNorm = 573.1692, GNorm = 2.7486, lr_0 = 9.9694e-04
Validation binary_cross_entropy = 0.067078
Epoch 4658
Loss = 1.3775e-01, PNorm = 573.2080, GNorm = 0.3305, lr_0 = 9.9694e-04
Validation binary_cross_entropy = 0.083807
Epoch 4659
Loss = 8.6800e-03, PNorm = 573.2903, GNorm = 0.5870, lr_0 = 9.9694e-04
Loss = 2.6136e-02, PNorm = 573.3558, GNorm = 1.6526, lr_0 = 9.9694e-04
Validation binary_cross_entropy = 0.072821
Epoch 4660
Loss = 5.6998e-02, PNorm = 573.4137, GNorm = 3.6371, lr_0 = 9.9694e-04
Validation binary_cross_entropy = 0.067887
Epoch 4661
Loss = 1.6611e-02, PNorm = 573.4829, GNorm = 0.9019, lr_0 = 9.9693e-04
Validation binary_cross_entropy = 0.068571
Epoch 4662
Loss = 3.1170e-02, PNorm = 573.5527, GNorm = 0.1578, lr_0 = 9.9693e-04
Validation binary_cross_entropy = 0.072520
Epoch 4663
Loss = 2.9208e-02, PNorm = 573.6202, GNorm = 1.7746, lr_0 = 9.9693e-04
Validation binary_cross_entropy = 0.067261
Epoch 4664
Loss = 2.9691e-02, PNorm = 573.7085, GNorm = 0.1139, lr_0 = 9.9693e-04
Validation binary_cross_entropy = 0.082322
Epoch 4665
Loss = 5.9785e-02, PNorm = 573.7872, GNorm = 1.3787, lr_0 = 9.9693e-04
Validation binary_cross_entropy = 0.076625
Epoch 4666
Loss = 2.4682e-02, PNorm = 573.8664, GNorm = 2.0670, lr_0 = 9.9693e-04
Validation binary_cross_entropy = 0.093731
Epoch 4667
Loss = 5.0199e-03, PNorm = 573.9326, GNorm = 0.0805, lr_0 = 9.9693e-04
Validation binary_cross_entropy = 0.079265
Epoch 4668
Loss = 7.2693e-02, PNorm = 573.9743, GNorm = 3.9303, lr_0 = 9.9693e-04
Validation binary_cross_entropy = 0.071613
Epoch 4669
Loss = 3.0041e-02, PNorm = 574.0282, GNorm = 2.0103, lr_0 = 9.9693e-04
Loss = 1.2715e-02, PNorm = 574.0894, GNorm = 0.1341, lr_0 = 9.9693e-04
Validation binary_cross_entropy = 0.085171
Epoch 4670
Loss = 4.7637e-03, PNorm = 574.1309, GNorm = 0.0631, lr_0 = 9.9693e-04
Validation binary_cross_entropy = 0.095712
Epoch 4671
Loss = 5.4526e-03, PNorm = 574.1569, GNorm = 0.2964, lr_0 = 9.9693e-04
Validation binary_cross_entropy = 0.097421
Epoch 4672
Loss = 2.2870e-02, PNorm = 574.1791, GNorm = 0.0028, lr_0 = 9.9693e-04
Validation binary_cross_entropy = 0.097917
Epoch 4673
Loss = 1.3415e-03, PNorm = 574.2025, GNorm = 0.0347, lr_0 = 9.9693e-04
Validation binary_cross_entropy = 0.101616
Epoch 4674
Loss = 3.2874e-03, PNorm = 574.2243, GNorm = 0.0431, lr_0 = 9.9692e-04
Validation binary_cross_entropy = 0.093457
Epoch 4675
Loss = 8.8451e-03, PNorm = 574.2680, GNorm = 0.1453, lr_0 = 9.9692e-04
Validation binary_cross_entropy = 0.088137
Epoch 4676
Loss = 6.4028e-02, PNorm = 574.3152, GNorm = 1.6048, lr_0 = 9.9692e-04
Validation binary_cross_entropy = 0.081353
Epoch 4677
Loss = 4.1080e-02, PNorm = 574.3822, GNorm = 0.1453, lr_0 = 9.9692e-04
Validation binary_cross_entropy = 0.096012
Epoch 4678
Loss = 3.0148e-02, PNorm = 574.4451, GNorm = 2.1651, lr_0 = 9.9692e-04
Validation binary_cross_entropy = 0.078759
Epoch 4679
Loss = 4.6844e-02, PNorm = 574.5330, GNorm = 3.7000, lr_0 = 9.9692e-04
Loss = 3.5378e-02, PNorm = 574.6441, GNorm = 1.1228, lr_0 = 9.9692e-04
Validation binary_cross_entropy = 0.109029
Epoch 4680
Loss = 1.5226e-02, PNorm = 574.7108, GNorm = 0.0912, lr_0 = 9.9692e-04
Validation binary_cross_entropy = 0.087929
Epoch 4681
Loss = 1.9049e-02, PNorm = 574.7559, GNorm = 0.3885, lr_0 = 9.9692e-04
Validation binary_cross_entropy = 0.081001
Epoch 4682
Loss = 1.5072e-02, PNorm = 574.8039, GNorm = 1.3077, lr_0 = 9.9692e-04
Validation binary_cross_entropy = 0.097102
Epoch 4683
Loss = 1.5062e-03, PNorm = 574.8464, GNorm = 0.0022, lr_0 = 9.9692e-04
Validation binary_cross_entropy = 0.112448
Epoch 4684
Loss = 4.8902e-02, PNorm = 574.8871, GNorm = 2.0998, lr_0 = 9.9692e-04
Validation binary_cross_entropy = 0.093976
Epoch 4685
Loss = 1.7065e-02, PNorm = 574.9575, GNorm = 1.5474, lr_0 = 9.9692e-04
Validation binary_cross_entropy = 0.125896
Epoch 4686
Loss = 2.3387e-03, PNorm = 575.0348, GNorm = 0.0761, lr_0 = 9.9692e-04
Validation binary_cross_entropy = 0.128081
Epoch 4687
Loss = 3.0194e-02, PNorm = 575.1107, GNorm = 2.2896, lr_0 = 9.9692e-04
Validation binary_cross_entropy = 0.077641
Epoch 4688
Loss = 1.3082e-01, PNorm = 575.2224, GNorm = 0.8414, lr_0 = 9.9691e-04
Validation binary_cross_entropy = 0.093098
Epoch 4689
Loss = 2.5347e-02, PNorm = 575.3675, GNorm = 1.4026, lr_0 = 9.9691e-04
Loss = 4.7810e-02, PNorm = 575.4619, GNorm = 0.0932, lr_0 = 9.9691e-04
Validation binary_cross_entropy = 0.097970
Epoch 4690
Loss = 1.0168e-01, PNorm = 575.5611, GNorm = 0.4880, lr_0 = 9.9691e-04
Validation binary_cross_entropy = 0.083869
Epoch 4691
Loss = 3.8748e-02, PNorm = 575.7002, GNorm = 1.9134, lr_0 = 9.9691e-04
Validation binary_cross_entropy = 0.106204
Epoch 4692
Loss = 7.7288e-02, PNorm = 575.8198, GNorm = 1.1794, lr_0 = 9.9691e-04
Validation binary_cross_entropy = 0.089246
Epoch 4693
Loss = 1.4150e-02, PNorm = 575.9527, GNorm = 1.8767, lr_0 = 9.9691e-04
Validation binary_cross_entropy = 0.122518
Epoch 4694
Loss = 8.0938e-02, PNorm = 576.0475, GNorm = 0.5813, lr_0 = 9.9691e-04
Validation binary_cross_entropy = 0.074433
Epoch 4695
Loss = 1.1376e-01, PNorm = 576.1421, GNorm = 0.8980, lr_0 = 9.9691e-04
Validation binary_cross_entropy = 0.087134
Epoch 4696
Loss = 1.1638e-02, PNorm = 576.2439, GNorm = 0.9781, lr_0 = 9.9691e-04
Validation binary_cross_entropy = 0.067495
Epoch 4697
Loss = 5.0713e-02, PNorm = 576.3397, GNorm = 3.0270, lr_0 = 9.9691e-04
Validation binary_cross_entropy = 0.091260
Epoch 4698
Loss = 3.3404e-02, PNorm = 576.4530, GNorm = 0.4284, lr_0 = 9.9691e-04
Validation binary_cross_entropy = 0.119736
Epoch 4699
Loss = 1.1291e-01, PNorm = 576.5511, GNorm = 4.1573, lr_0 = 9.9691e-04
Loss = 3.9986e-02, PNorm = 576.6515, GNorm = 0.0612, lr_0 = 9.9691e-04
Validation binary_cross_entropy = 0.100720
Epoch 4700
Loss = 4.3799e-02, PNorm = 576.7582, GNorm = 0.1638, lr_0 = 9.9691e-04
Validation binary_cross_entropy = 0.096322
Epoch 4701
Loss = 1.3311e-01, PNorm = 576.8659, GNorm = 12.2557, lr_0 = 9.9690e-04
Validation binary_cross_entropy = 0.072512
Epoch 4702
Loss = 3.2669e-02, PNorm = 577.0041, GNorm = 3.8841, lr_0 = 9.9690e-04
Validation binary_cross_entropy = 0.102031
Epoch 4703
Loss = 4.2737e-02, PNorm = 577.1024, GNorm = 1.0373, lr_0 = 9.9690e-04
Validation binary_cross_entropy = 0.098647
Epoch 4704
Loss = 1.6983e-02, PNorm = 577.1580, GNorm = 0.0933, lr_0 = 9.9690e-04
Validation binary_cross_entropy = 0.078159
Epoch 4705
Loss = 4.2573e-02, PNorm = 577.2115, GNorm = 3.1913, lr_0 = 9.9690e-04
Validation binary_cross_entropy = 0.093306
Epoch 4706
Loss = 7.1588e-03, PNorm = 577.2957, GNorm = 0.0609, lr_0 = 9.9690e-04
Validation binary_cross_entropy = 0.084452
Epoch 4707
Loss = 1.0264e-02, PNorm = 577.3486, GNorm = 0.3793, lr_0 = 9.9690e-04
Validation binary_cross_entropy = 0.073731
Epoch 4708
Loss = 9.9635e-02, PNorm = 577.3938, GNorm = 6.7248, lr_0 = 9.9690e-04
Validation binary_cross_entropy = 0.088336
Epoch 4709
Loss = 4.7296e-02, PNorm = 577.4647, GNorm = 1.3431, lr_0 = 9.9690e-04
Loss = 2.0687e-02, PNorm = 577.5178, GNorm = 1.7810, lr_0 = 9.9690e-04
Validation binary_cross_entropy = 0.087386
Epoch 4710
Loss = 2.8728e-02, PNorm = 577.5676, GNorm = 0.6952, lr_0 = 9.9690e-04
Validation binary_cross_entropy = 0.104021
Epoch 4711
Loss = 5.7852e-02, PNorm = 577.6039, GNorm = 1.0645, lr_0 = 9.9690e-04
Validation binary_cross_entropy = 0.072228
Epoch 4712
Loss = 8.0682e-02, PNorm = 577.6930, GNorm = 1.3646, lr_0 = 9.9690e-04
Validation binary_cross_entropy = 0.093277
Epoch 4713
Loss = 4.9764e-02, PNorm = 577.7889, GNorm = 1.2437, lr_0 = 9.9690e-04
Validation binary_cross_entropy = 0.090728
Epoch 4714
Loss = 1.0137e-02, PNorm = 577.8446, GNorm = 0.3566, lr_0 = 9.9689e-04
Validation binary_cross_entropy = 0.080780
Epoch 4715
Loss = 1.5798e-02, PNorm = 577.8826, GNorm = 0.3341, lr_0 = 9.9689e-04
Validation binary_cross_entropy = 0.081325
Epoch 4716
Loss = 3.7501e-03, PNorm = 577.9385, GNorm = 0.0608, lr_0 = 9.9689e-04
Validation binary_cross_entropy = 0.086700
Epoch 4717
Loss = 4.4223e-02, PNorm = 577.9856, GNorm = 0.5752, lr_0 = 9.9689e-04
Validation binary_cross_entropy = 0.076418
Epoch 4718
Loss = 3.7494e-02, PNorm = 578.0456, GNorm = 0.8338, lr_0 = 9.9689e-04
Validation binary_cross_entropy = 0.079402
Epoch 4719
Loss = 5.5046e-02, PNorm = 578.1095, GNorm = 1.3900, lr_0 = 9.9689e-04
Loss = 1.6707e-02, PNorm = 578.1638, GNorm = 0.4335, lr_0 = 9.9689e-04
Validation binary_cross_entropy = 0.082414
Epoch 4720
Loss = 3.0809e-02, PNorm = 578.2023, GNorm = 0.6886, lr_0 = 9.9689e-04
Validation binary_cross_entropy = 0.071461
Epoch 4721
Loss = 9.0353e-03, PNorm = 578.2515, GNorm = 0.4002, lr_0 = 9.9689e-04
Validation binary_cross_entropy = 0.080570
Epoch 4722
Loss = 7.7843e-02, PNorm = 578.3102, GNorm = 0.2628, lr_0 = 9.9689e-04
Validation binary_cross_entropy = 0.085844
Epoch 4723
Loss = 1.3320e-02, PNorm = 578.3599, GNorm = 0.3249, lr_0 = 9.9689e-04
Validation binary_cross_entropy = 0.089889
Epoch 4724
Loss = 5.4263e-03, PNorm = 578.4021, GNorm = 0.0215, lr_0 = 9.9689e-04
Validation binary_cross_entropy = 0.092236
Epoch 4725
Loss = 1.2047e-02, PNorm = 578.4330, GNorm = 0.1271, lr_0 = 9.9689e-04
Validation binary_cross_entropy = 0.106088
Epoch 4726
Loss = 2.5917e-02, PNorm = 578.4799, GNorm = 4.4227, lr_0 = 9.9689e-04
Validation binary_cross_entropy = 0.107188
Epoch 4727
Loss = 1.7630e-02, PNorm = 578.5230, GNorm = 0.0314, lr_0 = 9.9689e-04
Validation binary_cross_entropy = 0.109493
Epoch 4728
Loss = 1.0056e-03, PNorm = 578.5547, GNorm = 0.0221, lr_0 = 9.9688e-04
Validation binary_cross_entropy = 0.095974
Epoch 4729
Loss = 5.3240e-02, PNorm = 578.5865, GNorm = 1.7295, lr_0 = 9.9688e-04
Loss = 5.2927e-02, PNorm = 578.6231, GNorm = 3.1841, lr_0 = 9.9688e-04
Validation binary_cross_entropy = 0.089285
Epoch 4730
Loss = 7.6857e-03, PNorm = 578.6778, GNorm = 0.0859, lr_0 = 9.9688e-04
Validation binary_cross_entropy = 0.102517
Epoch 4731
Loss = 2.4738e-02, PNorm = 578.7114, GNorm = 0.0444, lr_0 = 9.9688e-04
Validation binary_cross_entropy = 0.089129
Epoch 4732
Loss = 2.1560e-02, PNorm = 578.7416, GNorm = 0.0282, lr_0 = 9.9688e-04
Validation binary_cross_entropy = 0.085045
Epoch 4733
Loss = 3.8350e-02, PNorm = 578.7726, GNorm = 0.3184, lr_0 = 9.9688e-04
Validation binary_cross_entropy = 0.084151
Epoch 4734
Loss = 2.2664e-02, PNorm = 578.8132, GNorm = 1.3941, lr_0 = 9.9688e-04
Validation binary_cross_entropy = 0.090513
Epoch 4735
Loss = 2.3241e-02, PNorm = 578.8547, GNorm = 0.1495, lr_0 = 9.9688e-04
Validation binary_cross_entropy = 0.107463
Epoch 4736
Loss = 1.5002e-01, PNorm = 578.9007, GNorm = 3.8843, lr_0 = 9.9688e-04
Validation binary_cross_entropy = 0.080683
Epoch 4737
Loss = 2.0434e-01, PNorm = 578.9567, GNorm = 14.4725, lr_0 = 9.9688e-04
Validation binary_cross_entropy = 0.074480
Epoch 4738
Loss = 1.9095e-02, PNorm = 579.0273, GNorm = 0.1719, lr_0 = 9.9688e-04
Validation binary_cross_entropy = 0.071952
Epoch 4739
Loss = 6.4821e-02, PNorm = 579.0866, GNorm = 1.6624, lr_0 = 9.9688e-04
Loss = 4.9659e-02, PNorm = 579.1474, GNorm = 0.7842, lr_0 = 9.9688e-04
Validation binary_cross_entropy = 0.071197
Epoch 4740
Loss = 2.1319e-02, PNorm = 579.2135, GNorm = 1.0413, lr_0 = 9.9687e-04
Validation binary_cross_entropy = 0.078568
Epoch 4741
Loss = 2.5981e-02, PNorm = 579.2645, GNorm = 0.0214, lr_0 = 9.9687e-04
Validation binary_cross_entropy = 0.080939
Epoch 4742
Loss = 1.8149e-02, PNorm = 579.3142, GNorm = 0.1492, lr_0 = 9.9687e-04
Validation binary_cross_entropy = 0.088832
Epoch 4743
Loss = 1.7070e-02, PNorm = 579.3513, GNorm = 0.0173, lr_0 = 9.9687e-04
Validation binary_cross_entropy = 0.100767
Epoch 4744
Loss = 2.8602e-03, PNorm = 579.3969, GNorm = 0.0390, lr_0 = 9.9687e-04
Validation binary_cross_entropy = 0.138706
Epoch 4745
Loss = 1.8924e-02, PNorm = 579.4233, GNorm = 0.0096, lr_0 = 9.9687e-04
Validation binary_cross_entropy = 0.095580
Epoch 4746
Loss = 2.6229e-02, PNorm = 579.4513, GNorm = 1.9077, lr_0 = 9.9687e-04
Validation binary_cross_entropy = 0.103273
Epoch 4747
Loss = 4.1670e-02, PNorm = 579.5123, GNorm = 1.7208, lr_0 = 9.9687e-04
Validation binary_cross_entropy = 0.102608
Epoch 4748
Loss = 5.9414e-03, PNorm = 579.5597, GNorm = 0.0857, lr_0 = 9.9687e-04
Validation binary_cross_entropy = 0.093764
Epoch 4749
Loss = 3.2175e-02, PNorm = 579.6090, GNorm = 1.5199, lr_0 = 9.9687e-04
Loss = 3.3861e-02, PNorm = 579.6643, GNorm = 0.1865, lr_0 = 9.9687e-04
Validation binary_cross_entropy = 0.082251
Epoch 4750
Loss = 1.4244e-02, PNorm = 579.7355, GNorm = 0.2007, lr_0 = 9.9687e-04
Validation binary_cross_entropy = 0.089797
Epoch 4751
Loss = 2.1030e-02, PNorm = 579.7883, GNorm = 1.0801, lr_0 = 9.9687e-04
Validation binary_cross_entropy = 0.101679
Epoch 4752
Loss = 1.6080e-02, PNorm = 579.8200, GNorm = 0.0131, lr_0 = 9.9687e-04
Validation binary_cross_entropy = 0.091930
Epoch 4753
Loss = 4.5734e-02, PNorm = 579.8963, GNorm = 4.4875, lr_0 = 9.9687e-04
Validation binary_cross_entropy = 0.103391
Epoch 4754
Loss = 5.0689e-02, PNorm = 579.9940, GNorm = 3.0518, lr_0 = 9.9686e-04
Validation binary_cross_entropy = 0.099895
Epoch 4755
Loss = 1.7579e-02, PNorm = 580.0595, GNorm = 0.9589, lr_0 = 9.9686e-04
Validation binary_cross_entropy = 0.117215
Epoch 4756
Loss = 1.0719e+00, PNorm = 580.1347, GNorm = 0.8246, lr_0 = 9.9686e-04
Validation binary_cross_entropy = 0.066905
Epoch 4757
Loss = 9.4928e-02, PNorm = 580.2434, GNorm = 4.5816, lr_0 = 9.9686e-04
Validation binary_cross_entropy = 0.135128
Epoch 4758
Loss = 1.0288e-01, PNorm = 580.3401, GNorm = 1.3252, lr_0 = 9.9686e-04
Validation binary_cross_entropy = 0.065932
Epoch 4759
Loss = 1.3327e-01, PNorm = 580.4299, GNorm = 4.7157, lr_0 = 9.9686e-04
Loss = 7.6656e-02, PNorm = 580.5102, GNorm = 0.7459, lr_0 = 9.9686e-04
Validation binary_cross_entropy = 0.080413
Epoch 4760
Loss = 1.0060e-01, PNorm = 580.5873, GNorm = 3.0556, lr_0 = 9.9686e-04
Validation binary_cross_entropy = 0.089828
Epoch 4761
Loss = 4.7175e-02, PNorm = 580.6553, GNorm = 1.6879, lr_0 = 9.9686e-04
Validation binary_cross_entropy = 0.073475
Epoch 4762
Loss = 3.9426e-02, PNorm = 580.7231, GNorm = 0.4988, lr_0 = 9.9686e-04
Validation binary_cross_entropy = 0.080535
Epoch 4763
Loss = 4.3143e-02, PNorm = 580.7970, GNorm = 0.2056, lr_0 = 9.9686e-04
Validation binary_cross_entropy = 0.093675
Epoch 4764
Loss = 2.4748e-02, PNorm = 580.8557, GNorm = 0.8599, lr_0 = 9.9686e-04
Validation binary_cross_entropy = 0.107612
Epoch 4765
Loss = 5.1070e-02, PNorm = 580.9183, GNorm = 0.1648, lr_0 = 9.9686e-04
Validation binary_cross_entropy = 0.075987
Epoch 4766
Loss = 2.0341e-02, PNorm = 580.9842, GNorm = 1.0360, lr_0 = 9.9686e-04
Validation binary_cross_entropy = 0.100508
Epoch 4767
Loss = 1.5662e-02, PNorm = 581.0452, GNorm = 0.1574, lr_0 = 9.9685e-04
Validation binary_cross_entropy = 0.088999
Epoch 4768
Loss = 7.4361e-03, PNorm = 581.1092, GNorm = 0.3673, lr_0 = 9.9685e-04
Validation binary_cross_entropy = 0.082169
Epoch 4769
Loss = 5.6691e-02, PNorm = 581.1627, GNorm = 2.0595, lr_0 = 9.9685e-04
Loss = 4.9684e-02, PNorm = 581.2211, GNorm = 0.6351, lr_0 = 9.9685e-04
Validation binary_cross_entropy = 0.089034
Epoch 4770
Loss = 5.1614e-02, PNorm = 581.2871, GNorm = 0.1777, lr_0 = 9.9685e-04
Validation binary_cross_entropy = 0.100408
Epoch 4771
Loss = 3.8500e-02, PNorm = 581.3622, GNorm = 0.6342, lr_0 = 9.9685e-04
Validation binary_cross_entropy = 0.090093
Epoch 4772
Loss = 2.8018e-02, PNorm = 581.4411, GNorm = 0.0409, lr_0 = 9.9685e-04
Validation binary_cross_entropy = 0.126226
Epoch 4773
Loss = 9.7436e-02, PNorm = 581.4956, GNorm = 0.1207, lr_0 = 9.9685e-04
Validation binary_cross_entropy = 0.104124
Epoch 4774
Loss = 1.1059e-02, PNorm = 581.5548, GNorm = 0.0727, lr_0 = 9.9685e-04
Validation binary_cross_entropy = 0.097649
Epoch 4775
Loss = 2.0215e-02, PNorm = 581.5998, GNorm = 0.4327, lr_0 = 9.9685e-04
Validation binary_cross_entropy = 0.080085
Epoch 4776
Loss = 7.5326e-02, PNorm = 581.6683, GNorm = 0.1419, lr_0 = 9.9685e-04
Validation binary_cross_entropy = 0.074949
Epoch 4777
Loss = 1.2535e-02, PNorm = 581.7374, GNorm = 0.1400, lr_0 = 9.9685e-04
Validation binary_cross_entropy = 0.096961
Epoch 4778
Loss = 1.9168e-02, PNorm = 581.7988, GNorm = 1.8386, lr_0 = 9.9685e-04
Validation binary_cross_entropy = 0.111890
Epoch 4779
Loss = 4.6050e-02, PNorm = 581.8390, GNorm = 2.6625, lr_0 = 9.9685e-04
Loss = 5.7696e-02, PNorm = 581.8738, GNorm = 0.1020, lr_0 = 9.9685e-04
Validation binary_cross_entropy = 0.075631
Epoch 4780
Loss = 2.6777e-02, PNorm = 581.9291, GNorm = 0.5730, lr_0 = 9.9684e-04
Validation binary_cross_entropy = 0.087451
Epoch 4781
Loss = 1.7532e-02, PNorm = 581.9758, GNorm = 0.0444, lr_0 = 9.9684e-04
Validation binary_cross_entropy = 0.098500
Epoch 4782
Loss = 2.0907e-02, PNorm = 582.0117, GNorm = 1.2864, lr_0 = 9.9684e-04
Validation binary_cross_entropy = 0.093556
Epoch 4783
Loss = 1.2447e-02, PNorm = 582.0512, GNorm = 0.5940, lr_0 = 9.9684e-04
Validation binary_cross_entropy = 0.095301
Epoch 4784
Loss = 3.9143e-03, PNorm = 582.0922, GNorm = 0.8734, lr_0 = 9.9684e-04
Validation binary_cross_entropy = 0.101968
Epoch 4785
Loss = 1.2240e-02, PNorm = 582.1425, GNorm = 0.4771, lr_0 = 9.9684e-04
Validation binary_cross_entropy = 0.181431
Epoch 4786
Loss = 1.4383e-01, PNorm = 582.1965, GNorm = 0.9980, lr_0 = 9.9684e-04
Validation binary_cross_entropy = 0.182762
Epoch 4787
Loss = 9.0900e-03, PNorm = 582.2330, GNorm = 0.0168, lr_0 = 9.9684e-04
Validation binary_cross_entropy = 0.153652
Epoch 4788
Loss = 1.7986e-02, PNorm = 582.2693, GNorm = 1.3356, lr_0 = 9.9684e-04
Validation binary_cross_entropy = 0.131168
Epoch 4789
Loss = 1.0168e-02, PNorm = 582.2984, GNorm = 0.6832, lr_0 = 9.9684e-04
Loss = 4.6041e-02, PNorm = 582.3405, GNorm = 0.1522, lr_0 = 9.9684e-04
Validation binary_cross_entropy = 0.105616
Epoch 4790
Loss = 1.7647e-02, PNorm = 582.3881, GNorm = 0.4820, lr_0 = 9.9684e-04
Validation binary_cross_entropy = 0.108812
Epoch 4791
Loss = 2.7653e-02, PNorm = 582.4236, GNorm = 0.4493, lr_0 = 9.9684e-04
Validation binary_cross_entropy = 0.102313
Epoch 4792
Loss = 1.0603e-02, PNorm = 582.4608, GNorm = 0.0190, lr_0 = 9.9684e-04
Validation binary_cross_entropy = 0.109152
Epoch 4793
Loss = 1.5741e-02, PNorm = 582.5021, GNorm = 0.8915, lr_0 = 9.9684e-04
Validation binary_cross_entropy = 0.128828
Epoch 4794
Loss = 3.2295e-02, PNorm = 582.5444, GNorm = 0.0113, lr_0 = 9.9683e-04
Validation binary_cross_entropy = 0.133915
Epoch 4795
Loss = 6.3216e-02, PNorm = 582.5696, GNorm = 0.0604, lr_0 = 9.9683e-04
Validation binary_cross_entropy = 0.078606
Epoch 4796
Loss = 5.1789e-02, PNorm = 582.6291, GNorm = 0.1454, lr_0 = 9.9683e-04
Validation binary_cross_entropy = 0.074327
Epoch 4797
Loss = 6.5287e-02, PNorm = 582.6917, GNorm = 5.4536, lr_0 = 9.9683e-04
Validation binary_cross_entropy = 0.055091
Epoch 4798
Loss = 1.6523e-02, PNorm = 582.7614, GNorm = 0.9589, lr_0 = 9.9683e-04
Validation binary_cross_entropy = 0.075232
Epoch 4799
Loss = 9.0147e-02, PNorm = 582.8422, GNorm = 3.3531, lr_0 = 9.9683e-04
Loss = 4.2984e-02, PNorm = 582.8979, GNorm = 0.0601, lr_0 = 9.9683e-04
Validation binary_cross_entropy = 0.084065
Epoch 4800
Loss = 3.2167e-02, PNorm = 582.9341, GNorm = 0.2323, lr_0 = 9.9683e-04
Validation binary_cross_entropy = 0.070997
Epoch 4801
Loss = 5.6986e-02, PNorm = 582.9695, GNorm = 1.4701, lr_0 = 9.9683e-04
Validation binary_cross_entropy = 0.068365
Epoch 4802
Loss = 2.3614e-02, PNorm = 583.0249, GNorm = 0.0711, lr_0 = 9.9683e-04
Validation binary_cross_entropy = 0.093940
Epoch 4803
Loss = 5.1125e-02, PNorm = 583.0692, GNorm = 1.6000, lr_0 = 9.9683e-04
Validation binary_cross_entropy = 0.077074
Epoch 4804
Loss = 2.6980e-02, PNorm = 583.1066, GNorm = 1.2271, lr_0 = 9.9683e-04
Validation binary_cross_entropy = 0.079988
Epoch 4805
Loss = 3.0407e-02, PNorm = 583.1585, GNorm = 1.6942, lr_0 = 9.9683e-04
Validation binary_cross_entropy = 0.098207
Epoch 4806
Loss = 1.8157e-03, PNorm = 583.2170, GNorm = 0.0856, lr_0 = 9.9683e-04
Validation binary_cross_entropy = 0.111748
Epoch 4807
Loss = 1.0521e-01, PNorm = 583.2552, GNorm = 0.4579, lr_0 = 9.9682e-04
Validation binary_cross_entropy = 0.107351
Epoch 4808
Loss = 4.8023e-03, PNorm = 583.2829, GNorm = 0.1676, lr_0 = 9.9682e-04
Validation binary_cross_entropy = 0.116788
Epoch 4809
Loss = 1.0600e-02, PNorm = 583.3125, GNorm = 0.5288, lr_0 = 9.9682e-04
Loss = 2.1003e-02, PNorm = 583.3399, GNorm = 1.1294, lr_0 = 9.9682e-04
Validation binary_cross_entropy = 0.111915
Epoch 4810
Loss = 1.3396e-02, PNorm = 583.3663, GNorm = 0.0598, lr_0 = 9.9682e-04
Validation binary_cross_entropy = 0.096316
Epoch 4811
Loss = 2.0383e-02, PNorm = 583.3939, GNorm = 2.4020, lr_0 = 9.9682e-04
Validation binary_cross_entropy = 0.100863
Epoch 4812
Loss = 4.1409e-02, PNorm = 583.4368, GNorm = 0.0531, lr_0 = 9.9682e-04
Validation binary_cross_entropy = 0.093561
Epoch 4813
Loss = 9.4779e-03, PNorm = 583.4800, GNorm = 0.2568, lr_0 = 9.9682e-04
Validation binary_cross_entropy = 0.088676
Epoch 4814
Loss = 1.1051e-02, PNorm = 583.5387, GNorm = 0.0938, lr_0 = 9.9682e-04
Validation binary_cross_entropy = 0.114232
Epoch 4815
Loss = 2.3142e-02, PNorm = 583.5733, GNorm = 0.8293, lr_0 = 9.9682e-04
Validation binary_cross_entropy = 0.092502
Epoch 4816
Loss = 4.4386e-02, PNorm = 583.6177, GNorm = 1.6998, lr_0 = 9.9682e-04
Validation binary_cross_entropy = 0.079331
Epoch 4817
Loss = 1.8816e-02, PNorm = 583.7052, GNorm = 1.1129, lr_0 = 9.9682e-04
Validation binary_cross_entropy = 0.082909
Epoch 4818
Loss = 9.7966e-02, PNorm = 583.7715, GNorm = 1.1293, lr_0 = 9.9682e-04
Validation binary_cross_entropy = 0.065249
Epoch 4819
Loss = 1.2207e-02, PNorm = 583.8209, GNorm = 0.2989, lr_0 = 9.9682e-04
Loss = 2.2282e-02, PNorm = 583.8767, GNorm = 0.9023, lr_0 = 9.9682e-04
Validation binary_cross_entropy = 0.100263
Epoch 4820
Loss = 5.7408e-02, PNorm = 583.9114, GNorm = 0.8690, lr_0 = 9.9681e-04
Validation binary_cross_entropy = 0.079447
Epoch 4821
Loss = 1.6630e-02, PNorm = 583.9540, GNorm = 0.1423, lr_0 = 9.9681e-04
Validation binary_cross_entropy = 0.093079
Epoch 4822
Loss = 1.1912e-02, PNorm = 583.9898, GNorm = 0.2419, lr_0 = 9.9681e-04
Validation binary_cross_entropy = 0.092388
Epoch 4823
Loss = 1.7278e-02, PNorm = 584.0268, GNorm = 0.0788, lr_0 = 9.9681e-04
Validation binary_cross_entropy = 0.086539
Epoch 4824
Loss = 3.7071e-02, PNorm = 584.0667, GNorm = 0.0585, lr_0 = 9.9681e-04
Validation binary_cross_entropy = 0.087970
Epoch 4825
Loss = 1.3139e-02, PNorm = 584.1121, GNorm = 0.7396, lr_0 = 9.9681e-04
Validation binary_cross_entropy = 0.081224
Epoch 4826
Loss = 2.7297e-02, PNorm = 584.1507, GNorm = 1.7689, lr_0 = 9.9681e-04
Validation binary_cross_entropy = 0.086933
Epoch 4827
Loss = 1.2440e-02, PNorm = 584.2092, GNorm = 0.0800, lr_0 = 9.9681e-04
Validation binary_cross_entropy = 0.086727
Epoch 4828
Loss = 1.6429e-02, PNorm = 584.2478, GNorm = 0.6309, lr_0 = 9.9681e-04
Validation binary_cross_entropy = 0.098039
Epoch 4829
Loss = 5.0946e-03, PNorm = 584.2803, GNorm = 0.4643, lr_0 = 9.9681e-04
Loss = 6.4741e-02, PNorm = 584.3032, GNorm = 3.2305, lr_0 = 9.9681e-04
Validation binary_cross_entropy = 0.080706
Epoch 4830
Loss = 8.5973e-03, PNorm = 584.3397, GNorm = 0.1455, lr_0 = 9.9681e-04
Validation binary_cross_entropy = 0.081697
Epoch 4831
Loss = 2.5858e-02, PNorm = 584.3810, GNorm = 3.5650, lr_0 = 9.9681e-04
Validation binary_cross_entropy = 0.094641
Epoch 4832
Loss = 4.6134e-02, PNorm = 584.4302, GNorm = 2.1368, lr_0 = 9.9681e-04
Validation binary_cross_entropy = 0.097661
Epoch 4833
Loss = 3.1529e-02, PNorm = 584.4765, GNorm = 1.3629, lr_0 = 9.9680e-04
Validation binary_cross_entropy = 0.081528
Epoch 4834
Loss = 5.7957e-02, PNorm = 584.5102, GNorm = 0.1404, lr_0 = 9.9680e-04
Validation binary_cross_entropy = 0.075571
Epoch 4835
Loss = 3.0971e-02, PNorm = 584.5610, GNorm = 0.1221, lr_0 = 9.9680e-04
Validation binary_cross_entropy = 0.087001
Epoch 4836
Loss = 1.5607e-02, PNorm = 584.6380, GNorm = 0.4991, lr_0 = 9.9680e-04
Validation binary_cross_entropy = 0.081948
Epoch 4837
Loss = 4.3496e-03, PNorm = 584.6882, GNorm = 0.2640, lr_0 = 9.9680e-04
Validation binary_cross_entropy = 0.079009
Epoch 4838
Loss = 2.0115e-02, PNorm = 584.7398, GNorm = 1.6334, lr_0 = 9.9680e-04
Validation binary_cross_entropy = 0.086090
Epoch 4839
Loss = 5.3388e-03, PNorm = 584.8012, GNorm = 0.1565, lr_0 = 9.9680e-04
Loss = 3.4944e-02, PNorm = 584.8432, GNorm = 0.0814, lr_0 = 9.9680e-04
Validation binary_cross_entropy = 0.086627
Epoch 4840
Loss = 2.4428e-02, PNorm = 584.8834, GNorm = 2.3941, lr_0 = 9.9680e-04
Validation binary_cross_entropy = 0.087168
Epoch 4841
Loss = 1.8354e-02, PNorm = 584.9385, GNorm = 0.2197, lr_0 = 9.9680e-04
Validation binary_cross_entropy = 0.088223
Epoch 4842
Loss = 5.4729e-02, PNorm = 584.9781, GNorm = 1.1052, lr_0 = 9.9680e-04
Validation binary_cross_entropy = 0.069166
Epoch 4843
Loss = 1.7265e-02, PNorm = 585.0258, GNorm = 1.6689, lr_0 = 9.9680e-04
Validation binary_cross_entropy = 0.077701
Epoch 4844
Loss = 1.2487e-01, PNorm = 585.0803, GNorm = 0.7194, lr_0 = 9.9680e-04
Validation binary_cross_entropy = 0.074844
Epoch 4845
Loss = 1.6158e-02, PNorm = 585.1330, GNorm = 0.0931, lr_0 = 9.9680e-04
Validation binary_cross_entropy = 0.076503
Epoch 4846
Loss = 2.8102e-02, PNorm = 585.1937, GNorm = 0.4295, lr_0 = 9.9680e-04
Validation binary_cross_entropy = 0.080852
Epoch 4847
Loss = 6.7657e-02, PNorm = 585.2485, GNorm = 1.8397, lr_0 = 9.9679e-04
Validation binary_cross_entropy = 0.074654
Epoch 4848
Loss = 2.9355e-02, PNorm = 585.3006, GNorm = 1.8939, lr_0 = 9.9679e-04
Validation binary_cross_entropy = 0.069667
Epoch 4849
Loss = 1.5394e-02, PNorm = 585.3641, GNorm = 0.6738, lr_0 = 9.9679e-04
Loss = 1.4181e-02, PNorm = 585.4137, GNorm = 0.2289, lr_0 = 9.9679e-04
Validation binary_cross_entropy = 0.078993
Epoch 4850
Loss = 7.6931e-03, PNorm = 585.4511, GNorm = 0.4392, lr_0 = 9.9679e-04
Validation binary_cross_entropy = 0.111993
Epoch 4851
Loss = 3.6195e-02, PNorm = 585.4894, GNorm = 3.2552, lr_0 = 9.9679e-04
Validation binary_cross_entropy = 0.189435
Epoch 4852
Loss = 3.5874e-02, PNorm = 585.5194, GNorm = 0.0329, lr_0 = 9.9679e-04
Validation binary_cross_entropy = 0.085994
Epoch 4853
Loss = 5.0098e-01, PNorm = 585.5625, GNorm = 0.3059, lr_0 = 9.9679e-04
Validation binary_cross_entropy = 0.111060
Epoch 4854
Loss = 9.3528e-02, PNorm = 585.6315, GNorm = 1.5454, lr_0 = 9.9679e-04
Validation binary_cross_entropy = 0.056559
Epoch 4855
Loss = 5.7274e-02, PNorm = 585.7201, GNorm = 2.4505, lr_0 = 9.9679e-04
Validation binary_cross_entropy = 0.105012
Epoch 4856
Loss = 2.0646e-02, PNorm = 585.7900, GNorm = 0.4117, lr_0 = 9.9679e-04
Validation binary_cross_entropy = 0.076907
Epoch 4857
Loss = 2.4122e-02, PNorm = 585.8501, GNorm = 0.2840, lr_0 = 9.9679e-04
Validation binary_cross_entropy = 0.086950
Epoch 4858
Loss = 2.9608e-02, PNorm = 585.9438, GNorm = 1.8203, lr_0 = 9.9679e-04
Validation binary_cross_entropy = 0.076825
Epoch 4859
Loss = 7.7157e-03, PNorm = 586.0283, GNorm = 0.4093, lr_0 = 9.9679e-04
Loss = 4.8105e-02, PNorm = 586.0920, GNorm = 4.1734, lr_0 = 9.9678e-04
Validation binary_cross_entropy = 0.109789
Epoch 4860
Loss = 5.2491e-02, PNorm = 586.1474, GNorm = 0.4439, lr_0 = 9.9678e-04
Validation binary_cross_entropy = 0.068793
Epoch 4861
Loss = 3.5688e-02, PNorm = 586.2280, GNorm = 0.2446, lr_0 = 9.9678e-04
Validation binary_cross_entropy = 0.081342
Epoch 4862
Loss = 2.5714e-02, PNorm = 586.2964, GNorm = 1.5365, lr_0 = 9.9678e-04
Validation binary_cross_entropy = 0.080562
Epoch 4863
Loss = 1.7944e-02, PNorm = 586.3506, GNorm = 0.1875, lr_0 = 9.9678e-04
Validation binary_cross_entropy = 0.075950
Epoch 4864
Loss = 1.7541e-02, PNorm = 586.4071, GNorm = 0.7159, lr_0 = 9.9678e-04
Validation binary_cross_entropy = 0.086740
Epoch 4865
Loss = 7.0575e-02, PNorm = 586.4495, GNorm = 1.4488, lr_0 = 9.9678e-04
Validation binary_cross_entropy = 0.076648
Epoch 4866
Loss = 5.1374e-02, PNorm = 586.4932, GNorm = 0.7986, lr_0 = 9.9678e-04
Validation binary_cross_entropy = 0.079662
Epoch 4867
Loss = 5.1248e-02, PNorm = 586.5521, GNorm = 1.8940, lr_0 = 9.9678e-04
Validation binary_cross_entropy = 0.076176
Epoch 4868
Loss = 1.9319e-02, PNorm = 586.6002, GNorm = 0.1454, lr_0 = 9.9678e-04
Validation binary_cross_entropy = 0.084153
Epoch 4869
Loss = 2.7826e-03, PNorm = 586.6553, GNorm = 0.1826, lr_0 = 9.9678e-04
Loss = 4.3688e-02, PNorm = 586.6927, GNorm = 0.3338, lr_0 = 9.9678e-04
Validation binary_cross_entropy = 0.075979
Epoch 4870
Loss = 1.7528e-02, PNorm = 586.7331, GNorm = 1.8879, lr_0 = 9.9678e-04
Validation binary_cross_entropy = 0.076308
Epoch 4871
Loss = 1.0526e-02, PNorm = 586.7892, GNorm = 0.0811, lr_0 = 9.9678e-04
Validation binary_cross_entropy = 0.075251
Epoch 4872
Loss = 1.2224e-02, PNorm = 586.8527, GNorm = 0.0305, lr_0 = 9.9678e-04
Validation binary_cross_entropy = 0.078850
Epoch 4873
Loss = 3.4209e-02, PNorm = 586.8950, GNorm = 0.0406, lr_0 = 9.9677e-04
Validation binary_cross_entropy = 0.080404
Epoch 4874
Loss = 6.0219e-02, PNorm = 586.9508, GNorm = 1.5571, lr_0 = 9.9677e-04
Validation binary_cross_entropy = 0.097956
Epoch 4875
Loss = 2.3142e-02, PNorm = 587.0076, GNorm = 0.3244, lr_0 = 9.9677e-04
Validation binary_cross_entropy = 0.087565
Epoch 4876
Loss = 1.0722e-01, PNorm = 587.0532, GNorm = 4.3938, lr_0 = 9.9677e-04
Validation binary_cross_entropy = 0.092344
Epoch 4877
Loss = 7.7193e-02, PNorm = 587.1184, GNorm = 5.0324, lr_0 = 9.9677e-04
Validation binary_cross_entropy = 0.089219
Epoch 4878
Loss = 3.4148e-02, PNorm = 587.1744, GNorm = 1.7469, lr_0 = 9.9677e-04
Validation binary_cross_entropy = 0.070918
Epoch 4879
Loss = 4.6636e-02, PNorm = 587.2270, GNorm = 1.4916, lr_0 = 9.9677e-04
Loss = 2.3219e-02, PNorm = 587.3003, GNorm = 2.1119, lr_0 = 9.9677e-04
Validation binary_cross_entropy = 0.084951
Epoch 4880
Loss = 1.8267e-02, PNorm = 587.3684, GNorm = 0.5652, lr_0 = 9.9677e-04
Validation binary_cross_entropy = 0.091848
Epoch 4881
Loss = 5.3550e-02, PNorm = 587.4138, GNorm = 0.0172, lr_0 = 9.9677e-04
Validation binary_cross_entropy = 0.096834
Epoch 4882
Loss = 5.4032e-02, PNorm = 587.4561, GNorm = 4.7423, lr_0 = 9.9677e-04
Validation binary_cross_entropy = 0.081635
Epoch 4883
Loss = 2.6055e-02, PNorm = 587.5165, GNorm = 0.5016, lr_0 = 9.9677e-04
Validation binary_cross_entropy = 0.077664
Epoch 4884
Loss = 5.5358e-02, PNorm = 587.5918, GNorm = 1.8577, lr_0 = 9.9677e-04
Validation binary_cross_entropy = 0.077211
Epoch 4885
Loss = 5.1759e-03, PNorm = 587.6733, GNorm = 0.2291, lr_0 = 9.9677e-04
Validation binary_cross_entropy = 0.089463
Epoch 4886
Loss = 2.7117e-02, PNorm = 587.7228, GNorm = 0.0285, lr_0 = 9.9677e-04
Validation binary_cross_entropy = 0.083763
Epoch 4887
Loss = 4.0562e-02, PNorm = 587.7543, GNorm = 0.2161, lr_0 = 9.9676e-04
Validation binary_cross_entropy = 0.083344
Epoch 4888
Loss = 2.7850e-02, PNorm = 587.8099, GNorm = 0.5516, lr_0 = 9.9676e-04
Validation binary_cross_entropy = 0.090171
Epoch 4889
Loss = 4.1622e-02, PNorm = 587.8666, GNorm = 1.0673, lr_0 = 9.9676e-04
Loss = 5.3842e-02, PNorm = 587.9198, GNorm = 0.3319, lr_0 = 9.9676e-04
Validation binary_cross_entropy = 0.070697
Epoch 4890
Loss = 3.1616e-02, PNorm = 588.0080, GNorm = 0.6807, lr_0 = 9.9676e-04
Validation binary_cross_entropy = 0.109953
Epoch 4891
Loss = 2.0444e-02, PNorm = 588.0593, GNorm = 0.0658, lr_0 = 9.9676e-04
Validation binary_cross_entropy = 0.104594
Epoch 4892
Loss = 2.4351e-02, PNorm = 588.0958, GNorm = 0.0425, lr_0 = 9.9676e-04
Validation binary_cross_entropy = 0.103695
Epoch 4893
Loss = 3.4402e-02, PNorm = 588.1338, GNorm = 2.0516, lr_0 = 9.9676e-04
Validation binary_cross_entropy = 0.102194
Epoch 4894
Loss = 1.5696e-02, PNorm = 588.1715, GNorm = 0.1901, lr_0 = 9.9676e-04
Validation binary_cross_entropy = 0.077307
Epoch 4895
Loss = 4.9764e-02, PNorm = 588.2317, GNorm = 1.4168, lr_0 = 9.9676e-04
Validation binary_cross_entropy = 0.073376
Epoch 4896
Loss = 3.1031e-02, PNorm = 588.3218, GNorm = 0.7276, lr_0 = 9.9676e-04
Validation binary_cross_entropy = 0.088871
Epoch 4897
Loss = 4.7485e-02, PNorm = 588.3799, GNorm = 0.2211, lr_0 = 9.9676e-04
Validation binary_cross_entropy = 0.076495
Epoch 4898
Loss = 3.9309e-02, PNorm = 588.4223, GNorm = 0.2592, lr_0 = 9.9676e-04
Validation binary_cross_entropy = 0.070501
Epoch 4899
Loss = 2.3880e-02, PNorm = 588.4587, GNorm = 1.2050, lr_0 = 9.9676e-04
Loss = 2.7030e-02, PNorm = 588.5103, GNorm = 0.0909, lr_0 = 9.9675e-04
Validation binary_cross_entropy = 0.082551
Epoch 4900
Loss = 6.4136e-02, PNorm = 588.5544, GNorm = 5.2921, lr_0 = 9.9675e-04
Validation binary_cross_entropy = 0.078313
Epoch 4901
Loss = 5.0165e-02, PNorm = 588.6125, GNorm = 4.8423, lr_0 = 9.9675e-04
Validation binary_cross_entropy = 0.070247
Epoch 4902
Loss = 2.0395e-02, PNorm = 588.6769, GNorm = 0.6511, lr_0 = 9.9675e-04
Validation binary_cross_entropy = 0.067006
Epoch 4903
Loss = 1.2680e-01, PNorm = 588.7542, GNorm = 0.8710, lr_0 = 9.9675e-04
Validation binary_cross_entropy = 0.052127
Epoch 4904
Loss = 4.5966e-02, PNorm = 588.8455, GNorm = 0.3021, lr_0 = 9.9675e-04
Validation binary_cross_entropy = 0.067168
Epoch 4905
Loss = 3.3402e-02, PNorm = 588.9322, GNorm = 1.3475, lr_0 = 9.9675e-04
Validation binary_cross_entropy = 0.065347
Epoch 4906
Loss = 3.4880e-02, PNorm = 588.9883, GNorm = 1.0393, lr_0 = 9.9675e-04
Validation binary_cross_entropy = 0.071995
Epoch 4907
Loss = 2.8788e-02, PNorm = 589.0325, GNorm = 1.0049, lr_0 = 9.9675e-04
Validation binary_cross_entropy = 0.075534
Epoch 4908
Loss = 3.6025e-03, PNorm = 589.0751, GNorm = 0.1511, lr_0 = 9.9675e-04
Validation binary_cross_entropy = 0.061021
Epoch 4909
Loss = 4.4487e-02, PNorm = 589.1269, GNorm = 2.7652, lr_0 = 9.9675e-04
Loss = 1.5075e-02, PNorm = 589.2022, GNorm = 0.2261, lr_0 = 9.9675e-04
Validation binary_cross_entropy = 0.071853
Epoch 4910
Loss = 3.3838e-02, PNorm = 589.2576, GNorm = 0.7612, lr_0 = 9.9675e-04
Validation binary_cross_entropy = 0.077546
Epoch 4911
Loss = 3.5571e-02, PNorm = 589.2963, GNorm = 3.8178, lr_0 = 9.9675e-04
Validation binary_cross_entropy = 0.067567
Epoch 4912
Loss = 3.9259e-02, PNorm = 589.3361, GNorm = 0.2448, lr_0 = 9.9675e-04
Validation binary_cross_entropy = 0.081419
Epoch 4913
Loss = 2.0272e-02, PNorm = 589.3730, GNorm = 0.2568, lr_0 = 9.9674e-04
Validation binary_cross_entropy = 0.071302
Epoch 4914
Loss = 2.4299e-02, PNorm = 589.4075, GNorm = 0.0525, lr_0 = 9.9674e-04
Validation binary_cross_entropy = 0.069469
Epoch 4915
Loss = 9.4772e-03, PNorm = 589.4471, GNorm = 0.5354, lr_0 = 9.9674e-04
Validation binary_cross_entropy = 0.079939
Epoch 4916
Loss = 4.8902e-03, PNorm = 589.4866, GNorm = 0.4943, lr_0 = 9.9674e-04
Validation binary_cross_entropy = 0.089664
Epoch 4917
Loss = 4.4000e-02, PNorm = 589.5133, GNorm = 1.5182, lr_0 = 9.9674e-04
Validation binary_cross_entropy = 0.077614
Epoch 4918
Loss = 1.2155e-02, PNorm = 589.5404, GNorm = 0.4059, lr_0 = 9.9674e-04
Validation binary_cross_entropy = 0.082356
Epoch 4919
Loss = 1.3334e-02, PNorm = 589.5748, GNorm = 1.0425, lr_0 = 9.9674e-04
Loss = 8.7539e-03, PNorm = 589.6060, GNorm = 0.0162, lr_0 = 9.9674e-04
Validation binary_cross_entropy = 0.086847
Epoch 4920
Loss = 1.1586e-02, PNorm = 589.6343, GNorm = 0.0336, lr_0 = 9.9674e-04
Validation binary_cross_entropy = 0.105952
Epoch 4921
Loss = 1.9107e-02, PNorm = 589.6628, GNorm = 2.1847, lr_0 = 9.9674e-04
Validation binary_cross_entropy = 0.095054
Epoch 4922
Loss = 6.7621e-02, PNorm = 589.7063, GNorm = 7.3791, lr_0 = 9.9674e-04
Validation binary_cross_entropy = 0.074282
Epoch 4923
Loss = 2.6808e-02, PNorm = 589.7880, GNorm = 0.1213, lr_0 = 9.9674e-04
Validation binary_cross_entropy = 0.085231
Epoch 4924
Loss = 8.2185e-03, PNorm = 589.8455, GNorm = 0.2377, lr_0 = 9.9674e-04
Validation binary_cross_entropy = 0.081254
Epoch 4925
Loss = 4.4051e-02, PNorm = 589.8845, GNorm = 0.3155, lr_0 = 9.9674e-04
Validation binary_cross_entropy = 0.075749
Epoch 4926
Loss = 4.3780e-02, PNorm = 589.9312, GNorm = 1.2032, lr_0 = 9.9673e-04
Validation binary_cross_entropy = 0.078911
Epoch 4927
Loss = 1.3972e-02, PNorm = 589.9827, GNorm = 1.5586, lr_0 = 9.9673e-04
Validation binary_cross_entropy = 0.076434
Epoch 4928
Loss = 6.8570e-03, PNorm = 590.0331, GNorm = 0.2448, lr_0 = 9.9673e-04
Validation binary_cross_entropy = 0.094398
Epoch 4929
Loss = 1.1514e-01, PNorm = 590.0913, GNorm = 1.2053, lr_0 = 9.9673e-04
Loss = 2.8527e-02, PNorm = 590.1340, GNorm = 2.8596, lr_0 = 9.9673e-04
Validation binary_cross_entropy = 0.106109
Epoch 4930
Loss = 4.7490e-02, PNorm = 590.1823, GNorm = 1.0301, lr_0 = 9.9673e-04
Validation binary_cross_entropy = 0.089947
Epoch 4931
Loss = 1.4783e-02, PNorm = 590.2288, GNorm = 0.8877, lr_0 = 9.9673e-04
Validation binary_cross_entropy = 0.077487
Epoch 4932
Loss = 4.4293e-02, PNorm = 590.2802, GNorm = 0.0696, lr_0 = 9.9673e-04
Validation binary_cross_entropy = 0.082015
Epoch 4933
Loss = 1.3388e-02, PNorm = 590.3398, GNorm = 0.5144, lr_0 = 9.9673e-04
Validation binary_cross_entropy = 0.104063
Epoch 4934
Loss = 3.8974e-02, PNorm = 590.3748, GNorm = 1.2846, lr_0 = 9.9673e-04
Validation binary_cross_entropy = 0.082760
Epoch 4935
Loss = 9.5236e-03, PNorm = 590.4075, GNorm = 0.0982, lr_0 = 9.9673e-04
Validation binary_cross_entropy = 0.090789
Epoch 4936
Loss = 9.2647e-03, PNorm = 590.4506, GNorm = 0.9564, lr_0 = 9.9673e-04
Validation binary_cross_entropy = 0.096004
Epoch 4937
Loss = 2.3204e-02, PNorm = 590.4923, GNorm = 0.4868, lr_0 = 9.9673e-04
Validation binary_cross_entropy = 0.089160
Epoch 4938
Loss = 6.0469e-02, PNorm = 590.5422, GNorm = 1.3283, lr_0 = 9.9673e-04
Validation binary_cross_entropy = 0.105867
Epoch 4939
Loss = 2.2898e-04, PNorm = 590.6150, GNorm = 0.0287, lr_0 = 9.9673e-04
Loss = 3.3534e-02, PNorm = 590.6620, GNorm = 2.7189, lr_0 = 9.9672e-04
Validation binary_cross_entropy = 0.094623
Epoch 4940
Loss = 2.4207e-03, PNorm = 590.7025, GNorm = 0.0819, lr_0 = 9.9672e-04
Validation binary_cross_entropy = 0.097392
Epoch 4941
Loss = 1.0359e-02, PNorm = 590.7403, GNorm = 2.6767, lr_0 = 9.9672e-04
Validation binary_cross_entropy = 0.110476
Epoch 4942
Loss = 1.9589e-02, PNorm = 590.7808, GNorm = 0.1906, lr_0 = 9.9672e-04
Validation binary_cross_entropy = 0.099338
Epoch 4943
Loss = 4.5591e-02, PNorm = 590.8155, GNorm = 2.0657, lr_0 = 9.9672e-04
Validation binary_cross_entropy = 0.085204
Epoch 4944
Loss = 3.7505e-03, PNorm = 590.8545, GNorm = 0.1844, lr_0 = 9.9672e-04
Validation binary_cross_entropy = 0.096173
Epoch 4945
Loss = 1.1645e-02, PNorm = 590.8993, GNorm = 0.0332, lr_0 = 9.9672e-04
Validation binary_cross_entropy = 0.094458
Epoch 4946
Loss = 4.9617e-03, PNorm = 590.9322, GNorm = 0.1537, lr_0 = 9.9672e-04
Validation binary_cross_entropy = 0.094218
Epoch 4947
Loss = 3.1099e-02, PNorm = 590.9685, GNorm = 5.3341, lr_0 = 9.9672e-04
Validation binary_cross_entropy = 0.109662
Epoch 4948
Loss = 6.6557e-02, PNorm = 591.0096, GNorm = 0.0233, lr_0 = 9.9672e-04
Validation binary_cross_entropy = 0.100723
Epoch 4949
Loss = 7.7933e-03, PNorm = 591.0467, GNorm = 0.6204, lr_0 = 9.9672e-04
Loss = 2.8737e-02, PNorm = 591.0874, GNorm = 1.5268, lr_0 = 9.9672e-04
Validation binary_cross_entropy = 0.112797
Epoch 4950
Loss = 3.9531e-02, PNorm = 591.1159, GNorm = 0.1281, lr_0 = 9.9672e-04
Validation binary_cross_entropy = 0.096955
Epoch 4951
Loss = 3.4518e-02, PNorm = 591.1786, GNorm = 0.1771, lr_0 = 9.9672e-04
Validation binary_cross_entropy = 0.108910
Epoch 4952
Loss = 7.4417e-02, PNorm = 591.3046, GNorm = 0.7988, lr_0 = 9.9671e-04
Validation binary_cross_entropy = 0.139831
Epoch 4953
Loss = 1.0871e-02, PNorm = 591.4400, GNorm = 0.0145, lr_0 = 9.9671e-04
Validation binary_cross_entropy = 0.162678
Epoch 4954
Loss = 2.9079e-02, PNorm = 591.5256, GNorm = 0.0369, lr_0 = 9.9671e-04
Validation binary_cross_entropy = 0.138349
Epoch 4955
Loss = 2.9782e-02, PNorm = 591.6019, GNorm = 1.9301, lr_0 = 9.9671e-04
Validation binary_cross_entropy = 0.131166
Epoch 4956
Loss = 7.9559e-02, PNorm = 591.6711, GNorm = 0.2130, lr_0 = 9.9671e-04
Validation binary_cross_entropy = 0.092937
Epoch 4957
Loss = 2.0633e-02, PNorm = 591.7388, GNorm = 1.0272, lr_0 = 9.9671e-04
Validation binary_cross_entropy = 0.082989
Epoch 4958
Loss = 6.9229e-02, PNorm = 591.8512, GNorm = 1.9636, lr_0 = 9.9671e-04
Validation binary_cross_entropy = 0.115226
Epoch 4959
Loss = 1.9957e-02, PNorm = 591.9987, GNorm = 1.0760, lr_0 = 9.9671e-04
Loss = 7.4919e-02, PNorm = 592.1322, GNorm = 2.1282, lr_0 = 9.9671e-04
Validation binary_cross_entropy = 0.083253
Epoch 4960
Loss = 7.6531e-02, PNorm = 592.2399, GNorm = 1.8062, lr_0 = 9.9671e-04
Validation binary_cross_entropy = 0.105548
Epoch 4961
Loss = 5.1676e-02, PNorm = 592.3487, GNorm = 0.7717, lr_0 = 9.9671e-04
Validation binary_cross_entropy = 0.103729
Epoch 4962
Loss = 3.9977e-02, PNorm = 592.4408, GNorm = 3.0121, lr_0 = 9.9671e-04
Validation binary_cross_entropy = 0.112869
Epoch 4963
Loss = 5.2315e-02, PNorm = 592.5285, GNorm = 0.1851, lr_0 = 9.9671e-04
Validation binary_cross_entropy = 0.092773
Epoch 4964
Loss = 8.8930e-02, PNorm = 592.6214, GNorm = 0.0353, lr_0 = 9.9671e-04
Validation binary_cross_entropy = 0.114015
Epoch 4965
Loss = 5.4368e-02, PNorm = 592.6980, GNorm = 0.1086, lr_0 = 9.9671e-04
Validation binary_cross_entropy = 0.095078
Epoch 4966
Loss = 3.8404e-02, PNorm = 592.7536, GNorm = 0.2167, lr_0 = 9.9670e-04
Validation binary_cross_entropy = 0.082342
Epoch 4967
Loss = 2.1991e-02, PNorm = 592.8062, GNorm = 0.5547, lr_0 = 9.9670e-04
Validation binary_cross_entropy = 0.081760
Epoch 4968
Loss = 1.4989e-02, PNorm = 592.8577, GNorm = 0.5186, lr_0 = 9.9670e-04
Validation binary_cross_entropy = 0.104434
Epoch 4969
Loss = 7.0724e-03, PNorm = 592.9216, GNorm = 0.4053, lr_0 = 9.9670e-04
Loss = 1.3709e-02, PNorm = 592.9784, GNorm = 0.1169, lr_0 = 9.9670e-04
Validation binary_cross_entropy = 0.138453
Epoch 4970
Loss = 7.0196e-02, PNorm = 593.0176, GNorm = 1.2281, lr_0 = 9.9670e-04
Validation binary_cross_entropy = 0.127677
Epoch 4971
Loss = 6.3382e-02, PNorm = 593.0671, GNorm = 0.7864, lr_0 = 9.9670e-04
Validation binary_cross_entropy = 0.073121
Epoch 4972
Loss = 2.4965e-02, PNorm = 593.1153, GNorm = 1.3733, lr_0 = 9.9670e-04
Validation binary_cross_entropy = 0.070288
Epoch 4973
Loss = 1.6191e-02, PNorm = 593.1576, GNorm = 0.1547, lr_0 = 9.9670e-04
Validation binary_cross_entropy = 0.077096
Epoch 4974
Loss = 3.6685e-02, PNorm = 593.1992, GNorm = 1.4648, lr_0 = 9.9670e-04
Validation binary_cross_entropy = 0.069567
Epoch 4975
Loss = 2.8292e-02, PNorm = 593.2513, GNorm = 1.1325, lr_0 = 9.9670e-04
Validation binary_cross_entropy = 0.077410
Epoch 4976
Loss = 5.2029e-02, PNorm = 593.3040, GNorm = 0.2759, lr_0 = 9.9670e-04
Validation binary_cross_entropy = 0.096345
Epoch 4977
Loss = 3.9031e-02, PNorm = 593.3585, GNorm = 0.2045, lr_0 = 9.9670e-04
Validation binary_cross_entropy = 0.083149
Epoch 4978
Loss = 4.9428e-02, PNorm = 593.4323, GNorm = 3.0650, lr_0 = 9.9670e-04
Validation binary_cross_entropy = 0.112481
Epoch 4979
Loss = 5.2898e-03, PNorm = 593.4988, GNorm = 0.3197, lr_0 = 9.9670e-04
Loss = 5.6829e-02, PNorm = 593.5372, GNorm = 0.2650, lr_0 = 9.9669e-04
Validation binary_cross_entropy = 0.090535
Epoch 4980
Loss = 7.4866e-02, PNorm = 593.5883, GNorm = 0.6043, lr_0 = 9.9669e-04
Validation binary_cross_entropy = 0.085932
Epoch 4981
Loss = 2.5869e-02, PNorm = 593.6619, GNorm = 0.1021, lr_0 = 9.9669e-04
Validation binary_cross_entropy = 0.098203
Epoch 4982
Loss = 1.6769e-02, PNorm = 593.7117, GNorm = 0.0760, lr_0 = 9.9669e-04
Validation binary_cross_entropy = 0.090937
Epoch 4983
Loss = 1.0951e-02, PNorm = 593.7492, GNorm = 0.7051, lr_0 = 9.9669e-04
Validation binary_cross_entropy = 0.115122
Epoch 4984
Loss = 6.4692e-02, PNorm = 593.7798, GNorm = 1.9811, lr_0 = 9.9669e-04
Validation binary_cross_entropy = 0.103531
Epoch 4985
Loss = 4.0188e-02, PNorm = 593.8154, GNorm = 0.1266, lr_0 = 9.9669e-04
Validation binary_cross_entropy = 0.101684
Epoch 4986
Loss = 3.8412e-02, PNorm = 593.8668, GNorm = 0.9126, lr_0 = 9.9669e-04
Validation binary_cross_entropy = 0.114832
Epoch 4987
Loss = 1.1630e-02, PNorm = 593.9057, GNorm = 1.5793, lr_0 = 9.9669e-04
Validation binary_cross_entropy = 0.103440
Epoch 4988
Loss = 3.1539e-02, PNorm = 593.9355, GNorm = 2.0740, lr_0 = 9.9669e-04
Validation binary_cross_entropy = 0.116254
Epoch 4989
Loss = 4.2150e-03, PNorm = 593.9800, GNorm = 0.2784, lr_0 = 9.9669e-04
Loss = 8.1449e-02, PNorm = 594.0250, GNorm = 3.6239, lr_0 = 9.9669e-04
Validation binary_cross_entropy = 0.097004
Epoch 4990
Loss = 1.9140e-02, PNorm = 594.0699, GNorm = 0.3146, lr_0 = 9.9669e-04
Validation binary_cross_entropy = 0.087641
Epoch 4991
Loss = 1.4406e-02, PNorm = 594.1129, GNorm = 2.3798, lr_0 = 9.9669e-04
Validation binary_cross_entropy = 0.102640
Epoch 4992
Loss = 5.4794e-02, PNorm = 594.1486, GNorm = 3.2971, lr_0 = 9.9668e-04
Validation binary_cross_entropy = 0.101858
Epoch 4993
Loss = 3.9607e-02, PNorm = 594.2050, GNorm = 1.1528, lr_0 = 9.9668e-04
Validation binary_cross_entropy = 0.102417
Epoch 4994
Loss = 3.3696e-02, PNorm = 594.2582, GNorm = 2.1756, lr_0 = 9.9668e-04
Validation binary_cross_entropy = 0.091187
Epoch 4995
Loss = 1.0531e-02, PNorm = 594.3257, GNorm = 0.2146, lr_0 = 9.9668e-04
Validation binary_cross_entropy = 0.104966
Epoch 4996
Loss = 7.6245e-03, PNorm = 594.3753, GNorm = 0.1824, lr_0 = 9.9668e-04
Validation binary_cross_entropy = 0.110928
Epoch 4997
Loss = 5.5756e-03, PNorm = 594.4128, GNorm = 0.6829, lr_0 = 9.9668e-04
Validation binary_cross_entropy = 0.116585
Epoch 4998
Loss = 3.7057e-03, PNorm = 594.4431, GNorm = 0.0408, lr_0 = 9.9668e-04
Validation binary_cross_entropy = 0.126463
Epoch 4999
Loss = 8.3991e-03, PNorm = 594.4820, GNorm = 0.5123, lr_0 = 9.9668e-04
Loss = 3.6991e-02, PNorm = 594.5403, GNorm = 1.1318, lr_0 = 9.9668e-04
Validation binary_cross_entropy = 0.111412
Epoch 5000
Loss = 2.3218e-02, PNorm = 594.6135, GNorm = 0.8724, lr_0 = 9.9668e-04
Validation binary_cross_entropy = 0.140450
Epoch 5001
Loss = 1.0612e-01, PNorm = 594.6646, GNorm = 1.7648, lr_0 = 9.9668e-04
Validation binary_cross_entropy = 0.100144
Epoch 5002
Loss = 4.8999e-02, PNorm = 594.7317, GNorm = 1.0342, lr_0 = 9.9668e-04
Validation binary_cross_entropy = 0.086659
Epoch 5003
Loss = 4.4338e-02, PNorm = 594.8064, GNorm = 0.6075, lr_0 = 9.9668e-04
Validation binary_cross_entropy = 0.076311
Epoch 5004
Loss = 6.4450e-02, PNorm = 594.8683, GNorm = 0.9836, lr_0 = 9.9668e-04
Validation binary_cross_entropy = 0.081477
Epoch 5005
Loss = 1.4143e-02, PNorm = 594.9465, GNorm = 1.1368, lr_0 = 9.9668e-04
Validation binary_cross_entropy = 0.106778
Epoch 5006
Loss = 2.7297e-02, PNorm = 594.9975, GNorm = 2.9336, lr_0 = 9.9667e-04
Validation binary_cross_entropy = 0.097451
Epoch 5007
Loss = 8.3890e-03, PNorm = 595.0338, GNorm = 0.1145, lr_0 = 9.9667e-04
Validation binary_cross_entropy = 0.087554
Epoch 5008
Loss = 5.0577e-03, PNorm = 595.0594, GNorm = 0.1265, lr_0 = 9.9667e-04
Validation binary_cross_entropy = 0.068587
Epoch 5009
Loss = 5.2065e-02, PNorm = 595.1066, GNorm = 2.8906, lr_0 = 9.9667e-04
Loss = 4.1278e-02, PNorm = 595.1900, GNorm = 1.1642, lr_0 = 9.9667e-04
Validation binary_cross_entropy = 0.103206
Epoch 5010
Loss = 3.9970e-02, PNorm = 595.2462, GNorm = 0.2884, lr_0 = 9.9667e-04
Validation binary_cross_entropy = 0.081136
Epoch 5011
Loss = 4.7837e-02, PNorm = 595.3160, GNorm = 0.8957, lr_0 = 9.9667e-04
Validation binary_cross_entropy = 0.079048
Epoch 5012
Loss = 7.0548e-02, PNorm = 595.4019, GNorm = 1.5750, lr_0 = 9.9667e-04
Validation binary_cross_entropy = 0.082787
Epoch 5013
Loss = 1.8856e-02, PNorm = 595.4726, GNorm = 0.2212, lr_0 = 9.9667e-04
Validation binary_cross_entropy = 0.084188
Epoch 5014
Loss = 2.1880e-02, PNorm = 595.5328, GNorm = 0.9751, lr_0 = 9.9667e-04
Validation binary_cross_entropy = 0.105174
Epoch 5015
Loss = 4.4108e-02, PNorm = 595.5706, GNorm = 0.9563, lr_0 = 9.9667e-04
Validation binary_cross_entropy = 0.080972
Epoch 5016
Loss = 1.3400e-02, PNorm = 595.6106, GNorm = 0.1354, lr_0 = 9.9667e-04
Validation binary_cross_entropy = 0.072507
Epoch 5017
Loss = 1.2379e-01, PNorm = 595.6559, GNorm = 1.6171, lr_0 = 9.9667e-04
Validation binary_cross_entropy = 0.067497
Epoch 5018
Loss = 4.6855e-02, PNorm = 595.7268, GNorm = 4.8031, lr_0 = 9.9667e-04
Validation binary_cross_entropy = 0.067637
Epoch 5019
Loss = 1.6480e-02, PNorm = 595.8193, GNorm = 0.8349, lr_0 = 9.9666e-04
Loss = 3.3149e-02, PNorm = 595.9087, GNorm = 0.0754, lr_0 = 9.9666e-04
Validation binary_cross_entropy = 0.102016
Epoch 5020
Loss = 4.9874e-02, PNorm = 595.9645, GNorm = 1.4094, lr_0 = 9.9666e-04
Validation binary_cross_entropy = 0.090951
Epoch 5021
Loss = 2.6792e-02, PNorm = 596.0099, GNorm = 0.5981, lr_0 = 9.9666e-04
Validation binary_cross_entropy = 0.083794
Epoch 5022
Loss = 4.5225e-02, PNorm = 596.0674, GNorm = 0.9538, lr_0 = 9.9666e-04
Validation binary_cross_entropy = 0.086257
Epoch 5023
Loss = 1.4362e-02, PNorm = 596.1158, GNorm = 0.6988, lr_0 = 9.9666e-04
Validation binary_cross_entropy = 0.091089
Epoch 5024
Loss = 5.0528e-03, PNorm = 596.1553, GNorm = 1.5917, lr_0 = 9.9666e-04
Validation binary_cross_entropy = 0.120792
Epoch 5025
Loss = 4.0775e-03, PNorm = 596.1852, GNorm = 0.6356, lr_0 = 9.9666e-04
Validation binary_cross_entropy = 0.104395
Epoch 5026
Loss = 6.5687e-02, PNorm = 596.2306, GNorm = 6.5488, lr_0 = 9.9666e-04
Validation binary_cross_entropy = 0.118086
Epoch 5027
Loss = 7.8650e-02, PNorm = 596.2984, GNorm = 0.1039, lr_0 = 9.9666e-04
Validation binary_cross_entropy = 0.073056
Epoch 5028
Loss = 1.3086e-02, PNorm = 596.3531, GNorm = 1.5105, lr_0 = 9.9666e-04
Validation binary_cross_entropy = 0.069771
Epoch 5029
Loss = 1.8785e-02, PNorm = 596.4185, GNorm = 0.9288, lr_0 = 9.9666e-04
Loss = 4.9263e-02, PNorm = 596.4764, GNorm = 0.6864, lr_0 = 9.9666e-04
Validation binary_cross_entropy = 0.073825
Epoch 5030
Loss = 7.7921e-02, PNorm = 596.5382, GNorm = 0.6008, lr_0 = 9.9666e-04
Validation binary_cross_entropy = 0.063689
Epoch 5031
Loss = 6.1363e-02, PNorm = 596.6174, GNorm = 4.9377, lr_0 = 9.9666e-04
Validation binary_cross_entropy = 0.079133
Epoch 5032
Loss = 2.4476e-02, PNorm = 596.7035, GNorm = 0.8282, lr_0 = 9.9665e-04
Validation binary_cross_entropy = 0.091427
Epoch 5033
Loss = 3.7616e-02, PNorm = 596.7637, GNorm = 1.2321, lr_0 = 9.9665e-04
Validation binary_cross_entropy = 0.095335
Epoch 5034
Loss = 5.9984e-03, PNorm = 596.8130, GNorm = 0.0516, lr_0 = 9.9665e-04
Validation binary_cross_entropy = 0.096096
Epoch 5035
Loss = 1.7662e-02, PNorm = 596.8584, GNorm = 1.4866, lr_0 = 9.9665e-04
Validation binary_cross_entropy = 0.097806
Epoch 5036
Loss = 4.5254e-03, PNorm = 596.8988, GNorm = 0.1349, lr_0 = 9.9665e-04
Validation binary_cross_entropy = 0.091123
Epoch 5037
Loss = 3.9653e-03, PNorm = 596.9389, GNorm = 0.5901, lr_0 = 9.9665e-04
Validation binary_cross_entropy = 0.093215
Epoch 5038
Loss = 9.0077e-03, PNorm = 596.9692, GNorm = 0.0659, lr_0 = 9.9665e-04
Validation binary_cross_entropy = 0.081723
Epoch 5039
Loss = 1.5733e-01, PNorm = 596.9918, GNorm = 1.7352, lr_0 = 9.9665e-04
Loss = 7.2944e-02, PNorm = 597.0742, GNorm = 1.8007, lr_0 = 9.9665e-04
Validation binary_cross_entropy = 0.076967
Epoch 5040
Loss = 6.4202e-02, PNorm = 597.1789, GNorm = 0.4518, lr_0 = 9.9665e-04
Validation binary_cross_entropy = 0.069285
Epoch 5041
Loss = 4.9757e-02, PNorm = 597.2788, GNorm = 1.0524, lr_0 = 9.9665e-04
Validation binary_cross_entropy = 0.084721
Epoch 5042
Loss = 3.8511e-02, PNorm = 597.3466, GNorm = 1.3723, lr_0 = 9.9665e-04
Validation binary_cross_entropy = 0.071761
Epoch 5043
Loss = 5.7100e-02, PNorm = 597.4228, GNorm = 3.5464, lr_0 = 9.9665e-04
Validation binary_cross_entropy = 0.081069
Epoch 5044
Loss = 2.6006e-02, PNorm = 597.5002, GNorm = 1.6079, lr_0 = 9.9665e-04
Validation binary_cross_entropy = 0.087891
Epoch 5045
Loss = 7.0086e-02, PNorm = 597.5599, GNorm = 0.8904, lr_0 = 9.9664e-04
Validation binary_cross_entropy = 0.085238
Epoch 5046
Loss = 1.3144e-02, PNorm = 597.6021, GNorm = 0.1307, lr_0 = 9.9664e-04
Validation binary_cross_entropy = 0.088213
Epoch 5047
Loss = 2.1014e-02, PNorm = 597.6458, GNorm = 0.5825, lr_0 = 9.9664e-04
Validation binary_cross_entropy = 0.082076
Epoch 5048
Loss = 2.6731e-02, PNorm = 597.6956, GNorm = 0.6127, lr_0 = 9.9664e-04
Validation binary_cross_entropy = 0.092000
Epoch 5049
Loss = 7.9812e-03, PNorm = 597.7600, GNorm = 0.6236, lr_0 = 9.9664e-04
Loss = 4.1901e-02, PNorm = 597.8319, GNorm = 0.0645, lr_0 = 9.9664e-04
Validation binary_cross_entropy = 0.105439
Epoch 5050
Loss = 4.2935e-02, PNorm = 597.8898, GNorm = 2.1318, lr_0 = 9.9664e-04
Validation binary_cross_entropy = 0.079180
Epoch 5051
Loss = 8.3869e-02, PNorm = 597.9828, GNorm = 0.3222, lr_0 = 9.9664e-04
Validation binary_cross_entropy = 0.074446
Epoch 5052
Loss = 7.5699e-02, PNorm = 598.0917, GNorm = 7.2498, lr_0 = 9.9664e-04
Validation binary_cross_entropy = 0.086739
Epoch 5053
Loss = 6.8125e-02, PNorm = 598.1886, GNorm = 1.7445, lr_0 = 9.9664e-04
Validation binary_cross_entropy = 0.082079
Epoch 5054
Loss = 4.0641e-02, PNorm = 598.2650, GNorm = 2.1529, lr_0 = 9.9664e-04
Validation binary_cross_entropy = 0.138349
Epoch 5055
Loss = 5.4444e-02, PNorm = 598.3401, GNorm = 0.1183, lr_0 = 9.9664e-04
Validation binary_cross_entropy = 0.118739
Epoch 5056
Loss = 2.0947e-02, PNorm = 598.4029, GNorm = 3.1145, lr_0 = 9.9664e-04
Validation binary_cross_entropy = 0.137865
Epoch 5057
Loss = 2.8748e-02, PNorm = 598.4665, GNorm = 0.1186, lr_0 = 9.9664e-04
Validation binary_cross_entropy = 0.120886
Epoch 5058
Loss = 3.3364e-02, PNorm = 598.5486, GNorm = 2.4759, lr_0 = 9.9664e-04
Validation binary_cross_entropy = 0.131559
Epoch 5059
Loss = 9.8707e-02, PNorm = 598.6486, GNorm = 2.2001, lr_0 = 9.9663e-04
Loss = 5.2513e-02, PNorm = 598.7434, GNorm = 2.5177, lr_0 = 9.9663e-04
Validation binary_cross_entropy = 0.082398
Epoch 5060
Loss = 5.9937e-02, PNorm = 598.8364, GNorm = 0.4447, lr_0 = 9.9663e-04
Validation binary_cross_entropy = 0.077719
Epoch 5061
Loss = 1.0266e-01, PNorm = 598.9257, GNorm = 0.1913, lr_0 = 9.9663e-04
Validation binary_cross_entropy = 0.083493
Epoch 5062
Loss = 2.6361e-02, PNorm = 599.0025, GNorm = 3.3315, lr_0 = 9.9663e-04
Validation binary_cross_entropy = 0.107910
Epoch 5063
Loss = 7.2693e-02, PNorm = 599.0555, GNorm = 0.0615, lr_0 = 9.9663e-04
Validation binary_cross_entropy = 0.087081
Epoch 5064
Loss = 1.4512e-02, PNorm = 599.1063, GNorm = 0.2012, lr_0 = 9.9663e-04
Validation binary_cross_entropy = 0.098365
Epoch 5065
Loss = 8.5384e-02, PNorm = 599.1550, GNorm = 1.3654, lr_0 = 9.9663e-04
Validation binary_cross_entropy = 0.107628
Epoch 5066
Loss = 8.4172e-03, PNorm = 599.2092, GNorm = 0.0207, lr_0 = 9.9663e-04
Validation binary_cross_entropy = 0.118396
Epoch 5067
Loss = 7.5134e-02, PNorm = 599.2493, GNorm = 0.0618, lr_0 = 9.9663e-04
Validation binary_cross_entropy = 0.100230
Epoch 5068
Loss = 1.5154e-02, PNorm = 599.2801, GNorm = 1.1138, lr_0 = 9.9663e-04
Validation binary_cross_entropy = 0.158980
Epoch 5069
Loss = 7.0444e-03, PNorm = 599.3274, GNorm = 0.3798, lr_0 = 9.9663e-04
Loss = 3.4560e-02, PNorm = 599.3778, GNorm = 2.1784, lr_0 = 9.9663e-04
Validation binary_cross_entropy = 0.168290
Epoch 5070
Loss = 3.7256e-02, PNorm = 599.4423, GNorm = 0.0353, lr_0 = 9.9663e-04
Validation binary_cross_entropy = 0.202318
Epoch 5071
Loss = 5.0664e-02, PNorm = 599.4953, GNorm = 4.4337, lr_0 = 9.9663e-04
Validation binary_cross_entropy = 0.185457
Epoch 5072
Loss = 2.5589e-01, PNorm = 599.5479, GNorm = 0.1301, lr_0 = 9.9662e-04
Validation binary_cross_entropy = 0.095217
Epoch 5073
Loss = 9.7503e-02, PNorm = 599.6366, GNorm = 2.3331, lr_0 = 9.9662e-04
Validation binary_cross_entropy = 0.128665
Epoch 5074
Loss = 8.5400e-02, PNorm = 599.7203, GNorm = 3.3901, lr_0 = 9.9662e-04
Validation binary_cross_entropy = 0.107982
Epoch 5075
Loss = 5.8717e-02, PNorm = 599.7943, GNorm = 0.1177, lr_0 = 9.9662e-04
Validation binary_cross_entropy = 0.120009
Epoch 5076
Loss = 2.7755e-02, PNorm = 599.8553, GNorm = 0.9282, lr_0 = 9.9662e-04
Validation binary_cross_entropy = 0.097735
Epoch 5077
Loss = 4.7313e-01, PNorm = 599.9086, GNorm = 1.4268, lr_0 = 9.9662e-04
Validation binary_cross_entropy = 0.060985
Epoch 5078
Loss = 8.1912e-02, PNorm = 599.9953, GNorm = 3.5281, lr_0 = 9.9662e-04
Validation binary_cross_entropy = 0.070329
Epoch 5079
Loss = 1.0144e-02, PNorm = 600.0768, GNorm = 0.5703, lr_0 = 9.9662e-04
Loss = 3.5617e-02, PNorm = 600.1380, GNorm = 1.1203, lr_0 = 9.9662e-04
Validation binary_cross_entropy = 0.067025
Epoch 5080
Loss = 6.0283e-02, PNorm = 600.1918, GNorm = 0.6396, lr_0 = 9.9662e-04
Validation binary_cross_entropy = 0.066355
Epoch 5081
Loss = 3.5895e-02, PNorm = 600.2500, GNorm = 0.3590, lr_0 = 9.9662e-04
Validation binary_cross_entropy = 0.090338
Epoch 5082
Loss = 6.9548e-02, PNorm = 600.3033, GNorm = 4.4038, lr_0 = 9.9662e-04
Validation binary_cross_entropy = 0.080817
Epoch 5083
Loss = 4.3134e-02, PNorm = 600.3652, GNorm = 2.9248, lr_0 = 9.9662e-04
Validation binary_cross_entropy = 0.080217
Epoch 5084
Loss = 2.0709e-02, PNorm = 600.4105, GNorm = 1.7002, lr_0 = 9.9662e-04
Validation binary_cross_entropy = 0.077619
Epoch 5085
Loss = 2.5548e-02, PNorm = 600.4444, GNorm = 1.1415, lr_0 = 9.9661e-04
Validation binary_cross_entropy = 0.071533
Epoch 5086
Loss = 2.2761e-02, PNorm = 600.4879, GNorm = 0.5243, lr_0 = 9.9661e-04
Validation binary_cross_entropy = 0.076156
Epoch 5087
Loss = 6.0361e-02, PNorm = 600.5560, GNorm = 1.0570, lr_0 = 9.9661e-04
Validation binary_cross_entropy = 0.072400
Epoch 5088
Loss = 9.7071e-02, PNorm = 600.6086, GNorm = 1.6596, lr_0 = 9.9661e-04
Validation binary_cross_entropy = 0.079590
Epoch 5089
Loss = 5.2126e-03, PNorm = 600.6558, GNorm = 0.2265, lr_0 = 9.9661e-04
Loss = 2.3110e-02, PNorm = 600.6942, GNorm = 0.1289, lr_0 = 9.9661e-04
Validation binary_cross_entropy = 0.092234
Epoch 5090
Loss = 1.7747e-02, PNorm = 600.7347, GNorm = 0.0573, lr_0 = 9.9661e-04
Validation binary_cross_entropy = 0.116151
Epoch 5091
Loss = 6.7861e-02, PNorm = 600.7682, GNorm = 0.2396, lr_0 = 9.9661e-04
Validation binary_cross_entropy = 0.094191
Epoch 5092
Loss = 4.4073e-02, PNorm = 600.8410, GNorm = 0.7874, lr_0 = 9.9661e-04
Validation binary_cross_entropy = 0.103799
Epoch 5093
Loss = 4.0640e-02, PNorm = 600.9050, GNorm = 0.0418, lr_0 = 9.9661e-04
Validation binary_cross_entropy = 0.093770
Epoch 5094
Loss = 4.9302e-02, PNorm = 600.9466, GNorm = 0.4811, lr_0 = 9.9661e-04
Validation binary_cross_entropy = 0.085786
Epoch 5095
Loss = 1.2682e-02, PNorm = 600.9857, GNorm = 1.9654, lr_0 = 9.9661e-04
Validation binary_cross_entropy = 0.085155
Epoch 5096
Loss = 3.2721e-02, PNorm = 601.0299, GNorm = 1.0813, lr_0 = 9.9661e-04
Validation binary_cross_entropy = 0.095364
Epoch 5097
Loss = 9.8964e-03, PNorm = 601.0850, GNorm = 0.0362, lr_0 = 9.9661e-04
Validation binary_cross_entropy = 0.096062
Epoch 5098
Loss = 1.2951e-02, PNorm = 601.1272, GNorm = 1.5159, lr_0 = 9.9661e-04
Validation binary_cross_entropy = 0.107983
Epoch 5099
Loss = 8.3693e-04, PNorm = 601.1704, GNorm = 0.0313, lr_0 = 9.9660e-04
Loss = 1.7640e-02, PNorm = 601.2051, GNorm = 0.1513, lr_0 = 9.9660e-04
Validation binary_cross_entropy = 0.108658
Epoch 5100
Loss = 5.5273e-02, PNorm = 601.2519, GNorm = 0.2594, lr_0 = 9.9660e-04
Validation binary_cross_entropy = 0.095005
Epoch 5101
Loss = 4.2067e-02, PNorm = 601.3216, GNorm = 0.6855, lr_0 = 9.9660e-04
Validation binary_cross_entropy = 0.091322
Epoch 5102
Loss = 6.6826e-02, PNorm = 601.4026, GNorm = 0.0618, lr_0 = 9.9660e-04
Validation binary_cross_entropy = 0.118326
Epoch 5103
Loss = 4.3844e-02, PNorm = 601.4620, GNorm = 0.0570, lr_0 = 9.9660e-04
Validation binary_cross_entropy = 0.092035
Epoch 5104
Loss = 4.3999e-02, PNorm = 601.4940, GNorm = 1.8310, lr_0 = 9.9660e-04
Validation binary_cross_entropy = 0.079533
Epoch 5105
Loss = 8.8213e-03, PNorm = 601.5317, GNorm = 0.6879, lr_0 = 9.9660e-04
Validation binary_cross_entropy = 0.096595
Epoch 5106
Loss = 1.6609e-02, PNorm = 601.5832, GNorm = 3.4629, lr_0 = 9.9660e-04
Validation binary_cross_entropy = 0.099023
Epoch 5107
Loss = 4.4282e-02, PNorm = 601.6209, GNorm = 0.2815, lr_0 = 9.9660e-04
Validation binary_cross_entropy = 0.085230
Epoch 5108
Loss = 5.0573e-02, PNorm = 601.6641, GNorm = 2.9786, lr_0 = 9.9660e-04
Validation binary_cross_entropy = 0.086196
Epoch 5109
Loss = 1.1625e-02, PNorm = 601.7248, GNorm = 0.7028, lr_0 = 9.9660e-04
Loss = 1.0642e-02, PNorm = 601.7871, GNorm = 0.1092, lr_0 = 9.9660e-04
Validation binary_cross_entropy = 0.094356
Epoch 5110
Loss = 2.4841e-02, PNorm = 601.8292, GNorm = 0.1261, lr_0 = 9.9660e-04
Validation binary_cross_entropy = 0.097268
Epoch 5111
Loss = 1.9442e-02, PNorm = 601.8587, GNorm = 0.0389, lr_0 = 9.9659e-04
Validation binary_cross_entropy = 0.094009
Epoch 5112
Loss = 1.8573e-02, PNorm = 601.8873, GNorm = 0.3635, lr_0 = 9.9659e-04
Validation binary_cross_entropy = 0.105775
Epoch 5113
Loss = 2.5292e-02, PNorm = 601.9061, GNorm = 1.7252, lr_0 = 9.9659e-04
Validation binary_cross_entropy = 0.095111
Epoch 5114
Loss = 7.0767e-02, PNorm = 601.9469, GNorm = 0.0249, lr_0 = 9.9659e-04
Validation binary_cross_entropy = 0.097622
Epoch 5115
Loss = 5.5160e-02, PNorm = 602.0027, GNorm = 1.4645, lr_0 = 9.9659e-04
Validation binary_cross_entropy = 0.090878
Epoch 5116
Loss = 2.9313e-02, PNorm = 602.0334, GNorm = 0.4360, lr_0 = 9.9659e-04
Validation binary_cross_entropy = 0.084965
Epoch 5117
Loss = 2.5675e-02, PNorm = 602.0675, GNorm = 1.4957, lr_0 = 9.9659e-04
Validation binary_cross_entropy = 0.093038
Epoch 5118
Loss = 1.9275e-03, PNorm = 602.1097, GNorm = 0.0573, lr_0 = 9.9659e-04
Validation binary_cross_entropy = 0.109439
Epoch 5119
Loss = 2.0692e-03, PNorm = 602.1382, GNorm = 0.0973, lr_0 = 9.9659e-04
Loss = 8.2996e-02, PNorm = 602.1598, GNorm = 0.0867, lr_0 = 9.9659e-04
Validation binary_cross_entropy = 0.082872
Epoch 5120
Loss = 3.4857e-02, PNorm = 602.2367, GNorm = 1.4109, lr_0 = 9.9659e-04
Validation binary_cross_entropy = 0.084405
Epoch 5121
Loss = 4.1504e-02, PNorm = 602.3171, GNorm = 0.2188, lr_0 = 9.9659e-04
Validation binary_cross_entropy = 0.094467
Epoch 5122
Loss = 3.4207e-02, PNorm = 602.3689, GNorm = 1.1821, lr_0 = 9.9659e-04
Validation binary_cross_entropy = 0.091843
Epoch 5123
Loss = 4.7078e-02, PNorm = 602.4167, GNorm = 1.3121, lr_0 = 9.9659e-04
Validation binary_cross_entropy = 0.097059
Epoch 5124
Loss = 2.2929e-02, PNorm = 602.4660, GNorm = 1.1448, lr_0 = 9.9659e-04
Validation binary_cross_entropy = 0.105421
Epoch 5125
Loss = 1.2441e-01, PNorm = 602.5043, GNorm = 4.4151, lr_0 = 9.9658e-04
Validation binary_cross_entropy = 0.105175
Epoch 5126
Loss = 3.9856e-02, PNorm = 602.5433, GNorm = 2.9361, lr_0 = 9.9658e-04
Validation binary_cross_entropy = 0.080692
Epoch 5127
Loss = 6.9260e-03, PNorm = 602.5886, GNorm = 0.1897, lr_0 = 9.9658e-04
Validation binary_cross_entropy = 0.090135
Epoch 5128
Loss = 4.0250e-02, PNorm = 602.6274, GNorm = 0.0971, lr_0 = 9.9658e-04
Validation binary_cross_entropy = 0.089735
Epoch 5129
Loss = 7.5689e-03, PNorm = 602.6667, GNorm = 0.4538, lr_0 = 9.9658e-04
Loss = 4.2756e-02, PNorm = 602.7093, GNorm = 2.0356, lr_0 = 9.9658e-04
Validation binary_cross_entropy = 0.076789
Epoch 5130
Loss = 4.2607e-02, PNorm = 602.7664, GNorm = 0.1802, lr_0 = 9.9658e-04
Validation binary_cross_entropy = 0.080473
Epoch 5131
Loss = 9.0550e-02, PNorm = 602.8193, GNorm = 0.8630, lr_0 = 9.9658e-04
Validation binary_cross_entropy = 0.092876
Epoch 5132
Loss = 3.5066e-02, PNorm = 602.8615, GNorm = 3.0059, lr_0 = 9.9658e-04
Validation binary_cross_entropy = 0.083595
Epoch 5133
Loss = 4.9601e-02, PNorm = 602.8910, GNorm = 0.7195, lr_0 = 9.9658e-04
Validation binary_cross_entropy = 0.067660
Epoch 5134
Loss = 2.8709e-02, PNorm = 602.9404, GNorm = 1.0612, lr_0 = 9.9658e-04
Validation binary_cross_entropy = 0.092423
Epoch 5135
Loss = 5.1013e-02, PNorm = 602.9868, GNorm = 1.6940, lr_0 = 9.9658e-04
Validation binary_cross_entropy = 0.091375
Epoch 5136
Loss = 4.2600e-03, PNorm = 603.0178, GNorm = 0.1039, lr_0 = 9.9658e-04
Validation binary_cross_entropy = 0.090534
Epoch 5137
Loss = 2.6920e-03, PNorm = 603.0446, GNorm = 0.0194, lr_0 = 9.9658e-04
Validation binary_cross_entropy = 0.106033
Epoch 5138
Loss = 3.8996e-02, PNorm = 603.0669, GNorm = 2.4753, lr_0 = 9.9657e-04
Validation binary_cross_entropy = 0.071345
Epoch 5139
Loss = 1.2272e-02, PNorm = 603.1023, GNorm = 0.7414, lr_0 = 9.9657e-04
Loss = 1.7236e-02, PNorm = 603.1621, GNorm = 0.0869, lr_0 = 9.9657e-04
Validation binary_cross_entropy = 0.094227
Epoch 5140
Loss = 4.7015e-02, PNorm = 603.2013, GNorm = 3.1640, lr_0 = 9.9657e-04
Validation binary_cross_entropy = 0.092248
Epoch 5141
Loss = 1.5356e-02, PNorm = 603.2378, GNorm = 0.5029, lr_0 = 9.9657e-04
Validation binary_cross_entropy = 0.097854
Epoch 5142
Loss = 1.2009e-02, PNorm = 603.2730, GNorm = 0.1599, lr_0 = 9.9657e-04
Validation binary_cross_entropy = 0.153482
Epoch 5143
Loss = 3.6754e-02, PNorm = 603.3121, GNorm = 0.2864, lr_0 = 9.9657e-04
Validation binary_cross_entropy = 0.252986
Epoch 5144
Loss = 3.1309e-02, PNorm = 603.3688, GNorm = 0.0566, lr_0 = 9.9657e-04
Validation binary_cross_entropy = 0.087664
Epoch 5145
Loss = 2.0488e-02, PNorm = 603.4377, GNorm = 3.4890, lr_0 = 9.9657e-04
Validation binary_cross_entropy = 0.078055
Epoch 5146
Loss = 3.0235e-02, PNorm = 603.5057, GNorm = 0.1462, lr_0 = 9.9657e-04
Validation binary_cross_entropy = 0.093208
Epoch 5147
Loss = 4.0871e-02, PNorm = 603.5702, GNorm = 0.4111, lr_0 = 9.9657e-04
Validation binary_cross_entropy = 0.081646
Epoch 5148
Loss = 1.9564e-02, PNorm = 603.6273, GNorm = 0.7232, lr_0 = 9.9657e-04
Validation binary_cross_entropy = 0.124336
Epoch 5149
Loss = 1.2415e-02, PNorm = 603.6829, GNorm = 1.2077, lr_0 = 9.9657e-04
Loss = 3.4749e-02, PNorm = 603.7261, GNorm = 0.5827, lr_0 = 9.9657e-04
Validation binary_cross_entropy = 0.096959
Epoch 5150
Loss = 6.2266e-02, PNorm = 603.8066, GNorm = 0.5361, lr_0 = 9.9657e-04
Validation binary_cross_entropy = 0.102708
Epoch 5151
Loss = 1.3607e-01, PNorm = 603.8864, GNorm = 0.4256, lr_0 = 9.9656e-04
Validation binary_cross_entropy = 0.071636
Epoch 5152
Loss = 2.4936e-02, PNorm = 603.9467, GNorm = 0.3878, lr_0 = 9.9656e-04
Validation binary_cross_entropy = 0.059719
Epoch 5153
Loss = 2.4189e-02, PNorm = 603.9961, GNorm = 0.2663, lr_0 = 9.9656e-04
Validation binary_cross_entropy = 0.070224
Epoch 5154
Loss = 3.1887e-02, PNorm = 604.0374, GNorm = 2.2837, lr_0 = 9.9656e-04
Validation binary_cross_entropy = 0.074448
Epoch 5155
Loss = 2.2995e-02, PNorm = 604.0635, GNorm = 0.5482, lr_0 = 9.9656e-04
Validation binary_cross_entropy = 0.069534
Epoch 5156
Loss = 1.4711e-02, PNorm = 604.0943, GNorm = 0.0304, lr_0 = 9.9656e-04
Validation binary_cross_entropy = 0.090420
Epoch 5157
Loss = 2.3470e-02, PNorm = 604.1325, GNorm = 3.3528, lr_0 = 9.9656e-04
Validation binary_cross_entropy = 0.085315
Epoch 5158
Loss = 2.5763e-03, PNorm = 604.1521, GNorm = 0.1003, lr_0 = 9.9656e-04
Validation binary_cross_entropy = 0.088360
Epoch 5159
Loss = 5.3806e-03, PNorm = 604.1709, GNorm = 0.2524, lr_0 = 9.9656e-04
Loss = 1.3677e-02, PNorm = 604.1962, GNorm = 0.0425, lr_0 = 9.9656e-04
Validation binary_cross_entropy = 0.088589
Epoch 5160
Loss = 4.0426e-02, PNorm = 604.2351, GNorm = 3.2193, lr_0 = 9.9656e-04
Validation binary_cross_entropy = 0.105081
Epoch 5161
Loss = 1.4296e-02, PNorm = 604.2632, GNorm = 1.0691, lr_0 = 9.9656e-04
Validation binary_cross_entropy = 0.084123
Epoch 5162
Loss = 1.0858e-02, PNorm = 604.2918, GNorm = 0.2345, lr_0 = 9.9656e-04
Validation binary_cross_entropy = 0.086385
Epoch 5163
Loss = 2.1703e-02, PNorm = 604.3231, GNorm = 2.2000, lr_0 = 9.9656e-04
Validation binary_cross_entropy = 0.088489
Epoch 5164
Loss = 1.0984e-02, PNorm = 604.3537, GNorm = 0.5704, lr_0 = 9.9656e-04
Validation binary_cross_entropy = 0.095907
Epoch 5165
Loss = 1.5621e-02, PNorm = 604.3807, GNorm = 0.3329, lr_0 = 9.9655e-04
Validation binary_cross_entropy = 0.101239
Epoch 5166
Loss = 3.0576e-03, PNorm = 604.4045, GNorm = 0.0750, lr_0 = 9.9655e-04
Validation binary_cross_entropy = 0.098728
Epoch 5167
Loss = 4.1486e-03, PNorm = 604.4207, GNorm = 0.2420, lr_0 = 9.9655e-04
Validation binary_cross_entropy = 0.098691
Epoch 5168
Loss = 2.0414e-02, PNorm = 604.4380, GNorm = 0.9239, lr_0 = 9.9655e-04
Validation binary_cross_entropy = 0.105118
Epoch 5169
Loss = 4.2050e-02, PNorm = 604.4692, GNorm = 1.1751, lr_0 = 9.9655e-04
Loss = 8.9125e-03, PNorm = 604.5132, GNorm = 0.0585, lr_0 = 9.9655e-04
Validation binary_cross_entropy = 0.105177
Epoch 5170
Loss = 3.4632e-02, PNorm = 604.5499, GNorm = 1.7860, lr_0 = 9.9655e-04
Validation binary_cross_entropy = 0.109029
Epoch 5171
Loss = 2.0640e-02, PNorm = 604.5817, GNorm = 4.1147, lr_0 = 9.9655e-04
Validation binary_cross_entropy = 0.094138
Epoch 5172
Loss = 1.2830e-02, PNorm = 604.6187, GNorm = 2.0594, lr_0 = 9.9655e-04
Validation binary_cross_entropy = 0.088125
Epoch 5173
Loss = 3.5730e-02, PNorm = 604.6679, GNorm = 0.8265, lr_0 = 9.9655e-04
Validation binary_cross_entropy = 0.093526
Epoch 5174
Loss = 8.9511e-03, PNorm = 604.7068, GNorm = 0.0721, lr_0 = 9.9655e-04
Validation binary_cross_entropy = 0.095586
Epoch 5175
Loss = 5.9382e-03, PNorm = 604.7293, GNorm = 0.0800, lr_0 = 9.9655e-04
Validation binary_cross_entropy = 0.081768
Epoch 5176
Loss = 8.8704e-02, PNorm = 604.7515, GNorm = 6.8044, lr_0 = 9.9655e-04
Validation binary_cross_entropy = 0.066398
Epoch 5177
Loss = 1.2460e-01, PNorm = 604.8275, GNorm = 1.0313, lr_0 = 9.9655e-04
Validation binary_cross_entropy = 0.068488
Epoch 5178
Loss = 3.6828e-02, PNorm = 604.9072, GNorm = 1.0714, lr_0 = 9.9654e-04
Validation binary_cross_entropy = 0.074482
Epoch 5179
Loss = 6.4266e-03, PNorm = 604.9722, GNorm = 0.3031, lr_0 = 9.9654e-04
Loss = 1.9162e-02, PNorm = 605.0229, GNorm = 0.6347, lr_0 = 9.9654e-04
Validation binary_cross_entropy = 0.089162
Epoch 5180
Loss = 1.4856e-02, PNorm = 605.0617, GNorm = 0.0327, lr_0 = 9.9654e-04
Validation binary_cross_entropy = 0.087594
Epoch 5181
Loss = 2.0476e-02, PNorm = 605.0865, GNorm = 1.4812, lr_0 = 9.9654e-04
Validation binary_cross_entropy = 0.087120
Epoch 5182
Loss = 4.1902e-02, PNorm = 605.1144, GNorm = 0.1739, lr_0 = 9.9654e-04
Validation binary_cross_entropy = 0.079525
Epoch 5183
Loss = 4.1862e-02, PNorm = 605.1436, GNorm = 0.7025, lr_0 = 9.9654e-04
Validation binary_cross_entropy = 0.073944
Epoch 5184
Loss = 2.4000e-02, PNorm = 605.1754, GNorm = 0.1781, lr_0 = 9.9654e-04
Validation binary_cross_entropy = 0.083479
Epoch 5185
Loss = 4.0046e-02, PNorm = 605.2064, GNorm = 3.4210, lr_0 = 9.9654e-04
Validation binary_cross_entropy = 0.084353
Epoch 5186
Loss = 9.3179e-03, PNorm = 605.2332, GNorm = 0.6737, lr_0 = 9.9654e-04
Validation binary_cross_entropy = 0.087289
Epoch 5187
Loss = 7.7641e-02, PNorm = 605.2630, GNorm = 1.6839, lr_0 = 9.9654e-04
Validation binary_cross_entropy = 0.085914
Epoch 5188
Loss = 1.0462e-01, PNorm = 605.2904, GNorm = 3.0954, lr_0 = 9.9654e-04
Validation binary_cross_entropy = 0.084607
Epoch 5189
Loss = 2.2804e-03, PNorm = 605.3253, GNorm = 0.0759, lr_0 = 9.9654e-04
Loss = 3.3562e-02, PNorm = 605.3575, GNorm = 0.1014, lr_0 = 9.9654e-04
Validation binary_cross_entropy = 0.086540
Epoch 5190
Loss = 1.0202e-02, PNorm = 605.3861, GNorm = 0.2318, lr_0 = 9.9654e-04
Validation binary_cross_entropy = 0.092727
Epoch 5191
Loss = 1.5641e-02, PNorm = 605.4063, GNorm = 0.4952, lr_0 = 9.9653e-04
Validation binary_cross_entropy = 0.093058
Epoch 5192
Loss = 2.7877e-02, PNorm = 605.4296, GNorm = 1.5067, lr_0 = 9.9653e-04
Validation binary_cross_entropy = 0.095133
Epoch 5193
Loss = 1.7382e-02, PNorm = 605.4598, GNorm = 1.3911, lr_0 = 9.9653e-04
Validation binary_cross_entropy = 0.099715
Epoch 5194
Loss = 1.0919e-02, PNorm = 605.4955, GNorm = 1.6310, lr_0 = 9.9653e-04
Validation binary_cross_entropy = 0.096463
Epoch 5195
Loss = 3.6667e-03, PNorm = 605.5214, GNorm = 0.1178, lr_0 = 9.9653e-04
Validation binary_cross_entropy = 0.096283
Epoch 5196
Loss = 3.3430e-02, PNorm = 605.5453, GNorm = 1.0569, lr_0 = 9.9653e-04
Validation binary_cross_entropy = 0.091932
Epoch 5197
Loss = 2.9146e-02, PNorm = 605.5701, GNorm = 2.8382, lr_0 = 9.9653e-04
Validation binary_cross_entropy = 0.102661
Epoch 5198
Loss = 9.1937e-02, PNorm = 605.6053, GNorm = 0.3679, lr_0 = 9.9653e-04
Validation binary_cross_entropy = 0.111081
Epoch 5199
Loss = 4.7474e-04, PNorm = 605.6452, GNorm = 0.0203, lr_0 = 9.9653e-04
Loss = 2.9213e-02, PNorm = 605.6801, GNorm = 0.7265, lr_0 = 9.9653e-04
Validation binary_cross_entropy = 0.099798
Epoch 5200
Loss = 1.5517e-02, PNorm = 605.7248, GNorm = 0.9060, lr_0 = 9.9653e-04
Validation binary_cross_entropy = 0.104941
Epoch 5201
Loss = 2.0362e-02, PNorm = 605.7606, GNorm = 0.0345, lr_0 = 9.9653e-04
Validation binary_cross_entropy = 0.104138
Epoch 5202
Loss = 2.2197e-02, PNorm = 605.8022, GNorm = 0.2103, lr_0 = 9.9653e-04
Validation binary_cross_entropy = 0.098943
Epoch 5203
Loss = 3.1778e-03, PNorm = 605.8350, GNorm = 0.0663, lr_0 = 9.9653e-04
Validation binary_cross_entropy = 0.099750
Epoch 5204
Loss = 8.9922e-03, PNorm = 605.8655, GNorm = 0.1686, lr_0 = 9.9652e-04
Validation binary_cross_entropy = 0.082800
Epoch 5205
Loss = 3.5819e-01, PNorm = 605.9852, GNorm = 0.5924, lr_0 = 9.9652e-04
Validation binary_cross_entropy = 0.071400
Epoch 5206
Loss = 5.2040e-02, PNorm = 606.1832, GNorm = 2.1888, lr_0 = 9.9652e-04
Validation binary_cross_entropy = 0.098593
Epoch 5207
Loss = 6.2389e-02, PNorm = 606.3177, GNorm = 2.1674, lr_0 = 9.9652e-04
Validation binary_cross_entropy = 0.082377
Epoch 5208
Loss = 1.2330e-01, PNorm = 606.4078, GNorm = 2.5625, lr_0 = 9.9652e-04
Validation binary_cross_entropy = 0.073366
Epoch 5209
Loss = 1.2150e-01, PNorm = 606.4850, GNorm = 0.9393, lr_0 = 9.9652e-04
Loss = 7.0573e-02, PNorm = 606.5569, GNorm = 1.9474, lr_0 = 9.9652e-04
Validation binary_cross_entropy = 0.071200
Epoch 5210
Loss = 3.9537e-02, PNorm = 606.6199, GNorm = 0.4956, lr_0 = 9.9652e-04
Validation binary_cross_entropy = 0.070836
Epoch 5211
Loss = 3.3926e-02, PNorm = 606.6779, GNorm = 4.2715, lr_0 = 9.9652e-04
Validation binary_cross_entropy = 0.084309
Epoch 5212
Loss = 4.3072e-02, PNorm = 606.7322, GNorm = 2.0254, lr_0 = 9.9652e-04
Validation binary_cross_entropy = 0.080551
Epoch 5213
Loss = 1.9071e-02, PNorm = 606.7855, GNorm = 0.1152, lr_0 = 9.9652e-04
Validation binary_cross_entropy = 0.069170
Epoch 5214
Loss = 4.4038e-02, PNorm = 606.8318, GNorm = 5.5091, lr_0 = 9.9652e-04
Validation binary_cross_entropy = 0.068864
Epoch 5215
Loss = 9.5888e-02, PNorm = 606.8884, GNorm = 4.5507, lr_0 = 9.9652e-04
Validation binary_cross_entropy = 0.076528
Epoch 5216
Loss = 8.1038e-03, PNorm = 606.9360, GNorm = 0.7467, lr_0 = 9.9652e-04
Validation binary_cross_entropy = 0.076142
Epoch 5217
Loss = 7.0187e-03, PNorm = 606.9812, GNorm = 0.1441, lr_0 = 9.9652e-04
Validation binary_cross_entropy = 0.088041
Epoch 5218
Loss = 1.4187e-02, PNorm = 607.0223, GNorm = 0.6735, lr_0 = 9.9651e-04
Validation binary_cross_entropy = 0.088758
Epoch 5219
Loss = 2.8885e-03, PNorm = 607.0541, GNorm = 0.2585, lr_0 = 9.9651e-04
Loss = 2.8096e-02, PNorm = 607.0808, GNorm = 1.8107, lr_0 = 9.9651e-04
Validation binary_cross_entropy = 0.081587
Epoch 5220
Loss = 2.8378e-02, PNorm = 607.1203, GNorm = 0.6337, lr_0 = 9.9651e-04
Validation binary_cross_entropy = 0.074126
Epoch 5221
Loss = 3.5669e-02, PNorm = 607.1596, GNorm = 3.3538, lr_0 = 9.9651e-04
Validation binary_cross_entropy = 0.071192
Epoch 5222
Loss = 3.5592e-02, PNorm = 607.2004, GNorm = 2.4291, lr_0 = 9.9651e-04
Validation binary_cross_entropy = 0.066950
Epoch 5223
Loss = 5.5972e-02, PNorm = 607.2668, GNorm = 1.9007, lr_0 = 9.9651e-04
Validation binary_cross_entropy = 0.069347
Epoch 5224
Loss = 1.6530e-02, PNorm = 607.3379, GNorm = 0.4782, lr_0 = 9.9651e-04
Validation binary_cross_entropy = 0.091277
Epoch 5225
Loss = 1.9040e-02, PNorm = 607.3971, GNorm = 2.3421, lr_0 = 9.9651e-04
Validation binary_cross_entropy = 0.110801
Epoch 5226
Loss = 1.3505e-02, PNorm = 607.4343, GNorm = 0.0092, lr_0 = 9.9651e-04
Validation binary_cross_entropy = 0.150450
Epoch 5227
Loss = 1.5434e-03, PNorm = 607.4698, GNorm = 0.0008, lr_0 = 9.9651e-04
Validation binary_cross_entropy = 0.099300
Epoch 5228
Loss = 2.6059e-01, PNorm = 607.5488, GNorm = 1.7280, lr_0 = 9.9651e-04
Validation binary_cross_entropy = 0.093183
Epoch 5229
Loss = 2.1673e-02, PNorm = 607.6414, GNorm = 1.0786, lr_0 = 9.9651e-04
Loss = 4.5769e-02, PNorm = 607.6964, GNorm = 0.2427, lr_0 = 9.9651e-04
Validation binary_cross_entropy = 0.087973
Epoch 5230
Loss = 5.0543e-02, PNorm = 607.7415, GNorm = 1.0184, lr_0 = 9.9651e-04
Validation binary_cross_entropy = 0.057345
Epoch 5231
Loss = 5.2578e-02, PNorm = 607.8122, GNorm = 0.8781, lr_0 = 9.9650e-04
Validation binary_cross_entropy = 0.061788
Epoch 5232
Loss = 3.3462e-02, PNorm = 607.8745, GNorm = 0.5495, lr_0 = 9.9650e-04
Validation binary_cross_entropy = 0.062141
Epoch 5233
Loss = 1.0581e-02, PNorm = 607.9336, GNorm = 0.4453, lr_0 = 9.9650e-04
Validation binary_cross_entropy = 0.076087
Epoch 5234
Loss = 8.2761e-02, PNorm = 607.9791, GNorm = 0.2062, lr_0 = 9.9650e-04
Validation binary_cross_entropy = 0.060904
Epoch 5235
Loss = 3.8015e-02, PNorm = 608.0407, GNorm = 1.0248, lr_0 = 9.9650e-04
Validation binary_cross_entropy = 0.069759
Epoch 5236
Loss = 2.2851e-02, PNorm = 608.1101, GNorm = 0.5027, lr_0 = 9.9650e-04
Validation binary_cross_entropy = 0.062037
Epoch 5237
Loss = 5.0143e-03, PNorm = 608.1628, GNorm = 0.1570, lr_0 = 9.9650e-04
Validation binary_cross_entropy = 0.061603
Epoch 5238
Loss = 3.5475e-02, PNorm = 608.2364, GNorm = 3.9166, lr_0 = 9.9650e-04
Validation binary_cross_entropy = 0.075627
Epoch 5239
Loss = 2.3246e-03, PNorm = 608.2968, GNorm = 0.1481, lr_0 = 9.9650e-04
Loss = 1.5404e-02, PNorm = 608.3292, GNorm = 0.8078, lr_0 = 9.9650e-04
Validation binary_cross_entropy = 0.082826
Epoch 5240
Loss = 2.3767e-02, PNorm = 608.3480, GNorm = 0.5986, lr_0 = 9.9650e-04
Validation binary_cross_entropy = 0.075986
Epoch 5241
Loss = 8.9122e-03, PNorm = 608.3607, GNorm = 0.2001, lr_0 = 9.9650e-04
Validation binary_cross_entropy = 0.071553
Epoch 5242
Loss = 4.1853e-02, PNorm = 608.3904, GNorm = 0.3881, lr_0 = 9.9650e-04
Validation binary_cross_entropy = 0.076427
Epoch 5243
Loss = 1.4458e-02, PNorm = 608.4240, GNorm = 0.0965, lr_0 = 9.9650e-04
Validation binary_cross_entropy = 0.074379
Epoch 5244
Loss = 3.0749e-02, PNorm = 608.4593, GNorm = 3.6605, lr_0 = 9.9649e-04
Validation binary_cross_entropy = 0.071540
Epoch 5245
Loss = 1.2028e-02, PNorm = 608.5101, GNorm = 0.2497, lr_0 = 9.9649e-04
Validation binary_cross_entropy = 0.074433
Epoch 5246
Loss = 2.5598e-02, PNorm = 608.5471, GNorm = 0.0265, lr_0 = 9.9649e-04
Validation binary_cross_entropy = 0.077227
Epoch 5247
Loss = 1.5372e-02, PNorm = 608.5927, GNorm = 1.1526, lr_0 = 9.9649e-04
Validation binary_cross_entropy = 0.115386
Epoch 5248
Loss = 3.5705e-02, PNorm = 608.6376, GNorm = 0.0109, lr_0 = 9.9649e-04
Validation binary_cross_entropy = 0.094264
Epoch 5249
Loss = 4.0595e-03, PNorm = 608.6707, GNorm = 0.3005, lr_0 = 9.9649e-04
Loss = 4.2058e-02, PNorm = 608.7435, GNorm = 0.6102, lr_0 = 9.9649e-04
Validation binary_cross_entropy = 0.069789
Epoch 5250
Loss = 3.0447e-02, PNorm = 608.8156, GNorm = 0.2589, lr_0 = 9.9649e-04
Validation binary_cross_entropy = 0.080564
Epoch 5251
Loss = 4.9959e-02, PNorm = 608.8708, GNorm = 1.9468, lr_0 = 9.9649e-04
Validation binary_cross_entropy = 0.075033
Epoch 5252
Loss = 3.2845e-02, PNorm = 608.9223, GNorm = 0.3744, lr_0 = 9.9649e-04
Validation binary_cross_entropy = 0.074611
Epoch 5253
Loss = 6.8621e-03, PNorm = 608.9774, GNorm = 0.2753, lr_0 = 9.9649e-04
Validation binary_cross_entropy = 0.089336
Epoch 5254
Loss = 1.7713e-02, PNorm = 609.0148, GNorm = 0.3290, lr_0 = 9.9649e-04
Validation binary_cross_entropy = 0.090164
Epoch 5255
Loss = 3.3463e-02, PNorm = 609.0461, GNorm = 0.8729, lr_0 = 9.9649e-04
Validation binary_cross_entropy = 0.073848
Epoch 5256
Loss = 3.8630e-02, PNorm = 609.0883, GNorm = 0.1428, lr_0 = 9.9649e-04
Validation binary_cross_entropy = 0.068042
Epoch 5257
Loss = 4.3233e-03, PNorm = 609.1233, GNorm = 0.0715, lr_0 = 9.9649e-04
Validation binary_cross_entropy = 0.081680
Epoch 5258
Loss = 6.4609e-03, PNorm = 609.1682, GNorm = 0.4555, lr_0 = 9.9648e-04
Validation binary_cross_entropy = 0.083003
Epoch 5259
Loss = 2.5709e-03, PNorm = 609.2052, GNorm = 0.1907, lr_0 = 9.9648e-04
Loss = 5.0643e-02, PNorm = 609.2493, GNorm = 2.2298, lr_0 = 9.9648e-04
Validation binary_cross_entropy = 0.083237
Epoch 5260
Loss = 3.0725e-02, PNorm = 609.3117, GNorm = 0.3825, lr_0 = 9.9648e-04
Validation binary_cross_entropy = 0.073165
Epoch 5261
Loss = 9.1449e-03, PNorm = 609.3643, GNorm = 0.1264, lr_0 = 9.9648e-04
Validation binary_cross_entropy = 0.081573
Epoch 5262
Loss = 2.3254e-02, PNorm = 609.3995, GNorm = 0.1723, lr_0 = 9.9648e-04
Validation binary_cross_entropy = 0.081811
Epoch 5263
Loss = 7.6238e-03, PNorm = 609.4249, GNorm = 0.8245, lr_0 = 9.9648e-04
Validation binary_cross_entropy = 0.103831
Epoch 5264
Loss = 2.0091e-02, PNorm = 609.4486, GNorm = 0.6220, lr_0 = 9.9648e-04
Validation binary_cross_entropy = 0.106392
Epoch 5265
Loss = 1.5916e-02, PNorm = 609.4670, GNorm = 1.0390, lr_0 = 9.9648e-04
Validation binary_cross_entropy = 0.086715
Epoch 5266
Loss = 6.5332e-02, PNorm = 609.4874, GNorm = 1.5594, lr_0 = 9.9648e-04
Validation binary_cross_entropy = 0.084915
Epoch 5267
Loss = 3.6635e-03, PNorm = 609.5279, GNorm = 0.0776, lr_0 = 9.9648e-04
Validation binary_cross_entropy = 0.087039
Epoch 5268
Loss = 1.1105e-02, PNorm = 609.5572, GNorm = 0.7915, lr_0 = 9.9648e-04
Validation binary_cross_entropy = 0.072311
Epoch 5269
Loss = 6.8832e-03, PNorm = 609.5777, GNorm = 0.2640, lr_0 = 9.9648e-04
Loss = 2.4697e-02, PNorm = 609.6112, GNorm = 1.5486, lr_0 = 9.9648e-04
Validation binary_cross_entropy = 0.076574
Epoch 5270
Loss = 2.1953e-02, PNorm = 609.6487, GNorm = 4.7855, lr_0 = 9.9647e-04
Validation binary_cross_entropy = 0.090147
Epoch 5271
Loss = 6.4089e-02, PNorm = 609.6871, GNorm = 5.7022, lr_0 = 9.9647e-04
Validation binary_cross_entropy = 0.073825
Epoch 5272
Loss = 5.9141e-02, PNorm = 609.7723, GNorm = 0.7873, lr_0 = 9.9647e-04
Validation binary_cross_entropy = 0.082409
Epoch 5273
Loss = 5.8069e-02, PNorm = 609.8495, GNorm = 0.1792, lr_0 = 9.9647e-04
Validation binary_cross_entropy = 0.077259
Epoch 5274
Loss = 5.0811e-02, PNorm = 609.9113, GNorm = 3.0054, lr_0 = 9.9647e-04
Validation binary_cross_entropy = 0.069877
Epoch 5275
Loss = 2.8360e-02, PNorm = 609.9785, GNorm = 0.1245, lr_0 = 9.9647e-04
Validation binary_cross_entropy = 0.080750
Epoch 5276
Loss = 1.8080e-02, PNorm = 610.0375, GNorm = 0.7617, lr_0 = 9.9647e-04
Validation binary_cross_entropy = 0.082182
Epoch 5277
Loss = 4.5628e-02, PNorm = 610.0769, GNorm = 0.3117, lr_0 = 9.9647e-04
Validation binary_cross_entropy = 0.092251
Epoch 5278
Loss = 4.2368e-02, PNorm = 610.1205, GNorm = 0.8110, lr_0 = 9.9647e-04
Validation binary_cross_entropy = 0.089797
Epoch 5279
Loss = 1.5730e-03, PNorm = 610.1567, GNorm = 0.0528, lr_0 = 9.9647e-04
Loss = 7.3644e-03, PNorm = 610.1942, GNorm = 0.0827, lr_0 = 9.9647e-04
Validation binary_cross_entropy = 0.091944
Epoch 5280
Loss = 3.4389e-02, PNorm = 610.2314, GNorm = 0.1338, lr_0 = 9.9647e-04
Validation binary_cross_entropy = 0.092294
Epoch 5281
Loss = 2.5765e-02, PNorm = 610.2822, GNorm = 0.2603, lr_0 = 9.9647e-04
Validation binary_cross_entropy = 0.095984
Epoch 5282
Loss = 3.8844e-02, PNorm = 610.3343, GNorm = 0.9237, lr_0 = 9.9647e-04
Validation binary_cross_entropy = 0.095680
Epoch 5283
Loss = 5.8657e-02, PNorm = 610.3815, GNorm = 0.3588, lr_0 = 9.9647e-04
Validation binary_cross_entropy = 0.091513
Epoch 5284
Loss = 3.3319e-02, PNorm = 610.4199, GNorm = 3.9917, lr_0 = 9.9646e-04
Validation binary_cross_entropy = 0.073602
Epoch 5285
Loss = 3.2392e-02, PNorm = 610.4664, GNorm = 0.1033, lr_0 = 9.9646e-04
Validation binary_cross_entropy = 0.082617
Epoch 5286
Loss = 1.8559e-02, PNorm = 610.5144, GNorm = 2.7209, lr_0 = 9.9646e-04
Validation binary_cross_entropy = 0.082845
Epoch 5287
Loss = 2.9051e-02, PNorm = 610.5567, GNorm = 0.0810, lr_0 = 9.9646e-04
Validation binary_cross_entropy = 0.065184
Epoch 5288
Loss = 3.9349e-02, PNorm = 610.6038, GNorm = 2.7676, lr_0 = 9.9646e-04
Validation binary_cross_entropy = 0.077141
Epoch 5289
Loss = 3.4169e-02, PNorm = 610.6722, GNorm = 0.5380, lr_0 = 9.9646e-04
Loss = 4.8872e-02, PNorm = 610.7109, GNorm = 1.1376, lr_0 = 9.9646e-04
Validation binary_cross_entropy = 0.070331
Epoch 5290
Loss = 1.1775e-02, PNorm = 610.7473, GNorm = 0.3433, lr_0 = 9.9646e-04
Validation binary_cross_entropy = 0.067635
Epoch 5291
Loss = 1.7456e-02, PNorm = 610.7934, GNorm = 0.1341, lr_0 = 9.9646e-04
Validation binary_cross_entropy = 0.070836
Epoch 5292
Loss = 2.0711e-02, PNorm = 610.8476, GNorm = 2.8365, lr_0 = 9.9646e-04
Validation binary_cross_entropy = 0.074311
Epoch 5293
Loss = 2.3753e-02, PNorm = 610.8888, GNorm = 0.1078, lr_0 = 9.9646e-04
Validation binary_cross_entropy = 0.073377
Epoch 5294
Loss = 7.2471e-03, PNorm = 610.9297, GNorm = 0.0336, lr_0 = 9.9646e-04
Validation binary_cross_entropy = 0.079134
Epoch 5295
Loss = 2.5030e-02, PNorm = 610.9734, GNorm = 2.1874, lr_0 = 9.9646e-04
Validation binary_cross_entropy = 0.080999
Epoch 5296
Loss = 1.9868e-02, PNorm = 611.0039, GNorm = 2.6141, lr_0 = 9.9646e-04
Validation binary_cross_entropy = 0.081346
Epoch 5297
Loss = 1.6368e-03, PNorm = 611.0404, GNorm = 0.0761, lr_0 = 9.9645e-04
Validation binary_cross_entropy = 0.078886
Epoch 5298
Loss = 3.3688e-02, PNorm = 611.0832, GNorm = 0.1064, lr_0 = 9.9645e-04
Validation binary_cross_entropy = 0.074829
Epoch 5299
Loss = 7.5271e-02, PNorm = 611.1240, GNorm = 2.4845, lr_0 = 9.9645e-04
Loss = 2.1205e-02, PNorm = 611.1792, GNorm = 0.2168, lr_0 = 9.9645e-04
Validation binary_cross_entropy = 0.100052
Epoch 5300
Loss = 8.9869e-02, PNorm = 611.2341, GNorm = 0.7043, lr_0 = 9.9645e-04
Validation binary_cross_entropy = 0.079157
Epoch 5301
Loss = 6.7459e-02, PNorm = 611.3039, GNorm = 1.2363, lr_0 = 9.9645e-04
Validation binary_cross_entropy = 0.069778
Epoch 5302
Loss = 3.5344e-02, PNorm = 611.3689, GNorm = 0.8433, lr_0 = 9.9645e-04
Validation binary_cross_entropy = 0.075218
Epoch 5303
Loss = 1.2217e-02, PNorm = 611.4256, GNorm = 0.2749, lr_0 = 9.9645e-04
Validation binary_cross_entropy = 0.076833
Epoch 5304
Loss = 6.4650e-02, PNorm = 611.4777, GNorm = 0.0692, lr_0 = 9.9645e-04
Validation binary_cross_entropy = 0.085166
Epoch 5305
Loss = 1.8315e-03, PNorm = 611.5458, GNorm = 0.1220, lr_0 = 9.9645e-04
Validation binary_cross_entropy = 0.091318
Epoch 5306
Loss = 4.1544e-03, PNorm = 611.6074, GNorm = 0.4566, lr_0 = 9.9645e-04
Validation binary_cross_entropy = 0.126030
Epoch 5307
Loss = 1.0028e-02, PNorm = 611.6550, GNorm = 0.0782, lr_0 = 9.9645e-04
Validation binary_cross_entropy = 0.091326
Epoch 5308
Loss = 2.2941e-03, PNorm = 611.6847, GNorm = 0.0905, lr_0 = 9.9645e-04
Validation binary_cross_entropy = 0.107312
Epoch 5309
Loss = 4.7997e-02, PNorm = 611.7269, GNorm = 2.6559, lr_0 = 9.9645e-04
Loss = 8.5309e-02, PNorm = 611.7824, GNorm = 36.1415, lr_0 = 9.9645e-04
Validation binary_cross_entropy = 0.125549
Epoch 5310
Loss = 3.0698e-02, PNorm = 611.8350, GNorm = 1.3111, lr_0 = 9.9644e-04
Validation binary_cross_entropy = 0.073712
Epoch 5311
Loss = 3.4561e-02, PNorm = 611.8887, GNorm = 2.1814, lr_0 = 9.9644e-04
Validation binary_cross_entropy = 0.080102
Epoch 5312
Loss = 2.7229e-02, PNorm = 611.9408, GNorm = 0.0800, lr_0 = 9.9644e-04
Validation binary_cross_entropy = 0.085449
Epoch 5313
Loss = 2.5041e-02, PNorm = 611.9762, GNorm = 0.2346, lr_0 = 9.9644e-04
Validation binary_cross_entropy = 0.083033
Epoch 5314
Loss = 4.8622e-02, PNorm = 612.0126, GNorm = 1.5093, lr_0 = 9.9644e-04
Validation binary_cross_entropy = 0.103554
Epoch 5315
Loss = 6.2265e-03, PNorm = 612.0723, GNorm = 0.0742, lr_0 = 9.9644e-04
Validation binary_cross_entropy = 0.122411
Epoch 5316
Loss = 4.8521e-02, PNorm = 612.1139, GNorm = 0.1287, lr_0 = 9.9644e-04
Validation binary_cross_entropy = 0.090836
Epoch 5317
Loss = 4.7325e-02, PNorm = 612.1492, GNorm = 1.7705, lr_0 = 9.9644e-04
Validation binary_cross_entropy = 0.075480
Epoch 5318
Loss = 2.4040e-02, PNorm = 612.1848, GNorm = 1.9856, lr_0 = 9.9644e-04
Validation binary_cross_entropy = 0.094881
Epoch 5319
Loss = 8.0264e-03, PNorm = 612.2312, GNorm = 0.3511, lr_0 = 9.9644e-04
Loss = 2.3168e-02, PNorm = 612.2600, GNorm = 0.3303, lr_0 = 9.9644e-04
Validation binary_cross_entropy = 0.081021
Epoch 5320
Loss = 3.0066e-02, PNorm = 612.2956, GNorm = 0.9281, lr_0 = 9.9644e-04
Validation binary_cross_entropy = 0.078698
Epoch 5321
Loss = 7.5413e-03, PNorm = 612.3177, GNorm = 0.0542, lr_0 = 9.9644e-04
Validation binary_cross_entropy = 0.091593
Epoch 5322
Loss = 2.3160e-02, PNorm = 612.3392, GNorm = 0.1724, lr_0 = 9.9644e-04
Validation binary_cross_entropy = 0.095763
Epoch 5323
Loss = 6.9769e-02, PNorm = 612.3608, GNorm = 3.6584, lr_0 = 9.9644e-04
Validation binary_cross_entropy = 0.074819
Epoch 5324
Loss = 4.6768e-03, PNorm = 612.4107, GNorm = 0.0911, lr_0 = 9.9643e-04
Validation binary_cross_entropy = 0.107020
Epoch 5325
Loss = 1.5042e-01, PNorm = 612.4580, GNorm = 0.4144, lr_0 = 9.9643e-04
Validation binary_cross_entropy = 0.079309
Epoch 5326
Loss = 1.3874e-01, PNorm = 612.5523, GNorm = 3.0499, lr_0 = 9.9643e-04
Validation binary_cross_entropy = 0.086805
Epoch 5327
Loss = 1.9134e-02, PNorm = 612.6572, GNorm = 0.4962, lr_0 = 9.9643e-04
Validation binary_cross_entropy = 0.072127
Epoch 5328
Loss = 5.3935e-03, PNorm = 612.7354, GNorm = 0.2541, lr_0 = 9.9643e-04
Validation binary_cross_entropy = 0.092394
Epoch 5329
Loss = 6.0530e-03, PNorm = 612.7953, GNorm = 0.2311, lr_0 = 9.9643e-04
Loss = 5.3159e-02, PNorm = 612.8375, GNorm = 0.0206, lr_0 = 9.9643e-04
Validation binary_cross_entropy = 0.085783
Epoch 5330
Loss = 1.8469e-02, PNorm = 612.8761, GNorm = 0.1196, lr_0 = 9.9643e-04
Validation binary_cross_entropy = 0.075191
Epoch 5331
Loss = 2.5595e-02, PNorm = 612.9109, GNorm = 0.0212, lr_0 = 9.9643e-04
Validation binary_cross_entropy = 0.096014
Epoch 5332
Loss = 5.8171e-02, PNorm = 612.9430, GNorm = 0.0582, lr_0 = 9.9643e-04
Validation binary_cross_entropy = 0.074278
Epoch 5333
Loss = 2.1486e-02, PNorm = 612.9985, GNorm = 0.3407, lr_0 = 9.9643e-04
Validation binary_cross_entropy = 0.075189
Epoch 5334
Loss = 5.4876e-02, PNorm = 613.0570, GNorm = 1.6065, lr_0 = 9.9643e-04
Validation binary_cross_entropy = 0.069746
Epoch 5335
Loss = 2.3813e-02, PNorm = 613.0981, GNorm = 0.8345, lr_0 = 9.9643e-04
Validation binary_cross_entropy = 0.071469
Epoch 5336
Loss = 1.5244e-02, PNorm = 613.1336, GNorm = 0.1577, lr_0 = 9.9643e-04
Validation binary_cross_entropy = 0.094885
Epoch 5337
Loss = 5.3412e-02, PNorm = 613.1638, GNorm = 0.7718, lr_0 = 9.9642e-04
Validation binary_cross_entropy = 0.079678
Epoch 5338
Loss = 7.2349e-03, PNorm = 613.1947, GNorm = 0.1105, lr_0 = 9.9642e-04
Validation binary_cross_entropy = 0.100570
Epoch 5339
Loss = 1.2443e-01, PNorm = 613.2474, GNorm = 1.8699, lr_0 = 9.9642e-04
Loss = 2.2180e-02, PNorm = 613.2830, GNorm = 0.1501, lr_0 = 9.9642e-04
Validation binary_cross_entropy = 0.093707
Epoch 5340
Loss = 3.0508e-02, PNorm = 613.3272, GNorm = 3.3812, lr_0 = 9.9642e-04
Validation binary_cross_entropy = 0.107484
Epoch 5341
Loss = 3.2311e-02, PNorm = 613.3813, GNorm = 1.1314, lr_0 = 9.9642e-04
Validation binary_cross_entropy = 0.118194
Epoch 5342
Loss = 3.2288e-02, PNorm = 613.4177, GNorm = 0.1209, lr_0 = 9.9642e-04
Validation binary_cross_entropy = 0.097076
Epoch 5343
Loss = 2.0842e-02, PNorm = 613.4654, GNorm = 0.3137, lr_0 = 9.9642e-04
Validation binary_cross_entropy = 0.100983
Epoch 5344
Loss = 7.8192e-03, PNorm = 613.5309, GNorm = 0.0609, lr_0 = 9.9642e-04
Validation binary_cross_entropy = 0.120003
Epoch 5345
Loss = 9.5571e-03, PNorm = 613.5746, GNorm = 0.0914, lr_0 = 9.9642e-04
Validation binary_cross_entropy = 0.093428
Epoch 5346
Loss = 1.8431e-02, PNorm = 613.6199, GNorm = 0.3848, lr_0 = 9.9642e-04
Validation binary_cross_entropy = 0.086467
Epoch 5347
Loss = 2.7346e-02, PNorm = 613.6577, GNorm = 0.8144, lr_0 = 9.9642e-04
Validation binary_cross_entropy = 0.059176
Epoch 5348
Loss = 9.3474e-03, PNorm = 613.7084, GNorm = 0.6431, lr_0 = 9.9642e-04
Validation binary_cross_entropy = 0.107589
Epoch 5349
Loss = 1.8514e-02, PNorm = 613.7579, GNorm = 0.9746, lr_0 = 9.9642e-04
Loss = 5.3819e-03, PNorm = 613.7932, GNorm = 1.2108, lr_0 = 9.9642e-04
Validation binary_cross_entropy = 0.168589
Epoch 5350
Loss = 2.8471e-02, PNorm = 613.8135, GNorm = 2.1652, lr_0 = 9.9641e-04
Validation binary_cross_entropy = 0.089214
Epoch 5351
Loss = 1.1320e-01, PNorm = 613.8854, GNorm = 1.1028, lr_0 = 9.9641e-04
Validation binary_cross_entropy = 0.071847
Epoch 5352
Loss = 1.0355e-02, PNorm = 613.9702, GNorm = 0.2467, lr_0 = 9.9641e-04
Validation binary_cross_entropy = 0.086206
Epoch 5353
Loss = 8.4665e-03, PNorm = 614.0279, GNorm = 0.6511, lr_0 = 9.9641e-04
Validation binary_cross_entropy = 0.097829
Epoch 5354
Loss = 1.4739e-02, PNorm = 614.0698, GNorm = 0.2802, lr_0 = 9.9641e-04
Validation binary_cross_entropy = 0.107624
Epoch 5355
Loss = 1.5351e-02, PNorm = 614.1107, GNorm = 0.3288, lr_0 = 9.9641e-04
Validation binary_cross_entropy = 0.090109
Epoch 5356
Loss = 2.4101e-03, PNorm = 614.1406, GNorm = 0.0187, lr_0 = 9.9641e-04
Validation binary_cross_entropy = 0.080796
Epoch 5357
Loss = 4.9991e-02, PNorm = 614.1792, GNorm = 0.0471, lr_0 = 9.9641e-04
Validation binary_cross_entropy = 0.069521
Epoch 5358
Loss = 6.0115e-03, PNorm = 614.2521, GNorm = 0.1812, lr_0 = 9.9641e-04
Validation binary_cross_entropy = 0.074622
Epoch 5359
Loss = 1.5514e-02, PNorm = 614.3382, GNorm = 0.7098, lr_0 = 9.9641e-04
Loss = 8.1821e-02, PNorm = 614.4336, GNorm = 1.5664, lr_0 = 9.9641e-04
Validation binary_cross_entropy = 0.049294
Epoch 5360
Loss = 5.4219e-02, PNorm = 614.5564, GNorm = 1.4985, lr_0 = 9.9641e-04
Validation binary_cross_entropy = 0.073673
Epoch 5361
Loss = 3.6618e-02, PNorm = 614.6315, GNorm = 0.5661, lr_0 = 9.9641e-04
Validation binary_cross_entropy = 0.062303
Epoch 5362
Loss = 4.7628e-02, PNorm = 614.6907, GNorm = 1.4795, lr_0 = 9.9641e-04
Validation binary_cross_entropy = 0.069209
Epoch 5363
Loss = 3.5448e-02, PNorm = 614.7432, GNorm = 0.1161, lr_0 = 9.9640e-04
Validation binary_cross_entropy = 0.061548
Epoch 5364
Loss = 1.2421e-02, PNorm = 614.7881, GNorm = 0.8349, lr_0 = 9.9640e-04
Validation binary_cross_entropy = 0.068024
Epoch 5365
Loss = 3.3458e-02, PNorm = 614.8443, GNorm = 2.7383, lr_0 = 9.9640e-04
Validation binary_cross_entropy = 0.073242
Epoch 5366
Loss = 3.8860e-03, PNorm = 614.8801, GNorm = 0.1122, lr_0 = 9.9640e-04
Validation binary_cross_entropy = 0.067024
Epoch 5367
Loss = 8.9500e-03, PNorm = 614.9076, GNorm = 0.1907, lr_0 = 9.9640e-04
Validation binary_cross_entropy = 0.077705
Epoch 5368
Loss = 4.0423e-03, PNorm = 614.9338, GNorm = 0.0899, lr_0 = 9.9640e-04
Validation binary_cross_entropy = 0.169842
Epoch 5369
Loss = 6.9697e-05, PNorm = 614.9696, GNorm = 0.0063, lr_0 = 9.9640e-04
Loss = 3.8395e-02, PNorm = 615.0078, GNorm = 0.0454, lr_0 = 9.9640e-04
Validation binary_cross_entropy = 0.115282
Epoch 5370
Loss = 8.4747e-03, PNorm = 615.0479, GNorm = 0.0136, lr_0 = 9.9640e-04
Validation binary_cross_entropy = 0.113365
Epoch 5371
Loss = 3.2553e-02, PNorm = 615.0738, GNorm = 1.6387, lr_0 = 9.9640e-04
Validation binary_cross_entropy = 0.082113
Epoch 5372
Loss = 1.4810e-02, PNorm = 615.1077, GNorm = 2.2589, lr_0 = 9.9640e-04
Validation binary_cross_entropy = 0.100645
Epoch 5373
Loss = 1.1258e-02, PNorm = 615.1507, GNorm = 0.0446, lr_0 = 9.9640e-04
Validation binary_cross_entropy = 0.102852
Epoch 5374
Loss = 4.4741e-02, PNorm = 615.1859, GNorm = 0.1764, lr_0 = 9.9640e-04
Validation binary_cross_entropy = 0.071765
Epoch 5375
Loss = 1.7448e-02, PNorm = 615.2434, GNorm = 0.2865, lr_0 = 9.9640e-04
Validation binary_cross_entropy = 0.081564
Epoch 5376
Loss = 4.9701e-02, PNorm = 615.3067, GNorm = 0.2452, lr_0 = 9.9640e-04
Validation binary_cross_entropy = 0.070486
Epoch 5377
Loss = 1.4675e-02, PNorm = 615.3506, GNorm = 1.0056, lr_0 = 9.9639e-04
Validation binary_cross_entropy = 0.050867
Epoch 5378
Loss = 2.2696e-02, PNorm = 615.3978, GNorm = 0.2809, lr_0 = 9.9639e-04
Validation binary_cross_entropy = 0.049094
Epoch 5379
Loss = 1.2863e-01, PNorm = 615.4812, GNorm = 3.9446, lr_0 = 9.9639e-04
Loss = 2.7564e-02, PNorm = 615.5561, GNorm = 1.0167, lr_0 = 9.9639e-04
Validation binary_cross_entropy = 0.077705
Epoch 5380
Loss = 2.3610e-02, PNorm = 615.6017, GNorm = 1.6289, lr_0 = 9.9639e-04
Validation binary_cross_entropy = 0.074884
Epoch 5381
Loss = 1.7964e-02, PNorm = 615.6358, GNorm = 1.1493, lr_0 = 9.9639e-04
Validation binary_cross_entropy = 0.079627
Epoch 5382
Loss = 3.5928e-02, PNorm = 615.6795, GNorm = 0.1278, lr_0 = 9.9639e-04
Validation binary_cross_entropy = 0.079768
Epoch 5383
Loss = 1.3484e-02, PNorm = 615.7228, GNorm = 0.3146, lr_0 = 9.9639e-04
Validation binary_cross_entropy = 0.088051
Epoch 5384
Loss = 5.3763e-02, PNorm = 615.7647, GNorm = 4.4628, lr_0 = 9.9639e-04
Validation binary_cross_entropy = 0.089384
Epoch 5385
Loss = 1.7834e-01, PNorm = 615.8056, GNorm = 0.5979, lr_0 = 9.9639e-04
Validation binary_cross_entropy = 0.065820
Epoch 5386
Loss = 3.8103e-02, PNorm = 615.8738, GNorm = 3.7530, lr_0 = 9.9639e-04
Validation binary_cross_entropy = 0.118270
Epoch 5387
Loss = 6.7411e-02, PNorm = 615.9443, GNorm = 1.6572, lr_0 = 9.9639e-04
Validation binary_cross_entropy = 0.054972
Epoch 5388
Loss = 1.4369e-02, PNorm = 616.0061, GNorm = 0.7217, lr_0 = 9.9639e-04
Validation binary_cross_entropy = 0.060817
Epoch 5389
Loss = 4.1929e-03, PNorm = 616.0621, GNorm = 0.1428, lr_0 = 9.9639e-04
Loss = 3.8602e-02, PNorm = 616.1219, GNorm = 0.0264, lr_0 = 9.9638e-04
Validation binary_cross_entropy = 0.082113
Epoch 5390
Loss = 1.8627e-02, PNorm = 616.1638, GNorm = 0.1352, lr_0 = 9.9638e-04
Validation binary_cross_entropy = 0.072406
Epoch 5391
Loss = 5.4628e-02, PNorm = 616.1969, GNorm = 0.1544, lr_0 = 9.9638e-04
Validation binary_cross_entropy = 0.064209
Epoch 5392
Loss = 2.2155e-02, PNorm = 616.2500, GNorm = 0.6125, lr_0 = 9.9638e-04
Validation binary_cross_entropy = 0.081289
Epoch 5393
Loss = 1.4197e-02, PNorm = 616.2921, GNorm = 0.0187, lr_0 = 9.9638e-04
Validation binary_cross_entropy = 0.081787
Epoch 5394
Loss = 1.2870e-02, PNorm = 616.3158, GNorm = 0.3236, lr_0 = 9.9638e-04
Validation binary_cross_entropy = 0.078803
Epoch 5395
Loss = 1.5769e-03, PNorm = 616.3478, GNorm = 0.2271, lr_0 = 9.9638e-04
Validation binary_cross_entropy = 0.076522
Epoch 5396
Loss = 9.3940e-02, PNorm = 616.3688, GNorm = 0.1309, lr_0 = 9.9638e-04
Validation binary_cross_entropy = 0.059887
Epoch 5397
Loss = 2.1276e-02, PNorm = 616.4048, GNorm = 0.0534, lr_0 = 9.9638e-04
Validation binary_cross_entropy = 0.062460
Epoch 5398
Loss = 5.5966e-02, PNorm = 616.4650, GNorm = 4.1051, lr_0 = 9.9638e-04
Validation binary_cross_entropy = 0.068428
Epoch 5399
Loss = 3.4621e-03, PNorm = 616.5164, GNorm = 0.0986, lr_0 = 9.9638e-04
Loss = 2.4845e-02, PNorm = 616.5579, GNorm = 0.0972, lr_0 = 9.9638e-04
Validation binary_cross_entropy = 0.067343
Epoch 5400
Loss = 2.0105e-02, PNorm = 616.5904, GNorm = 1.2359, lr_0 = 9.9638e-04
Validation binary_cross_entropy = 0.068805
Epoch 5401
Loss = 8.3786e-03, PNorm = 616.6098, GNorm = 0.2689, lr_0 = 9.9638e-04
Validation binary_cross_entropy = 0.063281
Epoch 5402
Loss = 2.7058e-02, PNorm = 616.6410, GNorm = 0.4510, lr_0 = 9.9638e-04
Validation binary_cross_entropy = 0.071196
Epoch 5403
Loss = 4.3836e-02, PNorm = 616.6803, GNorm = 6.2248, lr_0 = 9.9637e-04
Validation binary_cross_entropy = 0.075862
Epoch 5404
Loss = 1.9617e-02, PNorm = 616.7224, GNorm = 3.9117, lr_0 = 9.9637e-04
Validation binary_cross_entropy = 0.071278
Epoch 5405
Loss = 3.6659e-02, PNorm = 616.7673, GNorm = 6.4993, lr_0 = 9.9637e-04
Validation binary_cross_entropy = 0.122779
Epoch 5406
Loss = 4.2210e-02, PNorm = 616.8230, GNorm = 2.2118, lr_0 = 9.9637e-04
Validation binary_cross_entropy = 0.077322
Epoch 5407
Loss = 4.0352e-02, PNorm = 616.8575, GNorm = 2.7892, lr_0 = 9.9637e-04
Validation binary_cross_entropy = 0.052904
Epoch 5408
Loss = 3.1784e-02, PNorm = 616.9095, GNorm = 0.5122, lr_0 = 9.9637e-04
Validation binary_cross_entropy = 0.083807
Epoch 5409
Loss = 9.2875e-02, PNorm = 616.9709, GNorm = 5.1488, lr_0 = 9.9637e-04
Loss = 1.4768e-02, PNorm = 617.0154, GNorm = 0.0460, lr_0 = 9.9637e-04
Validation binary_cross_entropy = 0.075200
Epoch 5410
Loss = 1.0417e-02, PNorm = 617.0539, GNorm = 0.1257, lr_0 = 9.9637e-04
Validation binary_cross_entropy = 0.079790
Epoch 5411
Loss = 3.1052e-02, PNorm = 617.0980, GNorm = 0.1969, lr_0 = 9.9637e-04
Validation binary_cross_entropy = 0.073197
Epoch 5412
Loss = 1.9336e-02, PNorm = 617.1372, GNorm = 1.4931, lr_0 = 9.9637e-04
Validation binary_cross_entropy = 0.072014
Epoch 5413
Loss = 6.3665e-03, PNorm = 617.1722, GNorm = 0.3148, lr_0 = 9.9637e-04
Validation binary_cross_entropy = 0.085147
Epoch 5414
Loss = 3.2907e-02, PNorm = 617.2048, GNorm = 1.3940, lr_0 = 9.9637e-04
Validation binary_cross_entropy = 0.073553
Epoch 5415
Loss = 2.4896e-02, PNorm = 617.2450, GNorm = 0.1856, lr_0 = 9.9637e-04
Validation binary_cross_entropy = 0.072531
Epoch 5416
Loss = 1.9855e-02, PNorm = 617.2966, GNorm = 0.0477, lr_0 = 9.9637e-04
Validation binary_cross_entropy = 0.081820
Epoch 5417
Loss = 3.8604e-02, PNorm = 617.3334, GNorm = 0.5087, lr_0 = 9.9636e-04
Validation binary_cross_entropy = 0.085036
Epoch 5418
Loss = 1.0879e-02, PNorm = 617.3634, GNorm = 0.4039, lr_0 = 9.9636e-04
Validation binary_cross_entropy = 0.096751
Epoch 5419
Loss = 1.6931e-03, PNorm = 617.4129, GNorm = 0.0888, lr_0 = 9.9636e-04
Loss = 1.3938e-02, PNorm = 617.4570, GNorm = 1.4129, lr_0 = 9.9636e-04
Validation binary_cross_entropy = 0.096664
Epoch 5420
Loss = 3.9219e-03, PNorm = 617.4866, GNorm = 0.1077, lr_0 = 9.9636e-04
Validation binary_cross_entropy = 0.101956
Epoch 5421
Loss = 1.2455e-02, PNorm = 617.5082, GNorm = 0.4568, lr_0 = 9.9636e-04
Validation binary_cross_entropy = 0.104698
Epoch 5422
Loss = 9.3275e-02, PNorm = 617.5544, GNorm = 2.8124, lr_0 = 9.9636e-04
Validation binary_cross_entropy = 0.073500
Epoch 5423
Loss = 8.7780e-02, PNorm = 617.6642, GNorm = 1.8904, lr_0 = 9.9636e-04
Validation binary_cross_entropy = 0.130056
Epoch 5424
Loss = 1.1687e-01, PNorm = 617.7383, GNorm = 2.5351, lr_0 = 9.9636e-04
Validation binary_cross_entropy = 0.060751
Epoch 5425
Loss = 3.0891e-02, PNorm = 617.8051, GNorm = 0.4188, lr_0 = 9.9636e-04
Validation binary_cross_entropy = 0.061182
Epoch 5426
Loss = 2.0339e-02, PNorm = 617.8778, GNorm = 0.7411, lr_0 = 9.9636e-04
Validation binary_cross_entropy = 0.088985
Epoch 5427
Loss = 2.1352e-02, PNorm = 617.9405, GNorm = 1.0134, lr_0 = 9.9636e-04
Validation binary_cross_entropy = 0.100899
Epoch 5428
Loss = 3.6858e-03, PNorm = 617.9833, GNorm = 0.1996, lr_0 = 9.9636e-04
Validation binary_cross_entropy = 0.141035
Epoch 5429
Loss = 1.7269e-02, PNorm = 618.0140, GNorm = 1.9115, lr_0 = 9.9636e-04
Loss = 3.5983e-02, PNorm = 618.0628, GNorm = 2.9295, lr_0 = 9.9635e-04
Validation binary_cross_entropy = 0.171165
Epoch 5430
Loss = 3.6036e-02, PNorm = 618.1080, GNorm = 0.3617, lr_0 = 9.9635e-04
Validation binary_cross_entropy = 0.108065
Epoch 5431
Loss = 6.1433e-02, PNorm = 618.1674, GNorm = 5.2614, lr_0 = 9.9635e-04
Validation binary_cross_entropy = 0.108945
Epoch 5432
Loss = 9.7397e-02, PNorm = 618.2272, GNorm = 1.3542, lr_0 = 9.9635e-04
Validation binary_cross_entropy = 0.082587
Epoch 5433
Loss = 5.9511e-02, PNorm = 618.2958, GNorm = 0.2122, lr_0 = 9.9635e-04
Validation binary_cross_entropy = 0.061784
Epoch 5434
Loss = 5.5147e-02, PNorm = 618.3748, GNorm = 1.4569, lr_0 = 9.9635e-04
Validation binary_cross_entropy = 0.064137
Epoch 5435
Loss = 4.5360e-02, PNorm = 618.4359, GNorm = 1.3343, lr_0 = 9.9635e-04
Validation binary_cross_entropy = 0.063284
Epoch 5436
Loss = 6.1473e-03, PNorm = 618.4979, GNorm = 0.2461, lr_0 = 9.9635e-04
Validation binary_cross_entropy = 0.096807
Epoch 5437
Loss = 8.9406e-03, PNorm = 618.5430, GNorm = 0.0733, lr_0 = 9.9635e-04
Validation binary_cross_entropy = 0.113991
Epoch 5438
Loss = 6.6616e-02, PNorm = 618.5717, GNorm = 1.3419, lr_0 = 9.9635e-04
Validation binary_cross_entropy = 0.107238
Epoch 5439
Loss = 3.0506e-03, PNorm = 618.5923, GNorm = 0.6321, lr_0 = 9.9635e-04
Loss = 6.7196e-02, PNorm = 618.6493, GNorm = 0.3437, lr_0 = 9.9635e-04
Validation binary_cross_entropy = 0.089232
Epoch 5440
Loss = 2.4498e-02, PNorm = 618.7168, GNorm = 0.8534, lr_0 = 9.9635e-04
Validation binary_cross_entropy = 0.089281
Epoch 5441
Loss = 2.2526e-02, PNorm = 618.7578, GNorm = 2.4828, lr_0 = 9.9635e-04
Validation binary_cross_entropy = 0.087164
Epoch 5442
Loss = 1.1133e-01, PNorm = 618.7967, GNorm = 5.4207, lr_0 = 9.9635e-04
Validation binary_cross_entropy = 0.082345
Epoch 5443
Loss = 3.2690e-02, PNorm = 618.8389, GNorm = 0.2378, lr_0 = 9.9634e-04
Validation binary_cross_entropy = 0.076461
Epoch 5444
Loss = 3.7425e-02, PNorm = 618.8978, GNorm = 0.9981, lr_0 = 9.9634e-04
Validation binary_cross_entropy = 0.141035
Epoch 5445
Loss = 1.9569e-02, PNorm = 618.9407, GNorm = 0.0124, lr_0 = 9.9634e-04
Validation binary_cross_entropy = 0.150498
Epoch 5446
Loss = 1.9314e-02, PNorm = 618.9626, GNorm = 0.0424, lr_0 = 9.9634e-04
Validation binary_cross_entropy = 0.126763
Epoch 5447
Loss = 1.3465e-01, PNorm = 618.9908, GNorm = 1.0115, lr_0 = 9.9634e-04
Validation binary_cross_entropy = 0.134394
Epoch 5448
Loss = 3.1667e-02, PNorm = 619.0421, GNorm = 2.8293, lr_0 = 9.9634e-04
Validation binary_cross_entropy = 0.187799
Epoch 5449
Loss = 1.0520e-02, PNorm = 619.0949, GNorm = 0.8021, lr_0 = 9.9634e-04
Loss = 1.5420e-02, PNorm = 619.1386, GNorm = 0.2468, lr_0 = 9.9634e-04
Validation binary_cross_entropy = 0.107335
Epoch 5450
Loss = 3.4855e-02, PNorm = 619.1852, GNorm = 0.3218, lr_0 = 9.9634e-04
Validation binary_cross_entropy = 0.102925
Epoch 5451
Loss = 3.1597e-02, PNorm = 619.2353, GNorm = 1.4037, lr_0 = 9.9634e-04
Validation binary_cross_entropy = 0.098931
Epoch 5452
Loss = 4.1608e-02, PNorm = 619.2994, GNorm = 1.6653, lr_0 = 9.9634e-04
Validation binary_cross_entropy = 0.104163
Epoch 5453
Loss = 2.4986e-02, PNorm = 619.3579, GNorm = 1.2518, lr_0 = 9.9634e-04
Validation binary_cross_entropy = 0.097013
Epoch 5454
Loss = 1.1163e-01, PNorm = 619.4398, GNorm = 2.4059, lr_0 = 9.9634e-04
Validation binary_cross_entropy = 0.079722
Epoch 5455
Loss = 3.3239e-02, PNorm = 619.5287, GNorm = 1.0297, lr_0 = 9.9634e-04
Validation binary_cross_entropy = 0.095069
Epoch 5456
Loss = 1.5894e-02, PNorm = 619.6168, GNorm = 0.3578, lr_0 = 9.9633e-04
Validation binary_cross_entropy = 0.098030
Epoch 5457
Loss = 7.5431e-03, PNorm = 619.6880, GNorm = 0.5898, lr_0 = 9.9633e-04
Validation binary_cross_entropy = 0.113371
Epoch 5458
Loss = 5.3079e-03, PNorm = 619.7325, GNorm = 0.0293, lr_0 = 9.9633e-04
Validation binary_cross_entropy = 0.098459
Epoch 5459
Loss = 5.8496e-03, PNorm = 619.7844, GNorm = 0.1942, lr_0 = 9.9633e-04
Loss = 2.1234e-02, PNorm = 619.8473, GNorm = 1.6550, lr_0 = 9.9633e-04
Validation binary_cross_entropy = 0.108016
Epoch 5460
Loss = 1.5728e-02, PNorm = 619.8927, GNorm = 0.1197, lr_0 = 9.9633e-04
Validation binary_cross_entropy = 0.118705
Epoch 5461
Loss = 1.0753e-02, PNorm = 619.9252, GNorm = 0.8900, lr_0 = 9.9633e-04
Validation binary_cross_entropy = 0.146825
Epoch 5462
Loss = 2.0267e-02, PNorm = 619.9576, GNorm = 0.0517, lr_0 = 9.9633e-04
Validation binary_cross_entropy = 0.138124
Epoch 5463
Loss = 7.2441e-03, PNorm = 619.9870, GNorm = 1.5260, lr_0 = 9.9633e-04
Validation binary_cross_entropy = 0.130061
Epoch 5464
Loss = 6.6300e-03, PNorm = 620.0161, GNorm = 0.2867, lr_0 = 9.9633e-04
Validation binary_cross_entropy = 0.136209
Epoch 5465
Loss = 5.0015e-02, PNorm = 620.0538, GNorm = 0.9483, lr_0 = 9.9633e-04
Validation binary_cross_entropy = 0.128941
Epoch 5466
Loss = 1.7099e-02, PNorm = 620.0835, GNorm = 0.0434, lr_0 = 9.9633e-04
Validation binary_cross_entropy = 0.085434
Epoch 5467
Loss = 7.4814e-03, PNorm = 620.1389, GNorm = 0.1565, lr_0 = 9.9633e-04
Validation binary_cross_entropy = 0.077210
Epoch 5468
Loss = 4.2238e-02, PNorm = 620.2212, GNorm = 0.6406, lr_0 = 9.9633e-04
Validation binary_cross_entropy = 0.089258
Epoch 5469
Loss = 7.3482e-03, PNorm = 620.2764, GNorm = 0.3303, lr_0 = 9.9633e-04
Loss = 2.9590e-02, PNorm = 620.3222, GNorm = 0.0529, lr_0 = 9.9632e-04
Validation binary_cross_entropy = 0.083055
Epoch 5470
Loss = 1.8560e-02, PNorm = 620.3593, GNorm = 0.3579, lr_0 = 9.9632e-04
Validation binary_cross_entropy = 0.075062
Epoch 5471
Loss = 6.1309e-02, PNorm = 620.4113, GNorm = 0.1421, lr_0 = 9.9632e-04
Validation binary_cross_entropy = 0.077124
Epoch 5472
Loss = 1.8698e-02, PNorm = 620.4616, GNorm = 0.4198, lr_0 = 9.9632e-04
Validation binary_cross_entropy = 0.072802
Epoch 5473
Loss = 2.2722e-02, PNorm = 620.5054, GNorm = 0.0679, lr_0 = 9.9632e-04
Validation binary_cross_entropy = 0.083658
Epoch 5474
Loss = 4.1814e-02, PNorm = 620.5425, GNorm = 0.0615, lr_0 = 9.9632e-04
Validation binary_cross_entropy = 0.079599
Epoch 5475
Loss = 8.4815e-03, PNorm = 620.5760, GNorm = 0.8056, lr_0 = 9.9632e-04
Validation binary_cross_entropy = 0.075296
Epoch 5476
Loss = 1.0943e-02, PNorm = 620.6184, GNorm = 0.8394, lr_0 = 9.9632e-04
Validation binary_cross_entropy = 0.087736
Epoch 5477
Loss = 1.1156e-03, PNorm = 620.6692, GNorm = 0.0989, lr_0 = 9.9632e-04
Validation binary_cross_entropy = 0.101636
Epoch 5478
Loss = 1.2635e-01, PNorm = 620.7174, GNorm = 5.8354, lr_0 = 9.9632e-04
Validation binary_cross_entropy = 0.084590
Epoch 5479
Loss = 1.1255e-03, PNorm = 620.7584, GNorm = 0.0398, lr_0 = 9.9632e-04
Loss = 3.1303e-02, PNorm = 620.7905, GNorm = 0.0279, lr_0 = 9.9632e-04
Validation binary_cross_entropy = 0.091440
Epoch 5480
Loss = 7.4414e-03, PNorm = 620.8198, GNorm = 0.1556, lr_0 = 9.9632e-04
Validation binary_cross_entropy = 0.098206
Epoch 5481
Loss = 4.3004e-02, PNorm = 620.8417, GNorm = 22.0726, lr_0 = 9.9632e-04
Validation binary_cross_entropy = 0.084383
Epoch 5482
Loss = 2.0821e-02, PNorm = 620.9091, GNorm = 2.8172, lr_0 = 9.9631e-04
Validation binary_cross_entropy = 0.084130
Epoch 5483
Loss = 4.9100e-02, PNorm = 620.9904, GNorm = 0.6087, lr_0 = 9.9631e-04
Validation binary_cross_entropy = 0.077299
Epoch 5484
Loss = 8.1305e-02, PNorm = 621.0660, GNorm = 0.0621, lr_0 = 9.9631e-04
Validation binary_cross_entropy = 0.106363
Epoch 5485
Loss = 7.6715e-02, PNorm = 621.1232, GNorm = 0.0952, lr_0 = 9.9631e-04
Validation binary_cross_entropy = 0.086116
Epoch 5486
Loss = 1.2983e-02, PNorm = 621.1838, GNorm = 0.1557, lr_0 = 9.9631e-04
Validation binary_cross_entropy = 0.098394
Epoch 5487
Loss = 8.8210e-02, PNorm = 621.2398, GNorm = 3.9735, lr_0 = 9.9631e-04
Validation binary_cross_entropy = 0.087597
Epoch 5488
Loss = 1.0752e-02, PNorm = 621.3005, GNorm = 0.5595, lr_0 = 9.9631e-04
Validation binary_cross_entropy = 0.082618
Epoch 5489
Loss = 4.5233e-02, PNorm = 621.3571, GNorm = 2.4794, lr_0 = 9.9631e-04
Loss = 2.0420e-02, PNorm = 621.4209, GNorm = 2.8459, lr_0 = 9.9631e-04
Validation binary_cross_entropy = 0.089674
Epoch 5490
Loss = 4.3806e-02, PNorm = 621.4754, GNorm = 0.7010, lr_0 = 9.9631e-04
Validation binary_cross_entropy = 0.091172
Epoch 5491
Loss = 1.6014e-02, PNorm = 621.5390, GNorm = 0.4555, lr_0 = 9.9631e-04
Validation binary_cross_entropy = 0.101881
Epoch 5492
Loss = 1.4900e-02, PNorm = 621.5962, GNorm = 0.1446, lr_0 = 9.9631e-04
Validation binary_cross_entropy = 0.105069
Epoch 5493
Loss = 1.6715e-02, PNorm = 621.6390, GNorm = 3.3699, lr_0 = 9.9631e-04
Validation binary_cross_entropy = 0.106845
Epoch 5494
Loss = 4.6059e-02, PNorm = 621.6874, GNorm = 0.0768, lr_0 = 9.9631e-04
Validation binary_cross_entropy = 0.100049
Epoch 5495
Loss = 1.7252e-02, PNorm = 621.7539, GNorm = 0.3802, lr_0 = 9.9631e-04
Validation binary_cross_entropy = 0.121705
Epoch 5496
Loss = 1.2042e-02, PNorm = 621.8090, GNorm = 0.0654, lr_0 = 9.9630e-04
Validation binary_cross_entropy = 0.127742
Epoch 5497
Loss = 6.0420e-02, PNorm = 621.8471, GNorm = 2.1901, lr_0 = 9.9630e-04
Validation binary_cross_entropy = 0.072609
Epoch 5498
Loss = 3.3549e-02, PNorm = 621.9017, GNorm = 2.1488, lr_0 = 9.9630e-04
Validation binary_cross_entropy = 0.056532
Epoch 5499
Loss = 3.2005e-02, PNorm = 621.9789, GNorm = 0.8954, lr_0 = 9.9630e-04
Loss = 2.0017e-02, PNorm = 622.0541, GNorm = 1.5984, lr_0 = 9.9630e-04
Validation binary_cross_entropy = 0.068762
Epoch 5500
Loss = 5.2322e-02, PNorm = 622.1081, GNorm = 1.3668, lr_0 = 9.9630e-04
Validation binary_cross_entropy = 0.069069
Epoch 5501
Loss = 4.5964e-02, PNorm = 622.1499, GNorm = 0.9450, lr_0 = 9.9630e-04
Validation binary_cross_entropy = 0.056048
Epoch 5502
Loss = 1.8577e-02, PNorm = 622.2060, GNorm = 1.0787, lr_0 = 9.9630e-04
Validation binary_cross_entropy = 0.062196
Epoch 5503
Loss = 4.3478e-02, PNorm = 622.2537, GNorm = 1.0687, lr_0 = 9.9630e-04
Validation binary_cross_entropy = 0.074593
Epoch 5504
Loss = 4.7069e-02, PNorm = 622.3015, GNorm = 1.2256, lr_0 = 9.9630e-04
Validation binary_cross_entropy = 0.077590
Epoch 5505
Loss = 3.2976e-02, PNorm = 622.3423, GNorm = 1.1682, lr_0 = 9.9630e-04
Validation binary_cross_entropy = 0.078820
Epoch 5506
Loss = 4.3479e-02, PNorm = 622.3912, GNorm = 3.2283, lr_0 = 9.9630e-04
Validation binary_cross_entropy = 0.080045
Epoch 5507
Loss = 8.7019e-03, PNorm = 622.4569, GNorm = 0.4129, lr_0 = 9.9630e-04
Validation binary_cross_entropy = 0.095444
Epoch 5508
Loss = 7.7344e-01, PNorm = 622.5096, GNorm = 35.9686, lr_0 = 9.9630e-04
Validation binary_cross_entropy = 0.060359
Epoch 5509
Loss = 4.1014e-02, PNorm = 622.6086, GNorm = 1.4441, lr_0 = 9.9630e-04
Loss = 7.7962e-02, PNorm = 622.7045, GNorm = 4.6089, lr_0 = 9.9629e-04
Validation binary_cross_entropy = 0.057558
Epoch 5510
Loss = 8.4670e-02, PNorm = 622.7804, GNorm = 2.4111, lr_0 = 9.9629e-04
Validation binary_cross_entropy = 0.057952
Epoch 5511
Loss = 5.6090e-02, PNorm = 622.8468, GNorm = 1.8445, lr_0 = 9.9629e-04
Validation binary_cross_entropy = 0.071514
Epoch 5512
Loss = 3.1549e-02, PNorm = 622.9063, GNorm = 1.6729, lr_0 = 9.9629e-04
Validation binary_cross_entropy = 0.060390
Epoch 5513
Loss = 2.2398e-02, PNorm = 622.9551, GNorm = 2.5216, lr_0 = 9.9629e-04
Validation binary_cross_entropy = 0.061075
Epoch 5514
Loss = 2.4665e-02, PNorm = 623.0234, GNorm = 0.1151, lr_0 = 9.9629e-04
Validation binary_cross_entropy = 0.106745
Epoch 5515
Loss = 3.1381e-02, PNorm = 623.0775, GNorm = 1.1269, lr_0 = 9.9629e-04
Validation binary_cross_entropy = 0.077895
Epoch 5516
Loss = 3.1095e-02, PNorm = 623.1118, GNorm = 0.2082, lr_0 = 9.9629e-04
Validation binary_cross_entropy = 0.062574
Epoch 5517
Loss = 5.1910e-02, PNorm = 623.1557, GNorm = 1.7679, lr_0 = 9.9629e-04
Validation binary_cross_entropy = 0.064177
Epoch 5518
Loss = 8.0392e-02, PNorm = 623.2266, GNorm = 0.2186, lr_0 = 9.9629e-04
Validation binary_cross_entropy = 0.072595
Epoch 5519
Loss = 3.5239e-02, PNorm = 623.2758, GNorm = 0.4480, lr_0 = 9.9629e-04
Loss = 1.8164e-02, PNorm = 623.3081, GNorm = 1.3992, lr_0 = 9.9629e-04
Validation binary_cross_entropy = 0.068330
Epoch 5520
Loss = 1.7952e-02, PNorm = 623.3476, GNorm = 0.5255, lr_0 = 9.9629e-04
Validation binary_cross_entropy = 0.073470
Epoch 5521
Loss = 2.0044e-02, PNorm = 623.3878, GNorm = 0.0717, lr_0 = 9.9629e-04
Validation binary_cross_entropy = 0.087515
Epoch 5522
Loss = 1.8233e-02, PNorm = 623.4182, GNorm = 1.5186, lr_0 = 9.9628e-04
Validation binary_cross_entropy = 0.082112
Epoch 5523
Loss = 3.9276e-02, PNorm = 623.4453, GNorm = 0.0990, lr_0 = 9.9628e-04
Validation binary_cross_entropy = 0.073828
Epoch 5524
Loss = 1.5293e-02, PNorm = 623.4945, GNorm = 0.3559, lr_0 = 9.9628e-04
Validation binary_cross_entropy = 0.087436
Epoch 5525
Loss = 5.4363e-03, PNorm = 623.5406, GNorm = 0.0412, lr_0 = 9.9628e-04
Validation binary_cross_entropy = 0.091351
Epoch 5526
Loss = 5.7729e-02, PNorm = 623.5826, GNorm = 0.8128, lr_0 = 9.9628e-04
Validation binary_cross_entropy = 0.096873
Epoch 5527
Loss = 1.6585e-02, PNorm = 623.6323, GNorm = 0.6775, lr_0 = 9.9628e-04
Validation binary_cross_entropy = 0.073913
Epoch 5528
Loss = 1.9774e-02, PNorm = 623.6634, GNorm = 0.1273, lr_0 = 9.9628e-04
Validation binary_cross_entropy = 0.089849
Epoch 5529
Loss = 2.9850e-03, PNorm = 623.7028, GNorm = 0.3975, lr_0 = 9.9628e-04
Loss = 9.8851e-03, PNorm = 623.7292, GNorm = 0.1256, lr_0 = 9.9628e-04
Validation binary_cross_entropy = 0.082096
Epoch 5530
Loss = 9.6251e-03, PNorm = 623.7511, GNorm = 0.0694, lr_0 = 9.9628e-04
Validation binary_cross_entropy = 0.085296
Epoch 5531
Loss = 4.0655e-02, PNorm = 623.7827, GNorm = 1.0312, lr_0 = 9.9628e-04
Validation binary_cross_entropy = 0.067253
Epoch 5532
Loss = 3.4719e-02, PNorm = 623.8231, GNorm = 3.6426, lr_0 = 9.9628e-04
Validation binary_cross_entropy = 0.062937
Epoch 5533
Loss = 3.9547e-02, PNorm = 623.8686, GNorm = 0.3680, lr_0 = 9.9628e-04
Validation binary_cross_entropy = 0.058117
Epoch 5534
Loss = 2.8049e-02, PNorm = 623.9183, GNorm = 0.1118, lr_0 = 9.9628e-04
Validation binary_cross_entropy = 0.058737
Epoch 5535
Loss = 6.9229e-03, PNorm = 623.9587, GNorm = 0.1264, lr_0 = 9.9628e-04
Validation binary_cross_entropy = 0.060302
Epoch 5536
Loss = 1.0414e-02, PNorm = 623.9894, GNorm = 0.1363, lr_0 = 9.9627e-04
Validation binary_cross_entropy = 0.065752
Epoch 5537
Loss = 1.1505e-02, PNorm = 624.0322, GNorm = 0.9872, lr_0 = 9.9627e-04
Validation binary_cross_entropy = 0.079868
Epoch 5538
Loss = 1.2862e-02, PNorm = 624.0645, GNorm = 0.5883, lr_0 = 9.9627e-04
Validation binary_cross_entropy = 0.064908
Epoch 5539
Loss = 1.8146e-02, PNorm = 624.1013, GNorm = 0.8575, lr_0 = 9.9627e-04
Loss = 2.8284e-02, PNorm = 624.1770, GNorm = 1.0372, lr_0 = 9.9627e-04
Validation binary_cross_entropy = 0.078063
Epoch 5540
Loss = 1.1413e-02, PNorm = 624.2283, GNorm = 0.4244, lr_0 = 9.9627e-04
Validation binary_cross_entropy = 0.079007
Epoch 5541
Loss = 2.4357e-02, PNorm = 624.2605, GNorm = 0.0615, lr_0 = 9.9627e-04
Validation binary_cross_entropy = 0.080053
Epoch 5542
Loss = 3.7272e-02, PNorm = 624.2893, GNorm = 0.0486, lr_0 = 9.9627e-04
Validation binary_cross_entropy = 0.083832
Epoch 5543
Loss = 3.7529e-02, PNorm = 624.3290, GNorm = 0.3751, lr_0 = 9.9627e-04
Validation binary_cross_entropy = 0.069666
Epoch 5544
Loss = 2.8926e-02, PNorm = 624.3634, GNorm = 0.2202, lr_0 = 9.9627e-04
Validation binary_cross_entropy = 0.069263
Epoch 5545
Loss = 6.4175e-02, PNorm = 624.4049, GNorm = 1.8467, lr_0 = 9.9627e-04
Validation binary_cross_entropy = 0.075522
Epoch 5546
Loss = 1.4102e-02, PNorm = 624.4676, GNorm = 0.0797, lr_0 = 9.9627e-04
Validation binary_cross_entropy = 0.085024
Epoch 5547
Loss = 1.7957e-02, PNorm = 624.5108, GNorm = 0.9428, lr_0 = 9.9627e-04
Validation binary_cross_entropy = 0.068440
Epoch 5548
Loss = 1.3703e-02, PNorm = 624.5440, GNorm = 0.1482, lr_0 = 9.9627e-04
Validation binary_cross_entropy = 0.067607
Epoch 5549
Loss = 6.7565e-03, PNorm = 624.5881, GNorm = 0.2479, lr_0 = 9.9626e-04
Loss = 3.7693e-02, PNorm = 624.6292, GNorm = 1.1842, lr_0 = 9.9626e-04
Validation binary_cross_entropy = 0.068330
Epoch 5550
Loss = 3.4039e-02, PNorm = 624.6721, GNorm = 0.9100, lr_0 = 9.9626e-04
Validation binary_cross_entropy = 0.068494
Epoch 5551
Loss = 1.5898e-02, PNorm = 624.7265, GNorm = 1.0037, lr_0 = 9.9626e-04
Validation binary_cross_entropy = 0.076022
Epoch 5552
Loss = 2.8254e-02, PNorm = 624.7720, GNorm = 0.6890, lr_0 = 9.9626e-04
Validation binary_cross_entropy = 0.078540
Epoch 5553
Loss = 3.6738e-03, PNorm = 624.8072, GNorm = 0.0571, lr_0 = 9.9626e-04
Validation binary_cross_entropy = 0.081580
Epoch 5554
Loss = 5.6254e-02, PNorm = 624.8376, GNorm = 6.6122, lr_0 = 9.9626e-04
Validation binary_cross_entropy = 0.075247
Epoch 5555
Loss = 1.8796e-02, PNorm = 624.8922, GNorm = 0.6929, lr_0 = 9.9626e-04
Validation binary_cross_entropy = 0.082652
Epoch 5556
Loss = 3.1124e-02, PNorm = 624.9697, GNorm = 1.2343, lr_0 = 9.9626e-04
Validation binary_cross_entropy = 0.082906
Epoch 5557
Loss = 1.8026e-02, PNorm = 625.0069, GNorm = 1.6383, lr_0 = 9.9626e-04
Validation binary_cross_entropy = 0.077002
Epoch 5558
Loss = 2.8703e-02, PNorm = 625.0400, GNorm = 0.4002, lr_0 = 9.9626e-04
Validation binary_cross_entropy = 0.081738
Epoch 5559
Loss = 5.6393e-04, PNorm = 625.0883, GNorm = 0.0217, lr_0 = 9.9626e-04
Loss = 2.9437e-02, PNorm = 625.1206, GNorm = 0.0229, lr_0 = 9.9626e-04
Validation binary_cross_entropy = 0.087559
Epoch 5560
Loss = 7.1563e-03, PNorm = 625.1485, GNorm = 0.1448, lr_0 = 9.9626e-04
Validation binary_cross_entropy = 0.085788
Epoch 5561
Loss = 2.4310e-02, PNorm = 625.1700, GNorm = 0.0655, lr_0 = 9.9626e-04
Validation binary_cross_entropy = 0.083182
Epoch 5562
Loss = 7.5904e-03, PNorm = 625.1864, GNorm = 0.0718, lr_0 = 9.9625e-04
Validation binary_cross_entropy = 0.083568
Epoch 5563
Loss = 2.8216e-02, PNorm = 625.2069, GNorm = 0.2839, lr_0 = 9.9625e-04
Validation binary_cross_entropy = 0.080535
Epoch 5564
Loss = 2.4988e-02, PNorm = 625.2432, GNorm = 4.7400, lr_0 = 9.9625e-04
Validation binary_cross_entropy = 0.080169
Epoch 5565
Loss = 4.8020e-02, PNorm = 625.2964, GNorm = 3.4901, lr_0 = 9.9625e-04
Validation binary_cross_entropy = 0.079641
Epoch 5566
Loss = 1.4199e-03, PNorm = 625.3382, GNorm = 0.0341, lr_0 = 9.9625e-04
Validation binary_cross_entropy = 0.079149
Epoch 5567
Loss = 3.6475e-02, PNorm = 625.3623, GNorm = 2.6682, lr_0 = 9.9625e-04
Validation binary_cross_entropy = 0.076759
Epoch 5568
Loss = 1.2798e-02, PNorm = 625.3963, GNorm = 0.0343, lr_0 = 9.9625e-04
Validation binary_cross_entropy = 0.080189
Epoch 5569
Loss = 1.1712e-02, PNorm = 625.4381, GNorm = 0.5003, lr_0 = 9.9625e-04
Loss = 2.0302e-02, PNorm = 625.4720, GNorm = 0.0272, lr_0 = 9.9625e-04
Validation binary_cross_entropy = 0.079441
Epoch 5570
Loss = 1.0414e-01, PNorm = 625.4950, GNorm = 1.5150, lr_0 = 9.9625e-04
Validation binary_cross_entropy = 0.067531
Epoch 5571
Loss = 2.3767e-02, PNorm = 625.5421, GNorm = 0.3302, lr_0 = 9.9625e-04
Validation binary_cross_entropy = 0.067670
Epoch 5572
Loss = 1.9794e-02, PNorm = 625.6011, GNorm = 0.0364, lr_0 = 9.9625e-04
Validation binary_cross_entropy = 0.080429
Epoch 5573
Loss = 1.5452e-02, PNorm = 625.6451, GNorm = 0.6729, lr_0 = 9.9625e-04
Validation binary_cross_entropy = 0.075369
Epoch 5574
Loss = 2.0982e-02, PNorm = 625.6768, GNorm = 0.7652, lr_0 = 9.9625e-04
Validation binary_cross_entropy = 0.072114
Epoch 5575
Loss = 9.4703e-03, PNorm = 625.7100, GNorm = 0.2603, lr_0 = 9.9625e-04
Validation binary_cross_entropy = 0.075027
Epoch 5576
Loss = 1.1566e-02, PNorm = 625.7489, GNorm = 0.0158, lr_0 = 9.9624e-04
Validation binary_cross_entropy = 0.077280
Epoch 5577
Loss = 1.7447e-02, PNorm = 625.7786, GNorm = 0.7243, lr_0 = 9.9624e-04
Validation binary_cross_entropy = 0.079054
Epoch 5578
Loss = 2.2662e-02, PNorm = 625.8029, GNorm = 0.0223, lr_0 = 9.9624e-04
Validation binary_cross_entropy = 0.075696
Epoch 5579
Loss = 2.3569e-02, PNorm = 625.8274, GNorm = 1.4661, lr_0 = 9.9624e-04
Loss = 1.2068e-02, PNorm = 625.8582, GNorm = 0.0091, lr_0 = 9.9624e-04
Validation binary_cross_entropy = 0.081846
Epoch 5580
Loss = 1.9821e-02, PNorm = 625.8781, GNorm = 1.1596, lr_0 = 9.9624e-04
Validation binary_cross_entropy = 0.076661
Epoch 5581
Loss = 2.3860e-02, PNorm = 625.9152, GNorm = 0.1218, lr_0 = 9.9624e-04
Validation binary_cross_entropy = 0.070628
Epoch 5582
Loss = 6.6500e-02, PNorm = 625.9598, GNorm = 0.3596, lr_0 = 9.9624e-04
Validation binary_cross_entropy = 0.062439
Epoch 5583
Loss = 2.6929e-02, PNorm = 625.9984, GNorm = 0.6473, lr_0 = 9.9624e-04
Validation binary_cross_entropy = 0.057976
Epoch 5584
Loss = 1.5220e-02, PNorm = 626.0405, GNorm = 0.3598, lr_0 = 9.9624e-04
Validation binary_cross_entropy = 0.063131
Epoch 5585
Loss = 4.4149e-02, PNorm = 626.1008, GNorm = 1.8399, lr_0 = 9.9624e-04
Validation binary_cross_entropy = 0.066274
Epoch 5586
Loss = 1.2117e-02, PNorm = 626.1348, GNorm = 0.6481, lr_0 = 9.9624e-04
Validation binary_cross_entropy = 0.065424
Epoch 5587
Loss = 4.5691e-02, PNorm = 626.1717, GNorm = 0.4373, lr_0 = 9.9624e-04
Validation binary_cross_entropy = 0.062507
Epoch 5588
Loss = 1.5026e-02, PNorm = 626.2115, GNorm = 0.9169, lr_0 = 9.9624e-04
Validation binary_cross_entropy = 0.064301
Epoch 5589
Loss = 4.8606e-03, PNorm = 626.2533, GNorm = 0.1716, lr_0 = 9.9623e-04
Loss = 1.0541e-02, PNorm = 626.2924, GNorm = 0.2738, lr_0 = 9.9623e-04
Validation binary_cross_entropy = 0.067443
Epoch 5590
Loss = 7.9721e-03, PNorm = 626.3236, GNorm = 0.1567, lr_0 = 9.9623e-04
Validation binary_cross_entropy = 0.072107
Epoch 5591
Loss = 2.2301e-03, PNorm = 626.3454, GNorm = 0.3618, lr_0 = 9.9623e-04
Validation binary_cross_entropy = 0.081963
Epoch 5592
Loss = 1.4538e-02, PNorm = 626.3671, GNorm = 0.1311, lr_0 = 9.9623e-04
Validation binary_cross_entropy = 0.071965
Epoch 5593
Loss = 9.6055e-03, PNorm = 626.3924, GNorm = 4.9626, lr_0 = 9.9623e-04
Validation binary_cross_entropy = 0.069194
Epoch 5594
Loss = 1.3210e-02, PNorm = 626.4276, GNorm = 0.2896, lr_0 = 9.9623e-04
Validation binary_cross_entropy = 0.072054
Epoch 5595
Loss = 2.8200e-03, PNorm = 626.4642, GNorm = 0.0304, lr_0 = 9.9623e-04
Validation binary_cross_entropy = 0.068594
Epoch 5596
Loss = 9.9454e-03, PNorm = 626.4845, GNorm = 0.3788, lr_0 = 9.9623e-04
Validation binary_cross_entropy = 0.068788
Epoch 5597
Loss = 5.6910e-03, PNorm = 626.5141, GNorm = 0.8535, lr_0 = 9.9623e-04
Validation binary_cross_entropy = 0.071051
Epoch 5598
Loss = 1.4338e-03, PNorm = 626.5455, GNorm = 0.0378, lr_0 = 9.9623e-04
Validation binary_cross_entropy = 0.074838
Epoch 5599
Loss = 1.8318e-03, PNorm = 626.5716, GNorm = 0.0606, lr_0 = 9.9623e-04
Loss = 2.8707e-02, PNorm = 626.6022, GNorm = 0.9094, lr_0 = 9.9623e-04
Validation binary_cross_entropy = 0.078412
Epoch 5600
Loss = 5.4002e-02, PNorm = 626.6486, GNorm = 0.0404, lr_0 = 9.9623e-04
Validation binary_cross_entropy = 0.074879
Epoch 5601
Loss = 3.2262e-02, PNorm = 626.7034, GNorm = 0.0282, lr_0 = 9.9623e-04
Validation binary_cross_entropy = 0.073493
Epoch 5602
Loss = 1.3510e-02, PNorm = 626.7536, GNorm = 1.3769, lr_0 = 9.9622e-04
Validation binary_cross_entropy = 0.080106
Epoch 5603
Loss = 3.7880e-02, PNorm = 626.7868, GNorm = 1.8552, lr_0 = 9.9622e-04
Validation binary_cross_entropy = 0.076536
Epoch 5604
Loss = 1.6517e-02, PNorm = 626.8190, GNorm = 0.0407, lr_0 = 9.9622e-04
Validation binary_cross_entropy = 0.094580
Epoch 5605
Loss = 3.1587e-02, PNorm = 626.8655, GNorm = 0.1126, lr_0 = 9.9622e-04
Validation binary_cross_entropy = 0.089542
Epoch 5606
Loss = 1.6648e-02, PNorm = 626.9143, GNorm = 0.2348, lr_0 = 9.9622e-04
Validation binary_cross_entropy = 0.075185
Epoch 5607
Loss = 2.2567e-01, PNorm = 626.9872, GNorm = 0.2999, lr_0 = 9.9622e-04
Validation binary_cross_entropy = 0.066406
Epoch 5608
Loss = 4.2081e-02, PNorm = 627.1227, GNorm = 2.5831, lr_0 = 9.9622e-04
Validation binary_cross_entropy = 0.072167
Epoch 5609
Loss = 7.2414e-02, PNorm = 627.2238, GNorm = 1.5930, lr_0 = 9.9622e-04
Loss = 2.6552e-02, PNorm = 627.3049, GNorm = 0.0919, lr_0 = 9.9622e-04
Validation binary_cross_entropy = 0.085390
Epoch 5610
Loss = 7.1824e-02, PNorm = 627.3599, GNorm = 1.3546, lr_0 = 9.9622e-04
Validation binary_cross_entropy = 0.074325
Epoch 5611
Loss = 5.1937e-02, PNorm = 627.4458, GNorm = 0.1059, lr_0 = 9.9622e-04
Validation binary_cross_entropy = 0.087076
Epoch 5612
Loss = 4.1763e-02, PNorm = 627.5159, GNorm = 0.7206, lr_0 = 9.9622e-04
Validation binary_cross_entropy = 0.073275
Epoch 5613
Loss = 2.4430e-02, PNorm = 627.5784, GNorm = 0.7651, lr_0 = 9.9622e-04
Validation binary_cross_entropy = 0.078215
Epoch 5614
Loss = 2.5352e-02, PNorm = 627.6262, GNorm = 0.5321, lr_0 = 9.9622e-04
Validation binary_cross_entropy = 0.070474
Epoch 5615
Loss = 1.3277e-01, PNorm = 627.6631, GNorm = 16.2133, lr_0 = 9.9621e-04
Validation binary_cross_entropy = 0.068871
Epoch 5616
Loss = 2.1570e-02, PNorm = 627.7429, GNorm = 0.1086, lr_0 = 9.9621e-04
Validation binary_cross_entropy = 0.109865
Epoch 5617
Loss = 8.9844e-02, PNorm = 627.8197, GNorm = 0.3704, lr_0 = 9.9621e-04
Validation binary_cross_entropy = 0.068555
Epoch 5618
Loss = 1.8447e-02, PNorm = 627.8840, GNorm = 0.9628, lr_0 = 9.9621e-04
Validation binary_cross_entropy = 0.077559
Epoch 5619
Loss = 5.8000e-03, PNorm = 627.9529, GNorm = 0.3191, lr_0 = 9.9621e-04
Loss = 4.6319e-02, PNorm = 628.0084, GNorm = 1.2680, lr_0 = 9.9621e-04
Validation binary_cross_entropy = 0.068300
Epoch 5620
Loss = 3.9690e-02, PNorm = 628.0634, GNorm = 4.1014, lr_0 = 9.9621e-04
Validation binary_cross_entropy = 0.075421
Epoch 5621
Loss = 5.8994e-02, PNorm = 628.1339, GNorm = 0.2091, lr_0 = 9.9621e-04
Validation binary_cross_entropy = 0.077837
Epoch 5622
Loss = 1.6638e-02, PNorm = 628.1791, GNorm = 0.1783, lr_0 = 9.9621e-04
Validation binary_cross_entropy = 0.068487
Epoch 5623
Loss = 2.0869e-02, PNorm = 628.2345, GNorm = 0.6531, lr_0 = 9.9621e-04
Validation binary_cross_entropy = 0.065691
Epoch 5624
Loss = 2.3070e-02, PNorm = 628.2924, GNorm = 1.7450, lr_0 = 9.9621e-04
Validation binary_cross_entropy = 0.064480
Epoch 5625
Loss = 6.1040e-02, PNorm = 628.3499, GNorm = 0.4118, lr_0 = 9.9621e-04
Validation binary_cross_entropy = 0.061900
Epoch 5626
Loss = 4.1089e-02, PNorm = 628.4078, GNorm = 4.5979, lr_0 = 9.9621e-04
Validation binary_cross_entropy = 0.065884
Epoch 5627
Loss = 7.9428e-03, PNorm = 628.4496, GNorm = 0.3850, lr_0 = 9.9621e-04
Validation binary_cross_entropy = 0.067764
Epoch 5628
Loss = 1.1653e-02, PNorm = 628.4925, GNorm = 0.4890, lr_0 = 9.9621e-04
Validation binary_cross_entropy = 0.077761
Epoch 5629
Loss = 4.0863e-03, PNorm = 628.5300, GNorm = 0.1369, lr_0 = 9.9620e-04
Loss = 1.2038e-02, PNorm = 628.5554, GNorm = 0.0276, lr_0 = 9.9620e-04
Validation binary_cross_entropy = 0.087565
Epoch 5630
Loss = 4.5780e-02, PNorm = 628.5779, GNorm = 0.8117, lr_0 = 9.9620e-04
Validation binary_cross_entropy = 0.076853
Epoch 5631
Loss = 1.8809e-02, PNorm = 628.6153, GNorm = 1.9464, lr_0 = 9.9620e-04
Validation binary_cross_entropy = 0.083729
Epoch 5632
Loss = 1.7016e-02, PNorm = 628.6586, GNorm = 0.5203, lr_0 = 9.9620e-04
Validation binary_cross_entropy = 0.093483
Epoch 5633
Loss = 3.6826e-02, PNorm = 628.6897, GNorm = 0.0522, lr_0 = 9.9620e-04
Validation binary_cross_entropy = 0.084378
Epoch 5634
Loss = 1.7200e-02, PNorm = 628.7219, GNorm = 3.3848, lr_0 = 9.9620e-04
Validation binary_cross_entropy = 0.095465
Epoch 5635
Loss = 1.0160e-02, PNorm = 628.7676, GNorm = 0.1478, lr_0 = 9.9620e-04
Validation binary_cross_entropy = 0.099026
Epoch 5636
Loss = 2.8570e-02, PNorm = 628.8029, GNorm = 2.9598, lr_0 = 9.9620e-04
Validation binary_cross_entropy = 0.099833
Epoch 5637
Loss = 4.2725e-03, PNorm = 628.8397, GNorm = 0.0175, lr_0 = 9.9620e-04
Validation binary_cross_entropy = 0.106478
Epoch 5638
Loss = 3.6868e-02, PNorm = 628.8859, GNorm = 9.1401, lr_0 = 9.9620e-04
Validation binary_cross_entropy = 0.098853
Epoch 5639
Loss = 2.6630e-01, PNorm = 628.9732, GNorm = 5.1113, lr_0 = 9.9620e-04
Loss = 7.7922e-02, PNorm = 629.0753, GNorm = 3.9782, lr_0 = 9.9620e-04
Validation binary_cross_entropy = 0.121584
Epoch 5640
Loss = 1.2742e-02, PNorm = 629.1644, GNorm = 0.1795, lr_0 = 9.9620e-04
Validation binary_cross_entropy = 0.107465
Epoch 5641
Loss = 1.4761e-02, PNorm = 629.2242, GNorm = 2.3141, lr_0 = 9.9619e-04
Validation binary_cross_entropy = 0.114328
Epoch 5642
Loss = 6.5913e-03, PNorm = 629.2797, GNorm = 0.0367, lr_0 = 9.9619e-04
Validation binary_cross_entropy = 0.136326
Epoch 5643
Loss = 2.0687e-02, PNorm = 629.3109, GNorm = 0.0561, lr_0 = 9.9619e-04
Validation binary_cross_entropy = 0.133736
Epoch 5644
Loss = 4.2674e-02, PNorm = 629.3430, GNorm = 0.1250, lr_0 = 9.9619e-04
Validation binary_cross_entropy = 0.118625
Epoch 5645
Loss = 9.9465e-02, PNorm = 629.4069, GNorm = 2.0811, lr_0 = 9.9619e-04
Validation binary_cross_entropy = 0.102024
Epoch 5646
Loss = 1.7417e-02, PNorm = 629.4618, GNorm = 1.1262, lr_0 = 9.9619e-04
Validation binary_cross_entropy = 0.090809
Epoch 5647
Loss = 1.8081e-02, PNorm = 629.5111, GNorm = 0.6472, lr_0 = 9.9619e-04
Validation binary_cross_entropy = 0.101181
Epoch 5648
Loss = 1.9699e-03, PNorm = 629.5558, GNorm = 0.0810, lr_0 = 9.9619e-04
Validation binary_cross_entropy = 0.101361
Epoch 5649
Loss = 9.8085e-02, PNorm = 629.5876, GNorm = 2.0037, lr_0 = 9.9619e-04
Loss = 8.3424e-02, PNorm = 629.6303, GNorm = 0.1405, lr_0 = 9.9619e-04
Validation binary_cross_entropy = 0.088921
Epoch 5650
Loss = 2.8568e-02, PNorm = 629.6791, GNorm = 2.8159, lr_0 = 9.9619e-04
Validation binary_cross_entropy = 0.084983
Epoch 5651
Loss = 2.0897e-02, PNorm = 629.7226, GNorm = 2.9379, lr_0 = 9.9619e-04
Validation binary_cross_entropy = 0.104897
Epoch 5652
Loss = 4.9520e-02, PNorm = 629.7554, GNorm = 4.0631, lr_0 = 9.9619e-04
Validation binary_cross_entropy = 0.107867
Epoch 5653
Loss = 5.8461e-02, PNorm = 629.8019, GNorm = 2.7436, lr_0 = 9.9619e-04
Validation binary_cross_entropy = 0.111564
Epoch 5654
Loss = 1.2546e-02, PNorm = 629.8410, GNorm = 1.3938, lr_0 = 9.9619e-04
Validation binary_cross_entropy = 0.099800
Epoch 5655
Loss = 1.4813e-02, PNorm = 629.8822, GNorm = 0.4449, lr_0 = 9.9618e-04
Validation binary_cross_entropy = 0.100309
Epoch 5656
Loss = 3.2404e-02, PNorm = 629.9324, GNorm = 0.1238, lr_0 = 9.9618e-04
Validation binary_cross_entropy = 0.085275
Epoch 5657
Loss = 1.3648e-02, PNorm = 629.9870, GNorm = 0.0810, lr_0 = 9.9618e-04
Validation binary_cross_entropy = 0.081559
Epoch 5658
Loss = 6.3819e-02, PNorm = 630.0531, GNorm = 1.2136, lr_0 = 9.9618e-04
Validation binary_cross_entropy = 0.092254
Epoch 5659
Loss = 2.0718e-03, PNorm = 630.1213, GNorm = 0.0785, lr_0 = 9.9618e-04
Loss = 2.0586e-02, PNorm = 630.1657, GNorm = 0.0538, lr_0 = 9.9618e-04
Validation binary_cross_entropy = 0.101168
Epoch 5660
Loss = 5.2326e-02, PNorm = 630.1937, GNorm = 0.5077, lr_0 = 9.9618e-04
Validation binary_cross_entropy = 0.094653
Epoch 5661
Loss = 2.0923e-02, PNorm = 630.2331, GNorm = 0.7702, lr_0 = 9.9618e-04
Validation binary_cross_entropy = 0.085903
Epoch 5662
Loss = 3.4842e-02, PNorm = 630.2737, GNorm = 0.2391, lr_0 = 9.9618e-04
Validation binary_cross_entropy = 0.075910
Epoch 5663
Loss = 9.4654e-03, PNorm = 630.3112, GNorm = 0.0361, lr_0 = 9.9618e-04
Validation binary_cross_entropy = 0.081272
Epoch 5664
Loss = 3.1352e-02, PNorm = 630.3568, GNorm = 1.4237, lr_0 = 9.9618e-04
Validation binary_cross_entropy = 0.087954
Epoch 5665
Loss = 5.9231e-03, PNorm = 630.3985, GNorm = 0.5919, lr_0 = 9.9618e-04
Validation binary_cross_entropy = 0.123338
Epoch 5666
Loss = 9.0111e-02, PNorm = 630.4495, GNorm = 5.9495, lr_0 = 9.9618e-04
Validation binary_cross_entropy = 0.116852
Epoch 5667
Loss = 2.5453e-02, PNorm = 630.4895, GNorm = 1.4760, lr_0 = 9.9618e-04
Validation binary_cross_entropy = 0.105170
Epoch 5668
Loss = 5.9980e-02, PNorm = 630.5436, GNorm = 1.6317, lr_0 = 9.9618e-04
Validation binary_cross_entropy = 0.111000
Epoch 5669
Loss = 5.0919e-01, PNorm = 630.6053, GNorm = 7.7226, lr_0 = 9.9617e-04
Loss = 6.0484e-02, PNorm = 630.6703, GNorm = 6.5849, lr_0 = 9.9617e-04
Validation binary_cross_entropy = 0.091338
Epoch 5670
Loss = 4.0387e-02, PNorm = 630.7424, GNorm = 1.3682, lr_0 = 9.9617e-04
Validation binary_cross_entropy = 0.101032
Epoch 5671
Loss = 2.0288e-02, PNorm = 630.8103, GNorm = 0.5614, lr_0 = 9.9617e-04
Validation binary_cross_entropy = 0.098920
Epoch 5672
Loss = 5.8250e-02, PNorm = 630.8747, GNorm = 1.8172, lr_0 = 9.9617e-04
Validation binary_cross_entropy = 0.090150
Epoch 5673
Loss = 4.6788e-02, PNorm = 630.9460, GNorm = 0.9845, lr_0 = 9.9617e-04
Validation binary_cross_entropy = 0.091903
Epoch 5674
Loss = 3.2004e-02, PNorm = 631.0180, GNorm = 0.6167, lr_0 = 9.9617e-04
Validation binary_cross_entropy = 0.086218
Epoch 5675
Loss = 3.3832e-02, PNorm = 631.0749, GNorm = 0.5415, lr_0 = 9.9617e-04
Validation binary_cross_entropy = 0.071779
Epoch 5676
Loss = 2.7354e-02, PNorm = 631.1419, GNorm = 0.2444, lr_0 = 9.9617e-04
Validation binary_cross_entropy = 0.074962
Epoch 5677
Loss = 1.8651e-02, PNorm = 631.2141, GNorm = 1.1925, lr_0 = 9.9617e-04
Validation binary_cross_entropy = 0.081606
Epoch 5678
Loss = 6.3438e-03, PNorm = 631.2679, GNorm = 0.4699, lr_0 = 9.9617e-04
Validation binary_cross_entropy = 0.082796
Epoch 5679
Loss = 3.5919e-03, PNorm = 631.3060, GNorm = 0.2094, lr_0 = 9.9617e-04
Loss = 2.2075e-02, PNorm = 631.3429, GNorm = 0.0589, lr_0 = 9.9617e-04
Validation binary_cross_entropy = 0.089378
Epoch 5680
Loss = 1.9555e-02, PNorm = 631.3700, GNorm = 1.7118, lr_0 = 9.9617e-04
Validation binary_cross_entropy = 0.089178
Epoch 5681
Loss = 2.4523e-02, PNorm = 631.4005, GNorm = 0.5709, lr_0 = 9.9616e-04
Validation binary_cross_entropy = 0.093683
Epoch 5682
Loss = 3.0959e-02, PNorm = 631.4441, GNorm = 1.7793, lr_0 = 9.9616e-04
Validation binary_cross_entropy = 0.095073
Epoch 5683
Loss = 1.0802e-02, PNorm = 631.4805, GNorm = 2.8385, lr_0 = 9.9616e-04
Validation binary_cross_entropy = 0.094091
Epoch 5684
Loss = 4.3293e-03, PNorm = 631.5119, GNorm = 0.0421, lr_0 = 9.9616e-04
Validation binary_cross_entropy = 0.095012
Epoch 5685
Loss = 1.7512e-02, PNorm = 631.5334, GNorm = 1.0696, lr_0 = 9.9616e-04
Validation binary_cross_entropy = 0.088352
Epoch 5686
Loss = 1.4339e-02, PNorm = 631.5593, GNorm = 0.1596, lr_0 = 9.9616e-04
Validation binary_cross_entropy = 0.088009
Epoch 5687
Loss = 5.2253e-02, PNorm = 631.5931, GNorm = 0.0468, lr_0 = 9.9616e-04
Validation binary_cross_entropy = 0.096652
Epoch 5688
Loss = 5.5667e-02, PNorm = 631.6384, GNorm = 1.6983, lr_0 = 9.9616e-04
Validation binary_cross_entropy = 0.091109
Epoch 5689
Loss = 2.8954e-03, PNorm = 631.6709, GNorm = 0.2750, lr_0 = 9.9616e-04
Loss = 4.1138e-02, PNorm = 631.7095, GNorm = 0.3801, lr_0 = 9.9616e-04
Validation binary_cross_entropy = 0.085886
Epoch 5690
Loss = 1.9264e-02, PNorm = 631.7565, GNorm = 1.4913, lr_0 = 9.9616e-04
Validation binary_cross_entropy = 0.080986
Epoch 5691
Loss = 2.0373e-02, PNorm = 631.8058, GNorm = 3.3802, lr_0 = 9.9616e-04
Validation binary_cross_entropy = 0.083043
Epoch 5692
Loss = 2.4732e-02, PNorm = 631.8466, GNorm = 1.6849, lr_0 = 9.9616e-04
Validation binary_cross_entropy = 0.085305
Epoch 5693
Loss = 1.5526e-02, PNorm = 631.8941, GNorm = 0.7427, lr_0 = 9.9616e-04
Validation binary_cross_entropy = 0.129736
Epoch 5694
Loss = 2.4239e-02, PNorm = 631.9394, GNorm = 1.6989, lr_0 = 9.9616e-04
Validation binary_cross_entropy = 0.136096
Epoch 5695
Loss = 1.3585e-02, PNorm = 631.9772, GNorm = 0.4396, lr_0 = 9.9615e-04
Validation binary_cross_entropy = 0.127659
Epoch 5696
Loss = 5.9171e-03, PNorm = 632.0084, GNorm = 0.0221, lr_0 = 9.9615e-04
Validation binary_cross_entropy = 0.108333
Epoch 5697
Loss = 9.3935e-02, PNorm = 632.0316, GNorm = 3.8351, lr_0 = 9.9615e-04
Validation binary_cross_entropy = 0.097212
Epoch 5698
Loss = 5.0184e-02, PNorm = 632.0938, GNorm = 0.2766, lr_0 = 9.9615e-04
Validation binary_cross_entropy = 0.086077
Epoch 5699
Loss = 7.9032e-02, PNorm = 632.1499, GNorm = 1.4751, lr_0 = 9.9615e-04
Loss = 2.0928e-02, PNorm = 632.2009, GNorm = 0.3025, lr_0 = 9.9615e-04
Validation binary_cross_entropy = 0.081689
Epoch 5700
Loss = 3.2510e-02, PNorm = 632.2559, GNorm = 0.2594, lr_0 = 9.9615e-04
Validation binary_cross_entropy = 0.093985
Epoch 5701
Loss = 1.3940e-02, PNorm = 632.3039, GNorm = 0.6238, lr_0 = 9.9615e-04
Validation binary_cross_entropy = 0.113082
Epoch 5702
Loss = 1.5534e-02, PNorm = 632.3390, GNorm = 0.0526, lr_0 = 9.9615e-04
Validation binary_cross_entropy = 0.115392
Epoch 5703
Loss = 4.7760e-02, PNorm = 632.3755, GNorm = 0.0677, lr_0 = 9.9615e-04
Validation binary_cross_entropy = 0.100587
Epoch 5704
Loss = 6.2869e-03, PNorm = 632.4266, GNorm = 0.0326, lr_0 = 9.9615e-04
Validation binary_cross_entropy = 0.101697
Epoch 5705
Loss = 3.5315e-02, PNorm = 632.4614, GNorm = 2.0654, lr_0 = 9.9615e-04
Validation binary_cross_entropy = 0.083467
Epoch 5706
Loss = 7.4801e-02, PNorm = 632.4842, GNorm = 0.2309, lr_0 = 9.9615e-04
Validation binary_cross_entropy = 0.061811
Epoch 5707
Loss = 3.1771e-02, PNorm = 632.5519, GNorm = 0.7693, lr_0 = 9.9615e-04
Validation binary_cross_entropy = 0.068142
Epoch 5708
Loss = 1.8054e-02, PNorm = 632.6357, GNorm = 0.1902, lr_0 = 9.9614e-04
Validation binary_cross_entropy = 0.070427
Epoch 5709
Loss = 1.3641e-03, PNorm = 632.6862, GNorm = 0.0464, lr_0 = 9.9614e-04
Loss = 2.0712e-02, PNorm = 632.7328, GNorm = 3.2496, lr_0 = 9.9614e-04
Validation binary_cross_entropy = 0.067085
Epoch 5710
Loss = 4.6609e-02, PNorm = 632.7865, GNorm = 1.1937, lr_0 = 9.9614e-04
Validation binary_cross_entropy = 0.057004
Epoch 5711
Loss = 3.9709e-02, PNorm = 632.8402, GNorm = 0.1909, lr_0 = 9.9614e-04
Validation binary_cross_entropy = 0.059288
Epoch 5712
Loss = 1.8579e-02, PNorm = 632.8853, GNorm = 0.2575, lr_0 = 9.9614e-04
Validation binary_cross_entropy = 0.060863
Epoch 5713
Loss = 2.5699e-02, PNorm = 632.9177, GNorm = 2.6441, lr_0 = 9.9614e-04
Validation binary_cross_entropy = 0.062451
Epoch 5714
Loss = 1.6406e-02, PNorm = 632.9516, GNorm = 0.1416, lr_0 = 9.9614e-04
Validation binary_cross_entropy = 0.070463
Epoch 5715
Loss = 2.6326e-02, PNorm = 632.9952, GNorm = 2.7789, lr_0 = 9.9614e-04
Validation binary_cross_entropy = 0.072157
Epoch 5716
Loss = 6.3987e-03, PNorm = 633.0380, GNorm = 1.0092, lr_0 = 9.9614e-04
Validation binary_cross_entropy = 0.089448
Epoch 5717
Loss = 3.2863e-02, PNorm = 633.0737, GNorm = 3.2267, lr_0 = 9.9614e-04
Validation binary_cross_entropy = 0.097037
Epoch 5718
Loss = 4.2084e-03, PNorm = 633.1020, GNorm = 0.5021, lr_0 = 9.9614e-04
Validation binary_cross_entropy = 0.077851
Epoch 5719
Loss = 6.9458e-04, PNorm = 633.1224, GNorm = 0.0412, lr_0 = 9.9614e-04
Loss = 1.0076e-02, PNorm = 633.1647, GNorm = 0.1209, lr_0 = 9.9614e-04
Validation binary_cross_entropy = 0.081468
Epoch 5720
Loss = 4.6878e-02, PNorm = 633.2019, GNorm = 0.9680, lr_0 = 9.9614e-04
Validation binary_cross_entropy = 0.078596
Epoch 5721
Loss = 2.0217e-01, PNorm = 633.2456, GNorm = 59.1518, lr_0 = 9.9613e-04
Validation binary_cross_entropy = 0.052828
Epoch 5722
Loss = 1.4636e-01, PNorm = 633.3702, GNorm = 4.0958, lr_0 = 9.9613e-04
Validation binary_cross_entropy = 0.126961
Epoch 5723
Loss = 8.6477e-02, PNorm = 633.4925, GNorm = 5.3685, lr_0 = 9.9613e-04
Validation binary_cross_entropy = 0.064323
Epoch 5724
Loss = 8.5950e-02, PNorm = 633.5929, GNorm = 0.8259, lr_0 = 9.9613e-04
Validation binary_cross_entropy = 0.077815
Epoch 5725
Loss = 3.3014e-02, PNorm = 633.6623, GNorm = 0.3682, lr_0 = 9.9613e-04
Validation binary_cross_entropy = 0.070101
Epoch 5726
Loss = 3.4592e-02, PNorm = 633.7175, GNorm = 3.7435, lr_0 = 9.9613e-04
Validation binary_cross_entropy = 0.059679
Epoch 5727
Loss = 8.8733e-02, PNorm = 633.7937, GNorm = 4.3711, lr_0 = 9.9613e-04
Validation binary_cross_entropy = 0.070055
Epoch 5728
Loss = 3.3974e-02, PNorm = 633.8592, GNorm = 1.4608, lr_0 = 9.9613e-04
Validation binary_cross_entropy = 0.063506
Epoch 5729
Loss = 8.5889e-03, PNorm = 633.9164, GNorm = 0.3776, lr_0 = 9.9613e-04
Loss = 3.5132e-02, PNorm = 633.9761, GNorm = 2.5900, lr_0 = 9.9613e-04
Validation binary_cross_entropy = 0.069379
Epoch 5730
Loss = 7.0088e-02, PNorm = 634.0383, GNorm = 1.7433, lr_0 = 9.9613e-04
Validation binary_cross_entropy = 0.070307
Epoch 5731
Loss = 6.4726e-02, PNorm = 634.1101, GNorm = 2.0482, lr_0 = 9.9613e-04
Validation binary_cross_entropy = 0.084158
Epoch 5732
Loss = 2.4044e-02, PNorm = 634.1722, GNorm = 0.4457, lr_0 = 9.9613e-04
Validation binary_cross_entropy = 0.077184
Epoch 5733
Loss = 1.1206e-02, PNorm = 634.2107, GNorm = 0.7948, lr_0 = 9.9613e-04
Validation binary_cross_entropy = 0.075271
Epoch 5734
Loss = 3.7484e-02, PNorm = 634.2475, GNorm = 0.0569, lr_0 = 9.9612e-04
Validation binary_cross_entropy = 0.071111
Epoch 5735
Loss = 4.9626e-03, PNorm = 634.2894, GNorm = 0.0930, lr_0 = 9.9612e-04
Validation binary_cross_entropy = 0.082719
Epoch 5736
Loss = 4.8313e-02, PNorm = 634.3246, GNorm = 3.6420, lr_0 = 9.9612e-04
Validation binary_cross_entropy = 0.093322
Epoch 5737
Loss = 6.7257e-03, PNorm = 634.3934, GNorm = 1.3873, lr_0 = 9.9612e-04
Validation binary_cross_entropy = 0.103083
Epoch 5738
Loss = 2.3638e-02, PNorm = 634.4385, GNorm = 1.3025, lr_0 = 9.9612e-04
Validation binary_cross_entropy = 0.096092
Epoch 5739
Loss = 3.5070e-03, PNorm = 634.4761, GNorm = 0.8310, lr_0 = 9.9612e-04
Loss = 3.3463e-01, PNorm = 634.5228, GNorm = 60.0566, lr_0 = 9.9612e-04
Validation binary_cross_entropy = 0.089847
Epoch 5740
Loss = 4.0980e-02, PNorm = 634.6429, GNorm = 1.4705, lr_0 = 9.9612e-04
Validation binary_cross_entropy = 0.106282
Epoch 5741
Loss = 1.4200e-01, PNorm = 634.7472, GNorm = 1.8739, lr_0 = 9.9612e-04
Validation binary_cross_entropy = 0.136085
Epoch 5742
Loss = 9.2872e-02, PNorm = 634.8278, GNorm = 3.4288, lr_0 = 9.9612e-04
Validation binary_cross_entropy = 0.064249
Epoch 5743
Loss = 8.9526e-02, PNorm = 634.9064, GNorm = 0.4800, lr_0 = 9.9612e-04
Validation binary_cross_entropy = 0.076783
Epoch 5744
Loss = 2.6219e-02, PNorm = 634.9876, GNorm = 0.5737, lr_0 = 9.9612e-04
Validation binary_cross_entropy = 0.070209
Epoch 5745
Loss = 7.8186e-02, PNorm = 635.0551, GNorm = 0.9271, lr_0 = 9.9612e-04
Validation binary_cross_entropy = 0.074775
Epoch 5746
Loss = 2.2981e-02, PNorm = 635.1151, GNorm = 0.1196, lr_0 = 9.9612e-04
Validation binary_cross_entropy = 0.071064
Epoch 5747
Loss = 1.5427e-02, PNorm = 635.1661, GNorm = 0.8075, lr_0 = 9.9612e-04
Validation binary_cross_entropy = 0.086233
Epoch 5748
Loss = 2.7513e-02, PNorm = 635.2216, GNorm = 0.6808, lr_0 = 9.9611e-04
Validation binary_cross_entropy = 0.098013
Epoch 5749
Loss = 4.3653e-02, PNorm = 635.2612, GNorm = 0.8694, lr_0 = 9.9611e-04
Loss = 7.3028e-02, PNorm = 635.3099, GNorm = 12.4336, lr_0 = 9.9611e-04
Validation binary_cross_entropy = 0.080250
Epoch 5750
Loss = 3.7536e-02, PNorm = 635.3720, GNorm = 1.2281, lr_0 = 9.9611e-04
Validation binary_cross_entropy = 0.102949
Epoch 5751
Loss = 4.6520e-02, PNorm = 635.4164, GNorm = 1.0056, lr_0 = 9.9611e-04
Validation binary_cross_entropy = 0.090023
Epoch 5752
Loss = 2.0239e-02, PNorm = 635.4611, GNorm = 3.6404, lr_0 = 9.9611e-04
Validation binary_cross_entropy = 0.096856
Epoch 5753
Loss = 3.9635e-02, PNorm = 635.5041, GNorm = 4.4267, lr_0 = 9.9611e-04
Validation binary_cross_entropy = 0.101123
Epoch 5754
Loss = 3.2467e-02, PNorm = 635.5384, GNorm = 0.0292, lr_0 = 9.9611e-04
Validation binary_cross_entropy = 0.082093
Epoch 5755
Loss = 2.3729e-02, PNorm = 635.5867, GNorm = 4.4344, lr_0 = 9.9611e-04
Validation binary_cross_entropy = 0.092513
Epoch 5756
Loss = 2.8558e-02, PNorm = 635.6340, GNorm = 1.8460, lr_0 = 9.9611e-04
Validation binary_cross_entropy = 0.091980
Epoch 5757
Loss = 5.9766e-02, PNorm = 635.6778, GNorm = 5.7627, lr_0 = 9.9611e-04
Validation binary_cross_entropy = 0.107705
Epoch 5758
Loss = 9.0115e-02, PNorm = 635.7411, GNorm = 4.1801, lr_0 = 9.9611e-04
Validation binary_cross_entropy = 0.089764
Epoch 5759
Loss = 5.0655e-04, PNorm = 635.7844, GNorm = 0.0197, lr_0 = 9.9611e-04
Loss = 6.0752e-02, PNorm = 635.8447, GNorm = 0.2000, lr_0 = 9.9611e-04
Validation binary_cross_entropy = 0.076052
Epoch 5760
Loss = 3.5833e-02, PNorm = 635.9187, GNorm = 1.5122, lr_0 = 9.9611e-04
Validation binary_cross_entropy = 0.100482
Epoch 5761
Loss = 5.5327e-02, PNorm = 636.0002, GNorm = 10.8549, lr_0 = 9.9610e-04
Validation binary_cross_entropy = 0.095272
Epoch 5762
Loss = 4.8463e-02, PNorm = 636.1049, GNorm = 1.2767, lr_0 = 9.9610e-04
Validation binary_cross_entropy = 0.104744
Epoch 5763
Loss = 3.5974e-02, PNorm = 636.1687, GNorm = 0.1301, lr_0 = 9.9610e-04
Validation binary_cross_entropy = 0.091160
Epoch 5764
Loss = 5.0176e-02, PNorm = 636.2287, GNorm = 0.4843, lr_0 = 9.9610e-04
Validation binary_cross_entropy = 0.092901
Epoch 5765
Loss = 2.4476e-02, PNorm = 636.2934, GNorm = 0.5713, lr_0 = 9.9610e-04
Validation binary_cross_entropy = 0.080286
Epoch 5766
Loss = 6.1092e-02, PNorm = 636.3688, GNorm = 0.8607, lr_0 = 9.9610e-04
Validation binary_cross_entropy = 0.084032
Epoch 5767
Loss = 2.9906e-02, PNorm = 636.4320, GNorm = 0.8061, lr_0 = 9.9610e-04
Validation binary_cross_entropy = 0.075812
Epoch 5768
Loss = 1.0211e-02, PNorm = 636.4853, GNorm = 0.7956, lr_0 = 9.9610e-04
Validation binary_cross_entropy = 0.084442
Epoch 5769
Loss = 2.1329e-02, PNorm = 636.5316, GNorm = 0.8842, lr_0 = 9.9610e-04
Loss = 2.6137e-02, PNorm = 636.5669, GNorm = 1.1334, lr_0 = 9.9610e-04
Validation binary_cross_entropy = 0.083532
Epoch 5770
Loss = 6.0656e-03, PNorm = 636.6010, GNorm = 0.0070, lr_0 = 9.9610e-04
Validation binary_cross_entropy = 0.091701
Epoch 5771
Loss = 1.1227e-02, PNorm = 636.6265, GNorm = 0.0665, lr_0 = 9.9610e-04
Validation binary_cross_entropy = 0.093177
Epoch 5772
Loss = 4.0443e-03, PNorm = 636.6410, GNorm = 0.0316, lr_0 = 9.9610e-04
Validation binary_cross_entropy = 0.091806
Epoch 5773
Loss = 7.1366e-02, PNorm = 636.6866, GNorm = 0.8617, lr_0 = 9.9610e-04
Validation binary_cross_entropy = 0.118390
Epoch 5774
Loss = 1.1999e-01, PNorm = 636.7116, GNorm = 1.4228, lr_0 = 9.9609e-04
Validation binary_cross_entropy = 0.060384
Epoch 5775
Loss = 3.2242e-02, PNorm = 636.7461, GNorm = 0.5173, lr_0 = 9.9609e-04
Validation binary_cross_entropy = 0.065711
Epoch 5776
Loss = 1.5682e-02, PNorm = 636.8187, GNorm = 0.3805, lr_0 = 9.9609e-04
Validation binary_cross_entropy = 0.080264
Epoch 5777
Loss = 1.0122e-02, PNorm = 636.8861, GNorm = 0.2951, lr_0 = 9.9609e-04
Validation binary_cross_entropy = 0.091615
Epoch 5778
Loss = 6.0712e-03, PNorm = 636.9757, GNorm = 0.3253, lr_0 = 9.9609e-04
Validation binary_cross_entropy = 0.081352
Epoch 5779
Loss = 5.0702e-03, PNorm = 637.0476, GNorm = 0.1628, lr_0 = 9.9609e-04
Loss = 1.7276e-02, PNorm = 637.1054, GNorm = 0.6101, lr_0 = 9.9609e-04
Validation binary_cross_entropy = 0.086927
Epoch 5780
Loss = 2.2570e-02, PNorm = 637.1570, GNorm = 0.0454, lr_0 = 9.9609e-04
Validation binary_cross_entropy = 0.090082
Epoch 5781
Loss = 3.4953e-02, PNorm = 637.2182, GNorm = 1.4610, lr_0 = 9.9609e-04
Validation binary_cross_entropy = 0.089248
Epoch 5782
Loss = 6.2140e-02, PNorm = 637.2819, GNorm = 2.0449, lr_0 = 9.9609e-04
Validation binary_cross_entropy = 0.095739
Epoch 5783
Loss = 4.5321e-02, PNorm = 637.3440, GNorm = 0.0808, lr_0 = 9.9609e-04
Validation binary_cross_entropy = 0.086061
Epoch 5784
Loss = 2.7780e-02, PNorm = 637.3937, GNorm = 1.5636, lr_0 = 9.9609e-04
Validation binary_cross_entropy = 0.102727
Epoch 5785
Loss = 3.4998e-02, PNorm = 637.4417, GNorm = 0.4963, lr_0 = 9.9609e-04
Validation binary_cross_entropy = 0.132318
Epoch 5786
Loss = 7.3479e-03, PNorm = 637.4808, GNorm = 0.1198, lr_0 = 9.9609e-04
Validation binary_cross_entropy = 0.121424
Epoch 5787
Loss = 1.3331e-01, PNorm = 637.5089, GNorm = 1.2081, lr_0 = 9.9609e-04
Validation binary_cross_entropy = 0.101052
Epoch 5788
Loss = 2.8104e-03, PNorm = 637.5437, GNorm = 0.0333, lr_0 = 9.9608e-04
Validation binary_cross_entropy = 0.093570
Epoch 5789
Loss = 1.0535e-03, PNorm = 637.5817, GNorm = 0.0445, lr_0 = 9.9608e-04
Loss = 3.7612e-02, PNorm = 637.6292, GNorm = 1.3540, lr_0 = 9.9608e-04
Validation binary_cross_entropy = 0.081608
Epoch 5790
Loss = 4.9472e-02, PNorm = 637.6835, GNorm = 0.7867, lr_0 = 9.9608e-04
Validation binary_cross_entropy = 0.078215
Epoch 5791
Loss = 3.3217e-02, PNorm = 637.7193, GNorm = 0.7485, lr_0 = 9.9608e-04
Validation binary_cross_entropy = 0.072161
Epoch 5792
Loss = 2.2278e-02, PNorm = 637.7603, GNorm = 0.1387, lr_0 = 9.9608e-04
Validation binary_cross_entropy = 0.074387
Epoch 5793
Loss = 4.2056e-02, PNorm = 637.8069, GNorm = 1.7721, lr_0 = 9.9608e-04
Validation binary_cross_entropy = 0.083057
Epoch 5794
Loss = 9.4896e-02, PNorm = 637.8639, GNorm = 0.0210, lr_0 = 9.9608e-04
Validation binary_cross_entropy = 0.120618
Epoch 5795
Loss = 8.4373e-03, PNorm = 637.9240, GNorm = 1.0541, lr_0 = 9.9608e-04
Validation binary_cross_entropy = 0.170126
Epoch 5796
Loss = 4.4928e-02, PNorm = 637.9709, GNorm = 2.2579, lr_0 = 9.9608e-04
Validation binary_cross_entropy = 0.175035
Epoch 5797
Loss = 1.8814e-02, PNorm = 638.0255, GNorm = 1.0522, lr_0 = 9.9608e-04
Validation binary_cross_entropy = 0.257020
Epoch 5798
Loss = 1.6638e-03, PNorm = 638.0793, GNorm = 0.0753, lr_0 = 9.9608e-04
Validation binary_cross_entropy = 0.263427
Epoch 5799
Loss = 4.9275e-04, PNorm = 638.1294, GNorm = 0.0283, lr_0 = 9.9608e-04
Loss = 5.8910e-02, PNorm = 638.1944, GNorm = 10.9253, lr_0 = 9.9608e-04
Validation binary_cross_entropy = 0.212695
Epoch 5800
Loss = 5.8222e-02, PNorm = 638.2725, GNorm = 1.2147, lr_0 = 9.9607e-04
Validation binary_cross_entropy = 0.073613
Epoch 5801
Loss = 6.7641e-02, PNorm = 638.3593, GNorm = 2.1828, lr_0 = 9.9607e-04
Validation binary_cross_entropy = 0.066159
Epoch 5802
Loss = 2.6912e-01, PNorm = 638.5879, GNorm = 8.4903, lr_0 = 9.9607e-04
Validation binary_cross_entropy = 0.217234
Epoch 5803
Loss = 1.6722e-01, PNorm = 638.8318, GNorm = 1.9020, lr_0 = 9.9607e-04
Validation binary_cross_entropy = 0.100170
Epoch 5804
Loss = 1.2389e-01, PNorm = 638.9862, GNorm = 1.5903, lr_0 = 9.9607e-04
Validation binary_cross_entropy = 0.133813
Epoch 5805
Loss = 6.1410e-02, PNorm = 639.1012, GNorm = 1.5482, lr_0 = 9.9607e-04
Validation binary_cross_entropy = 0.102268
Epoch 5806
Loss = 4.3738e-02, PNorm = 639.1813, GNorm = 2.4686, lr_0 = 9.9607e-04
Validation binary_cross_entropy = 0.096173
Epoch 5807
Loss = 6.5806e-02, PNorm = 639.2562, GNorm = 2.3618, lr_0 = 9.9607e-04
Validation binary_cross_entropy = 0.103279
Epoch 5808
Loss = 5.5278e-02, PNorm = 639.3168, GNorm = 1.7762, lr_0 = 9.9607e-04
Validation binary_cross_entropy = 0.092001
Epoch 5809
Loss = 2.1506e-01, PNorm = 639.3563, GNorm = 3.1832, lr_0 = 9.9607e-04
Loss = 2.8969e-02, PNorm = 639.3993, GNorm = 2.7631, lr_0 = 9.9607e-04
Validation binary_cross_entropy = 0.088804
Epoch 5810
Loss = 3.5093e-02, PNorm = 639.4471, GNorm = 0.0177, lr_0 = 9.9607e-04
Validation binary_cross_entropy = 0.125605
Epoch 5811
Loss = 3.6086e-02, PNorm = 639.4761, GNorm = 2.5165, lr_0 = 9.9607e-04
Validation binary_cross_entropy = 0.088365
Epoch 5812
Loss = 1.1328e-02, PNorm = 639.5174, GNorm = 0.6162, lr_0 = 9.9607e-04
Validation binary_cross_entropy = 0.108388
Epoch 5813
Loss = 5.4930e-02, PNorm = 639.5412, GNorm = 0.6653, lr_0 = 9.9607e-04
Validation binary_cross_entropy = 0.102315
Epoch 5814
Loss = 9.1242e-03, PNorm = 639.5702, GNorm = 0.0355, lr_0 = 9.9606e-04
Validation binary_cross_entropy = 0.117023
Epoch 5815
Loss = 1.6612e-02, PNorm = 639.5959, GNorm = 0.9864, lr_0 = 9.9606e-04
Validation binary_cross_entropy = 0.128567
Epoch 5816
Loss = 2.0500e-02, PNorm = 639.6251, GNorm = 1.7430, lr_0 = 9.9606e-04
Validation binary_cross_entropy = 0.122398
Epoch 5817
Loss = 9.7272e-02, PNorm = 639.6608, GNorm = 4.5729, lr_0 = 9.9606e-04
Validation binary_cross_entropy = 0.101371
Epoch 5818
Loss = 4.0410e-02, PNorm = 639.7391, GNorm = 1.7136, lr_0 = 9.9606e-04
Validation binary_cross_entropy = 0.084651
Epoch 5819
Loss = 1.3209e-02, PNorm = 639.8072, GNorm = 0.4069, lr_0 = 9.9606e-04
Loss = 4.3431e-02, PNorm = 639.8822, GNorm = 0.6451, lr_0 = 9.9606e-04
Validation binary_cross_entropy = 0.114225
Epoch 5820
Loss = 2.9207e-02, PNorm = 639.9376, GNorm = 0.1884, lr_0 = 9.9606e-04
Validation binary_cross_entropy = 0.103119
Epoch 5821
Loss = 6.1305e-02, PNorm = 639.9737, GNorm = 0.4358, lr_0 = 9.9606e-04
Validation binary_cross_entropy = 0.086736
Epoch 5822
Loss = 3.0082e-02, PNorm = 640.0197, GNorm = 0.3714, lr_0 = 9.9606e-04
Validation binary_cross_entropy = 0.087250
Epoch 5823
Loss = 3.3892e-02, PNorm = 640.0667, GNorm = 0.7482, lr_0 = 9.9606e-04
Validation binary_cross_entropy = 0.089683
Epoch 5824
Loss = 5.8159e-02, PNorm = 640.0991, GNorm = 1.4054, lr_0 = 9.9606e-04
Validation binary_cross_entropy = 0.086255
Epoch 5825
Loss = 1.9067e-02, PNorm = 640.1230, GNorm = 2.2127, lr_0 = 9.9606e-04
Validation binary_cross_entropy = 0.088220
Epoch 5826
Loss = 1.8438e-02, PNorm = 640.1482, GNorm = 0.8151, lr_0 = 9.9606e-04
Validation binary_cross_entropy = 0.086060
Epoch 5827
Loss = 3.1751e-02, PNorm = 640.1880, GNorm = 0.7215, lr_0 = 9.9606e-04
Validation binary_cross_entropy = 0.085158
Epoch 5828
Loss = 1.6478e-02, PNorm = 640.2735, GNorm = 1.0450, lr_0 = 9.9605e-04
Validation binary_cross_entropy = 0.130812
Epoch 5829
Loss = 8.1561e-02, PNorm = 640.3451, GNorm = 2.9164, lr_0 = 9.9605e-04
Loss = 6.4876e-02, PNorm = 640.3839, GNorm = 1.6915, lr_0 = 9.9605e-04
Validation binary_cross_entropy = 0.087508
Epoch 5830
Loss = 3.3575e-02, PNorm = 640.4276, GNorm = 1.2727, lr_0 = 9.9605e-04
Validation binary_cross_entropy = 0.091150
Epoch 5831
Loss = 8.4819e-02, PNorm = 640.4779, GNorm = 2.4134, lr_0 = 9.9605e-04
Validation binary_cross_entropy = 0.104339
Epoch 5832
Loss = 8.1487e-02, PNorm = 640.5234, GNorm = 2.0051, lr_0 = 9.9605e-04
Validation binary_cross_entropy = 0.101490
Epoch 5833
Loss = 4.4508e-02, PNorm = 640.5611, GNorm = 0.4684, lr_0 = 9.9605e-04
Validation binary_cross_entropy = 0.059728
Epoch 5834
Loss = 5.3172e-02, PNorm = 640.6112, GNorm = 2.4763, lr_0 = 9.9605e-04
Validation binary_cross_entropy = 0.074871
Epoch 5835
Loss = 3.8082e-02, PNorm = 640.6624, GNorm = 1.2359, lr_0 = 9.9605e-04
Validation binary_cross_entropy = 0.069860
Epoch 5836
Loss = 1.8808e-02, PNorm = 640.7202, GNorm = 2.5791, lr_0 = 9.9605e-04
Validation binary_cross_entropy = 0.110415
Epoch 5837
Loss = 5.4364e-03, PNorm = 640.7766, GNorm = 0.0280, lr_0 = 9.9605e-04
Validation binary_cross_entropy = 0.101904
Epoch 5838
Loss = 3.8957e-02, PNorm = 640.8091, GNorm = 1.6133, lr_0 = 9.9605e-04
Validation binary_cross_entropy = 0.080412
Epoch 5839
Loss = 1.1890e-02, PNorm = 640.8325, GNorm = 0.8298, lr_0 = 9.9605e-04
Loss = 5.0253e-02, PNorm = 640.8665, GNorm = 5.3452, lr_0 = 9.9605e-04
Validation binary_cross_entropy = 0.081085
Epoch 5840
Loss = 5.6414e-02, PNorm = 640.9014, GNorm = 1.1659, lr_0 = 9.9604e-04
Validation binary_cross_entropy = 0.080118
Epoch 5841
Loss = 2.0454e-02, PNorm = 640.9408, GNorm = 0.1968, lr_0 = 9.9604e-04
Validation binary_cross_entropy = 0.086145
Epoch 5842
Loss = 4.3040e-02, PNorm = 640.9651, GNorm = 0.1040, lr_0 = 9.9604e-04
Validation binary_cross_entropy = 0.079236
Epoch 5843
Loss = 8.8172e-03, PNorm = 640.9910, GNorm = 0.3105, lr_0 = 9.9604e-04
Validation binary_cross_entropy = 0.084089
Epoch 5844
Loss = 1.1261e-02, PNorm = 641.0176, GNorm = 1.0568, lr_0 = 9.9604e-04
Validation binary_cross_entropy = 0.094845
Epoch 5845
Loss = 5.6427e-02, PNorm = 641.0336, GNorm = 2.8696, lr_0 = 9.9604e-04
Validation binary_cross_entropy = 0.081965
Epoch 5846
Loss = 1.4904e-02, PNorm = 641.0560, GNorm = 0.4240, lr_0 = 9.9604e-04
Validation binary_cross_entropy = 0.084220
Epoch 5847
Loss = 8.3102e-03, PNorm = 641.1011, GNorm = 0.2866, lr_0 = 9.9604e-04
Validation binary_cross_entropy = 0.104054
Epoch 5848
Loss = 1.0041e-01, PNorm = 641.1441, GNorm = 2.9407, lr_0 = 9.9604e-04
Validation binary_cross_entropy = 0.081279
Epoch 5849
Loss = 3.0670e-02, PNorm = 641.1875, GNorm = 1.3588, lr_0 = 9.9604e-04
Loss = 2.1669e-02, PNorm = 641.2343, GNorm = 1.5883, lr_0 = 9.9604e-04
Validation binary_cross_entropy = 0.078513
Epoch 5850
Loss = 3.0815e-02, PNorm = 641.2811, GNorm = 1.5110, lr_0 = 9.9604e-04
Validation binary_cross_entropy = 0.081953
Epoch 5851
Loss = 1.9777e-02, PNorm = 641.3206, GNorm = 1.0606, lr_0 = 9.9604e-04
Validation binary_cross_entropy = 0.093611
Epoch 5852
Loss = 2.2230e-03, PNorm = 641.3483, GNorm = 0.0042, lr_0 = 9.9604e-04
Validation binary_cross_entropy = 0.104422
Epoch 5853
Loss = 9.6711e-03, PNorm = 641.3631, GNorm = 0.3673, lr_0 = 9.9604e-04
Validation binary_cross_entropy = 0.101659
Epoch 5854
Loss = 2.0621e-02, PNorm = 641.3748, GNorm = 3.8804, lr_0 = 9.9603e-04
Validation binary_cross_entropy = 0.090293
Epoch 5855
Loss = 1.2502e-01, PNorm = 641.4076, GNorm = 9.6604, lr_0 = 9.9603e-04
Validation binary_cross_entropy = 0.152618
Epoch 5856
Loss = 1.3422e-01, PNorm = 641.4917, GNorm = 2.6996, lr_0 = 9.9603e-04
Validation binary_cross_entropy = 0.061306
Epoch 5857
Loss = 9.3124e-02, PNorm = 641.5775, GNorm = 0.8904, lr_0 = 9.9603e-04
Validation binary_cross_entropy = 0.091138
Epoch 5858
Loss = 1.8264e-02, PNorm = 641.6702, GNorm = 1.2059, lr_0 = 9.9603e-04
Validation binary_cross_entropy = 0.080175
Epoch 5859
Loss = 5.6095e-03, PNorm = 641.7162, GNorm = 0.1664, lr_0 = 9.9603e-04
Loss = 4.0879e-02, PNorm = 641.7608, GNorm = 0.4765, lr_0 = 9.9603e-04
Validation binary_cross_entropy = 0.081255
Epoch 5860
Loss = 1.4819e-02, PNorm = 641.7989, GNorm = 0.2832, lr_0 = 9.9603e-04
Validation binary_cross_entropy = 0.086569
Epoch 5861
Loss = 4.7913e-02, PNorm = 641.8234, GNorm = 0.0464, lr_0 = 9.9603e-04
Validation binary_cross_entropy = 0.090572
Epoch 5862
Loss = 1.7774e-02, PNorm = 641.8518, GNorm = 3.0676, lr_0 = 9.9603e-04
Validation binary_cross_entropy = 0.156624
Epoch 5863
Loss = 1.2049e-01, PNorm = 641.9146, GNorm = 1.5676, lr_0 = 9.9603e-04
Validation binary_cross_entropy = 0.079484
Epoch 5864
Loss = 6.2130e-02, PNorm = 641.9784, GNorm = 0.4448, lr_0 = 9.9603e-04
Validation binary_cross_entropy = 0.077962
Epoch 5865
Loss = 3.8580e-02, PNorm = 642.0471, GNorm = 1.7985, lr_0 = 9.9603e-04
Validation binary_cross_entropy = 0.082015
Epoch 5866
Loss = 6.0558e-02, PNorm = 642.0953, GNorm = 1.1306, lr_0 = 9.9603e-04
Validation binary_cross_entropy = 0.073597
Epoch 5867
Loss = 2.8411e-02, PNorm = 642.1404, GNorm = 0.0991, lr_0 = 9.9602e-04
Validation binary_cross_entropy = 0.091912
Epoch 5868
Loss = 3.0074e-02, PNorm = 642.1824, GNorm = 0.3318, lr_0 = 9.9602e-04
Validation binary_cross_entropy = 0.111166
Epoch 5869
Loss = 1.4583e-01, PNorm = 642.2245, GNorm = 2.7802, lr_0 = 9.9602e-04
Loss = 4.3485e-02, PNorm = 642.2681, GNorm = 0.7600, lr_0 = 9.9602e-04
Validation binary_cross_entropy = 0.081542
Epoch 5870
Loss = 9.0316e-02, PNorm = 642.3264, GNorm = 1.2851, lr_0 = 9.9602e-04
Validation binary_cross_entropy = 0.116432
Epoch 5871
Loss = 2.3782e-02, PNorm = 642.3838, GNorm = 0.0930, lr_0 = 9.9602e-04
Validation binary_cross_entropy = 0.086823
Epoch 5872
Loss = 4.1383e-02, PNorm = 642.4355, GNorm = 0.1711, lr_0 = 9.9602e-04
Validation binary_cross_entropy = 0.092777
Epoch 5873
Loss = 1.1124e-01, PNorm = 642.5058, GNorm = 3.0773, lr_0 = 9.9602e-04
Validation binary_cross_entropy = 0.092119
Epoch 5874
Loss = 2.0806e-02, PNorm = 642.5559, GNorm = 0.0649, lr_0 = 9.9602e-04
Validation binary_cross_entropy = 0.104910
Epoch 5875
Loss = 1.9967e-02, PNorm = 642.5968, GNorm = 3.8971, lr_0 = 9.9602e-04
Validation binary_cross_entropy = 0.111139
Epoch 5876
Loss = 1.0472e-02, PNorm = 642.6212, GNorm = 0.0736, lr_0 = 9.9602e-04
Validation binary_cross_entropy = 0.118571
Epoch 5877
Loss = 5.3326e-02, PNorm = 642.6437, GNorm = 1.9087, lr_0 = 9.9602e-04
Validation binary_cross_entropy = 0.090239
Epoch 5878
Loss = 4.4534e-02, PNorm = 642.6691, GNorm = 0.7234, lr_0 = 9.9602e-04
Validation binary_cross_entropy = 0.091961
Epoch 5879
Loss = 4.3506e-03, PNorm = 642.7156, GNorm = 0.1570, lr_0 = 9.9602e-04
Loss = 6.0816e-02, PNorm = 642.7608, GNorm = 2.2247, lr_0 = 9.9602e-04
Validation binary_cross_entropy = 0.092547
Epoch 5880
Loss = 4.0020e-02, PNorm = 642.8062, GNorm = 0.3594, lr_0 = 9.9601e-04
Validation binary_cross_entropy = 0.131382
Epoch 5881
Loss = 1.9547e-02, PNorm = 642.8408, GNorm = 1.1024, lr_0 = 9.9601e-04
Validation binary_cross_entropy = 0.164216
Epoch 5882
Loss = 1.7454e-01, PNorm = 642.8763, GNorm = 0.4668, lr_0 = 9.9601e-04
Validation binary_cross_entropy = 0.101973
Epoch 5883
Loss = 9.3230e-02, PNorm = 642.9418, GNorm = 0.7645, lr_0 = 9.9601e-04
Validation binary_cross_entropy = 0.137822
Epoch 5884
Loss = 3.8007e-02, PNorm = 642.9949, GNorm = 0.1354, lr_0 = 9.9601e-04
Validation binary_cross_entropy = 0.090653
Epoch 5885
Loss = 7.1874e-02, PNorm = 643.0657, GNorm = 2.9617, lr_0 = 9.9601e-04
Validation binary_cross_entropy = 0.116322
Epoch 5886
Loss = 6.4006e-02, PNorm = 643.1269, GNorm = 0.5674, lr_0 = 9.9601e-04
Validation binary_cross_entropy = 0.083059
Epoch 5887
Loss = 2.8568e-02, PNorm = 643.1785, GNorm = 0.7657, lr_0 = 9.9601e-04
Validation binary_cross_entropy = 0.108099
Epoch 5888
Loss = 9.5583e-02, PNorm = 643.2383, GNorm = 2.9775, lr_0 = 9.9601e-04
Validation binary_cross_entropy = 0.104665
Epoch 5889
Loss = 1.3320e-02, PNorm = 643.2956, GNorm = 0.5781, lr_0 = 9.9601e-04
Loss = 5.8578e-02, PNorm = 643.3391, GNorm = 0.1128, lr_0 = 9.9601e-04
Validation binary_cross_entropy = 0.090364
Epoch 5890
Loss = 5.2272e-02, PNorm = 643.4006, GNorm = 0.4252, lr_0 = 9.9601e-04
Validation binary_cross_entropy = 0.091095
Epoch 5891
Loss = 3.1199e-02, PNorm = 643.4622, GNorm = 2.0998, lr_0 = 9.9601e-04
Validation binary_cross_entropy = 0.101782
Epoch 5892
Loss = 4.8074e-02, PNorm = 643.5093, GNorm = 5.3316, lr_0 = 9.9601e-04
Validation binary_cross_entropy = 0.092007
Epoch 5893
Loss = 3.3686e-02, PNorm = 643.5492, GNorm = 4.1836, lr_0 = 9.9600e-04
Validation binary_cross_entropy = 0.091071
Epoch 5894
Loss = 1.4557e-02, PNorm = 643.5895, GNorm = 1.9265, lr_0 = 9.9600e-04
Validation binary_cross_entropy = 0.095133
Epoch 5895
Loss = 5.2554e-02, PNorm = 643.6253, GNorm = 0.1224, lr_0 = 9.9600e-04
Validation binary_cross_entropy = 0.107417
Epoch 5896
Loss = 2.3063e-02, PNorm = 643.6689, GNorm = 0.8124, lr_0 = 9.9600e-04
Validation binary_cross_entropy = 0.100511
Epoch 5897
Loss = 1.7207e-02, PNorm = 643.7057, GNorm = 0.2069, lr_0 = 9.9600e-04
Validation binary_cross_entropy = 0.091358
Epoch 5898
Loss = 6.2707e-02, PNorm = 643.7472, GNorm = 1.4088, lr_0 = 9.9600e-04
Validation binary_cross_entropy = 0.105147
Epoch 5899
Loss = 3.1926e-03, PNorm = 643.7987, GNorm = 0.1137, lr_0 = 9.9600e-04
Loss = 3.6590e-02, PNorm = 643.8522, GNorm = 0.2421, lr_0 = 9.9600e-04
Validation binary_cross_entropy = 0.124468
Epoch 5900
Loss = 1.8204e-02, PNorm = 643.8954, GNorm = 3.4098, lr_0 = 9.9600e-04
Validation binary_cross_entropy = 0.162865
Epoch 5901
Loss = 2.9737e-02, PNorm = 643.9253, GNorm = 0.0876, lr_0 = 9.9600e-04
Validation binary_cross_entropy = 0.157441
Epoch 5902
Loss = 3.5881e-02, PNorm = 643.9492, GNorm = 6.0778, lr_0 = 9.9600e-04
Validation binary_cross_entropy = 0.121452
Epoch 5903
Loss = 5.3367e-03, PNorm = 643.9959, GNorm = 0.2557, lr_0 = 9.9600e-04
Validation binary_cross_entropy = 0.115415
Epoch 5904
Loss = 7.5825e-03, PNorm = 644.0421, GNorm = 0.1164, lr_0 = 9.9600e-04
Validation binary_cross_entropy = 0.125120
Epoch 5905
Loss = 3.2459e-02, PNorm = 644.0740, GNorm = 0.0691, lr_0 = 9.9600e-04
Validation binary_cross_entropy = 0.128920
Epoch 5906
Loss = 5.6857e-03, PNorm = 644.1025, GNorm = 0.2083, lr_0 = 9.9600e-04
Validation binary_cross_entropy = 0.153049
Epoch 5907
Loss = 8.1954e-03, PNorm = 644.1256, GNorm = 0.4736, lr_0 = 9.9599e-04
Validation binary_cross_entropy = 0.128524
Epoch 5908
Loss = 7.8594e-02, PNorm = 644.1585, GNorm = 4.2665, lr_0 = 9.9599e-04
Validation binary_cross_entropy = 0.155794
Epoch 5909
Loss = 3.6499e-04, PNorm = 644.2243, GNorm = 0.0160, lr_0 = 9.9599e-04
Loss = 2.6999e-02, PNorm = 644.2689, GNorm = 0.0760, lr_0 = 9.9599e-04
Validation binary_cross_entropy = 0.158281
Epoch 5910
Loss = 7.7232e-02, PNorm = 644.3188, GNorm = 1.2663, lr_0 = 9.9599e-04
Validation binary_cross_entropy = 0.120510
Epoch 5911
Loss = 3.9830e-02, PNorm = 644.3906, GNorm = 0.8582, lr_0 = 9.9599e-04
Validation binary_cross_entropy = 0.161893
Epoch 5912
Loss = 4.8044e-02, PNorm = 644.4401, GNorm = 0.4674, lr_0 = 9.9599e-04
Validation binary_cross_entropy = 0.116052
Epoch 5913
Loss = 3.2303e-02, PNorm = 644.4749, GNorm = 2.4195, lr_0 = 9.9599e-04
Validation binary_cross_entropy = 0.109157
Epoch 5914
Loss = 2.4114e-02, PNorm = 644.5236, GNorm = 0.0498, lr_0 = 9.9599e-04
Validation binary_cross_entropy = 0.111392
Epoch 5915
Loss = 1.1650e-02, PNorm = 644.5638, GNorm = 0.1373, lr_0 = 9.9599e-04
Validation binary_cross_entropy = 0.106295
Epoch 5916
Loss = 4.1795e-03, PNorm = 644.5941, GNorm = 0.4688, lr_0 = 9.9599e-04
Validation binary_cross_entropy = 0.107525
Epoch 5917
Loss = 2.1847e-03, PNorm = 644.6161, GNorm = 0.0510, lr_0 = 9.9599e-04
Validation binary_cross_entropy = 0.114473
Epoch 5918
Loss = 8.2648e-03, PNorm = 644.6376, GNorm = 0.1324, lr_0 = 9.9599e-04
Validation binary_cross_entropy = 0.123957
Epoch 5919
Loss = 7.6260e-04, PNorm = 644.6634, GNorm = 0.0408, lr_0 = 9.9599e-04
Loss = 2.5427e-02, PNorm = 644.6974, GNorm = 0.2827, lr_0 = 9.9599e-04
Validation binary_cross_entropy = 0.131820
Epoch 5920
Loss = 1.0304e-02, PNorm = 644.7285, GNorm = 0.0101, lr_0 = 9.9598e-04
Validation binary_cross_entropy = 0.144651
Epoch 5921
Loss = 3.5518e-02, PNorm = 644.7542, GNorm = 2.3276, lr_0 = 9.9598e-04
Validation binary_cross_entropy = 0.145228
Epoch 5922
Loss = 1.1381e-02, PNorm = 644.7780, GNorm = 0.3120, lr_0 = 9.9598e-04
Validation binary_cross_entropy = 0.139997
Epoch 5923
Loss = 2.6224e-02, PNorm = 644.8074, GNorm = 0.6290, lr_0 = 9.9598e-04
Validation binary_cross_entropy = 0.143043
Epoch 5924
Loss = 9.9869e-02, PNorm = 644.8351, GNorm = 0.0631, lr_0 = 9.9598e-04
Validation binary_cross_entropy = 0.117610
Epoch 5925
Loss = 2.5432e-02, PNorm = 644.8624, GNorm = 1.7631, lr_0 = 9.9598e-04
Validation binary_cross_entropy = 0.115545
Epoch 5926
Loss = 7.2481e-03, PNorm = 644.9009, GNorm = 0.5198, lr_0 = 9.9598e-04
Validation binary_cross_entropy = 0.110727
Epoch 5927
Loss = 7.8184e-02, PNorm = 644.9301, GNorm = 1.1911, lr_0 = 9.9598e-04
Validation binary_cross_entropy = 0.087844
Epoch 5928
Loss = 8.9714e-03, PNorm = 644.9635, GNorm = 0.2800, lr_0 = 9.9598e-04
Validation binary_cross_entropy = 0.077981
Epoch 5929
Loss = 6.4004e-03, PNorm = 645.0112, GNorm = 0.2638, lr_0 = 9.9598e-04
Loss = 2.4388e-02, PNorm = 645.0518, GNorm = 0.3671, lr_0 = 9.9598e-04
Validation binary_cross_entropy = 0.079622
Epoch 5930
Loss = 4.6447e-02, PNorm = 645.0851, GNorm = 3.9200, lr_0 = 9.9598e-04
Validation binary_cross_entropy = 0.083916
Epoch 5931
Loss = 1.1914e-02, PNorm = 645.1166, GNorm = 0.7264, lr_0 = 9.9598e-04
Validation binary_cross_entropy = 0.087639
Epoch 5932
Loss = 2.1084e-02, PNorm = 645.1403, GNorm = 2.5044, lr_0 = 9.9598e-04
Validation binary_cross_entropy = 0.084238
Epoch 5933
Loss = 4.3839e-02, PNorm = 645.1672, GNorm = 0.5948, lr_0 = 9.9597e-04
Validation binary_cross_entropy = 0.083068
Epoch 5934
Loss = 1.6805e-02, PNorm = 645.1960, GNorm = 0.1763, lr_0 = 9.9597e-04
Validation binary_cross_entropy = 0.084070
Epoch 5935
Loss = 2.7162e-02, PNorm = 645.2372, GNorm = 0.9947, lr_0 = 9.9597e-04
Validation binary_cross_entropy = 0.100252
Epoch 5936
Loss = 7.9645e-02, PNorm = 645.2676, GNorm = 0.5287, lr_0 = 9.9597e-04
Validation binary_cross_entropy = 0.091594
Epoch 5937
Loss = 2.2210e-02, PNorm = 645.2862, GNorm = 0.6716, lr_0 = 9.9597e-04
Validation binary_cross_entropy = 0.085931
Epoch 5938
Loss = 5.5760e-03, PNorm = 645.3042, GNorm = 0.5370, lr_0 = 9.9597e-04
Validation binary_cross_entropy = 0.095022
Epoch 5939
Loss = 1.9227e-02, PNorm = 645.3301, GNorm = 1.2998, lr_0 = 9.9597e-04
Loss = 1.3125e-02, PNorm = 645.3544, GNorm = 0.1076, lr_0 = 9.9597e-04
Validation binary_cross_entropy = 0.101454
Epoch 5940
Loss = 3.2915e-02, PNorm = 645.3760, GNorm = 1.5737, lr_0 = 9.9597e-04
Validation binary_cross_entropy = 0.102842
Epoch 5941
Loss = 9.5938e-02, PNorm = 645.4040, GNorm = 2.8836, lr_0 = 9.9597e-04
Validation binary_cross_entropy = 0.084712
Epoch 5942
Loss = 4.2647e-02, PNorm = 645.4516, GNorm = 0.1641, lr_0 = 9.9597e-04
Validation binary_cross_entropy = 0.097857
Epoch 5943
Loss = 6.3589e-02, PNorm = 645.4962, GNorm = 0.4266, lr_0 = 9.9597e-04
Validation binary_cross_entropy = 0.094187
Epoch 5944
Loss = 4.4097e-02, PNorm = 645.5247, GNorm = 8.0171, lr_0 = 9.9597e-04
Validation binary_cross_entropy = 0.098903
Epoch 5945
Loss = 2.0545e-02, PNorm = 645.5575, GNorm = 0.2296, lr_0 = 9.9597e-04
Validation binary_cross_entropy = 0.107334
Epoch 5946
Loss = 2.9055e-02, PNorm = 645.5916, GNorm = 2.5249, lr_0 = 9.9597e-04
Validation binary_cross_entropy = 0.083020
Epoch 5947
Loss = 3.1366e-02, PNorm = 645.6246, GNorm = 0.2858, lr_0 = 9.9596e-04
Validation binary_cross_entropy = 0.086601
Epoch 5948
Loss = 2.2365e-02, PNorm = 645.6689, GNorm = 1.0377, lr_0 = 9.9596e-04
Validation binary_cross_entropy = 0.091553
Epoch 5949
Loss = 1.2095e-03, PNorm = 645.7103, GNorm = 0.0540, lr_0 = 9.9596e-04
Loss = 2.3480e-02, PNorm = 645.7553, GNorm = 1.6781, lr_0 = 9.9596e-04
Validation binary_cross_entropy = 0.089270
Epoch 5950
Loss = 7.5916e-03, PNorm = 645.7908, GNorm = 0.0658, lr_0 = 9.9596e-04
Validation binary_cross_entropy = 0.091164
Epoch 5951
Loss = 2.2631e-02, PNorm = 645.8238, GNorm = 0.0631, lr_0 = 9.9596e-04
Validation binary_cross_entropy = 0.093963
Epoch 5952
Loss = 1.6541e-02, PNorm = 645.8559, GNorm = 0.3938, lr_0 = 9.9596e-04
Validation binary_cross_entropy = 0.092326
Epoch 5953
Loss = 2.9946e-02, PNorm = 645.8803, GNorm = 0.2701, lr_0 = 9.9596e-04
Validation binary_cross_entropy = 0.093932
Epoch 5954
Loss = 2.0656e-02, PNorm = 645.9062, GNorm = 0.1437, lr_0 = 9.9596e-04
Validation binary_cross_entropy = 0.101379
Epoch 5955
Loss = 1.1215e-02, PNorm = 645.9374, GNorm = 0.8642, lr_0 = 9.9596e-04
Validation binary_cross_entropy = 0.131380
Epoch 5956
Loss = 2.6584e-02, PNorm = 645.9632, GNorm = 1.2751, lr_0 = 9.9596e-04
Validation binary_cross_entropy = 0.172049
Epoch 5957
Loss = 1.9238e-03, PNorm = 645.9930, GNorm = 0.3741, lr_0 = 9.9596e-04
Validation binary_cross_entropy = 0.147519
Epoch 5958
Loss = 6.6141e-02, PNorm = 646.0119, GNorm = 0.1072, lr_0 = 9.9596e-04
Validation binary_cross_entropy = 0.119233
Epoch 5959
Loss = 1.6478e-02, PNorm = 646.0492, GNorm = 0.9441, lr_0 = 9.9596e-04
Loss = 1.1044e-02, PNorm = 646.1010, GNorm = 0.3696, lr_0 = 9.9595e-04
Validation binary_cross_entropy = 0.086398
Epoch 5960
Loss = 4.2056e-02, PNorm = 646.1477, GNorm = 0.3016, lr_0 = 9.9595e-04
Validation binary_cross_entropy = 0.085645
Epoch 5961
Loss = 2.9131e-02, PNorm = 646.1941, GNorm = 2.6635, lr_0 = 9.9595e-04
Validation binary_cross_entropy = 0.092812
Epoch 5962
Loss = 4.0360e-02, PNorm = 646.2450, GNorm = 2.2134, lr_0 = 9.9595e-04
Validation binary_cross_entropy = 0.101568
Epoch 5963
Loss = 7.5712e-03, PNorm = 646.2905, GNorm = 0.2098, lr_0 = 9.9595e-04
Validation binary_cross_entropy = 0.104185
Epoch 5964
Loss = 5.5888e-02, PNorm = 646.3182, GNorm = 0.1425, lr_0 = 9.9595e-04
Validation binary_cross_entropy = 0.102750
Epoch 5965
Loss = 4.1875e-02, PNorm = 646.3462, GNorm = 0.1119, lr_0 = 9.9595e-04
Validation binary_cross_entropy = 0.099969
Epoch 5966
Loss = 2.7597e-02, PNorm = 646.3781, GNorm = 0.3436, lr_0 = 9.9595e-04
Validation binary_cross_entropy = 0.099102
Epoch 5967
Loss = 2.6054e-02, PNorm = 646.4091, GNorm = 0.3442, lr_0 = 9.9595e-04
Validation binary_cross_entropy = 0.108657
Epoch 5968
Loss = 1.2670e-02, PNorm = 646.4397, GNorm = 0.3128, lr_0 = 9.9595e-04
Validation binary_cross_entropy = 0.095059
Epoch 5969
Loss = 6.6301e-03, PNorm = 646.4611, GNorm = 0.2176, lr_0 = 9.9595e-04
Loss = 5.0451e-02, PNorm = 646.4998, GNorm = 2.5391, lr_0 = 9.9595e-04
Validation binary_cross_entropy = 0.087637
Epoch 5970
Loss = 3.7481e-02, PNorm = 646.5420, GNorm = 0.4996, lr_0 = 9.9595e-04
Validation binary_cross_entropy = 0.093356
Epoch 5971
Loss = 5.2611e-02, PNorm = 646.5674, GNorm = 3.7954, lr_0 = 9.9595e-04
Validation binary_cross_entropy = 0.083530
Epoch 5972
Loss = 1.9291e-02, PNorm = 646.6004, GNorm = 0.2832, lr_0 = 9.9595e-04
Validation binary_cross_entropy = 0.086528
Epoch 5973
Loss = 1.8013e-02, PNorm = 646.6279, GNorm = 0.0880, lr_0 = 9.9594e-04
Validation binary_cross_entropy = 0.086508
Epoch 5974
Loss = 2.5738e-02, PNorm = 646.6481, GNorm = 4.3119, lr_0 = 9.9594e-04
Validation binary_cross_entropy = 0.087546
Epoch 5975
Loss = 7.0878e-03, PNorm = 646.6776, GNorm = 0.0911, lr_0 = 9.9594e-04
Validation binary_cross_entropy = 0.091165
Epoch 5976
Loss = 2.3685e-02, PNorm = 646.7028, GNorm = 1.9011, lr_0 = 9.9594e-04
Validation binary_cross_entropy = 0.095730
Epoch 5977
Loss = 4.8660e-02, PNorm = 646.7435, GNorm = 2.9144, lr_0 = 9.9594e-04
Validation binary_cross_entropy = 0.088601
Epoch 5978
Loss = 3.0021e-03, PNorm = 646.7852, GNorm = 0.3170, lr_0 = 9.9594e-04
Validation binary_cross_entropy = 0.105016
Epoch 5979
Loss = 1.0142e-02, PNorm = 646.8289, GNorm = 0.4820, lr_0 = 9.9594e-04
Loss = 6.6682e-03, PNorm = 646.8530, GNorm = 0.2077, lr_0 = 9.9594e-04
Validation binary_cross_entropy = 0.115282
Epoch 5980
Loss = 1.6754e-02, PNorm = 646.8690, GNorm = 0.0534, lr_0 = 9.9594e-04
Validation binary_cross_entropy = 0.095652
Epoch 5981
Loss = 2.7240e-02, PNorm = 646.8989, GNorm = 0.1460, lr_0 = 9.9594e-04
Validation binary_cross_entropy = 0.099178
Epoch 5982
Loss = 2.3563e-02, PNorm = 646.9286, GNorm = 1.3124, lr_0 = 9.9594e-04
Validation binary_cross_entropy = 0.122690
Epoch 5983
Loss = 3.7511e-02, PNorm = 646.9555, GNorm = 0.0126, lr_0 = 9.9594e-04
Validation binary_cross_entropy = 0.106933
Epoch 5984
Loss = 3.1405e-02, PNorm = 646.9794, GNorm = 0.6367, lr_0 = 9.9594e-04
Validation binary_cross_entropy = 0.106545
Epoch 5985
Loss = 2.7061e-02, PNorm = 647.0058, GNorm = 1.7614, lr_0 = 9.9594e-04
Validation binary_cross_entropy = 0.102254
Epoch 5986
Loss = 5.9253e-02, PNorm = 647.0370, GNorm = 1.4849, lr_0 = 9.9594e-04
Validation binary_cross_entropy = 0.091725
Epoch 5987
Loss = 3.9569e-03, PNorm = 647.0754, GNorm = 0.1118, lr_0 = 9.9593e-04
Validation binary_cross_entropy = 0.090640
Epoch 5988
Loss = 2.6302e-01, PNorm = 647.0992, GNorm = 0.0421, lr_0 = 9.9593e-04
Validation binary_cross_entropy = 0.064665
Epoch 5989
Loss = 3.8961e-02, PNorm = 647.1366, GNorm = 1.0361, lr_0 = 9.9593e-04
Loss = 1.7655e-02, PNorm = 647.2191, GNorm = 0.2161, lr_0 = 9.9593e-04
Validation binary_cross_entropy = 0.068294
Epoch 5990
Loss = 1.9263e-02, PNorm = 647.2863, GNorm = 1.6448, lr_0 = 9.9593e-04
Validation binary_cross_entropy = 0.081567
Epoch 5991
Loss = 1.7608e-02, PNorm = 647.3156, GNorm = 0.4809, lr_0 = 9.9593e-04
Validation binary_cross_entropy = 0.078171
Epoch 5992
Loss = 2.4839e-02, PNorm = 647.3332, GNorm = 1.6067, lr_0 = 9.9593e-04
Validation binary_cross_entropy = 0.077084
Epoch 5993
Loss = 2.7437e-02, PNorm = 647.3715, GNorm = 0.2016, lr_0 = 9.9593e-04
Validation binary_cross_entropy = 0.083896
Epoch 5994
Loss = 6.9250e-02, PNorm = 647.4091, GNorm = 8.6486, lr_0 = 9.9593e-04
Validation binary_cross_entropy = 0.081892
Epoch 5995
Loss = 3.5804e-03, PNorm = 647.4431, GNorm = 0.2498, lr_0 = 9.9593e-04
Validation binary_cross_entropy = 0.105431
Epoch 5996
Loss = 1.0452e-01, PNorm = 647.4755, GNorm = 2.7652, lr_0 = 9.9593e-04
Validation binary_cross_entropy = 0.066057
Epoch 5997
Loss = 7.3437e-02, PNorm = 647.6034, GNorm = 1.0160, lr_0 = 9.9593e-04
Validation binary_cross_entropy = 0.074645
Epoch 5998
Loss = 7.8059e-02, PNorm = 647.7732, GNorm = 1.0903, lr_0 = 9.9593e-04
Validation binary_cross_entropy = 0.073671
Epoch 5999
Loss = 6.0616e-02, PNorm = 647.8909, GNorm = 1.5309, lr_0 = 9.9593e-04
Loss = 8.7776e-02, PNorm = 647.9717, GNorm = 2.4808, lr_0 = 9.9592e-04
Validation binary_cross_entropy = 0.100706
Epoch 6000
Loss = 6.5455e-02, PNorm = 648.0302, GNorm = 2.2297, lr_0 = 9.9592e-04
Validation binary_cross_entropy = 0.077177
Epoch 6001
Loss = 5.1989e-02, PNorm = 648.0819, GNorm = 0.0584, lr_0 = 9.9592e-04
Validation binary_cross_entropy = 0.093671
Epoch 6002
Loss = 6.1012e-02, PNorm = 648.1223, GNorm = 4.1540, lr_0 = 9.9592e-04
Validation binary_cross_entropy = 0.074582
Epoch 6003
Loss = 9.7303e-02, PNorm = 648.1686, GNorm = 2.9312, lr_0 = 9.9592e-04
Validation binary_cross_entropy = 0.099765
Epoch 6004
Loss = 5.6028e-02, PNorm = 648.2154, GNorm = 2.4867, lr_0 = 9.9592e-04
Validation binary_cross_entropy = 0.059651
Epoch 6005
Loss = 4.0646e-02, PNorm = 648.2604, GNorm = 0.5634, lr_0 = 9.9592e-04
Validation binary_cross_entropy = 0.062229
Epoch 6006
Loss = 5.8774e-02, PNorm = 648.3191, GNorm = 0.9898, lr_0 = 9.9592e-04
Validation binary_cross_entropy = 0.072733
Epoch 6007
Loss = 6.6301e-02, PNorm = 648.3637, GNorm = 1.3118, lr_0 = 9.9592e-04
Validation binary_cross_entropy = 0.066834
Epoch 6008
Loss = 8.7335e-02, PNorm = 648.3999, GNorm = 4.2765, lr_0 = 9.9592e-04
Validation binary_cross_entropy = 0.070829
Epoch 6009
Loss = 6.7929e-03, PNorm = 648.4387, GNorm = 0.4243, lr_0 = 9.9592e-04
Loss = 6.2931e-02, PNorm = 648.4820, GNorm = 3.2128, lr_0 = 9.9592e-04
Validation binary_cross_entropy = 0.069807
Epoch 6010
Loss = 4.6895e-02, PNorm = 648.5240, GNorm = 1.7673, lr_0 = 9.9592e-04
Validation binary_cross_entropy = 0.061013
Epoch 6011
Loss = 6.8547e-02, PNorm = 648.5786, GNorm = 1.4335, lr_0 = 9.9592e-04
Validation binary_cross_entropy = 0.074166
Epoch 6012
Loss = 3.6775e-02, PNorm = 648.6281, GNorm = 0.4654, lr_0 = 9.9592e-04
Validation binary_cross_entropy = 0.074331
Epoch 6013
Loss = 2.6842e-02, PNorm = 648.6650, GNorm = 2.2753, lr_0 = 9.9591e-04
Validation binary_cross_entropy = 0.071435
Epoch 6014
Loss = 5.3086e-02, PNorm = 648.7020, GNorm = 0.6068, lr_0 = 9.9591e-04
Validation binary_cross_entropy = 0.091155
Epoch 6015
Loss = 3.8238e-02, PNorm = 648.7336, GNorm = 0.8479, lr_0 = 9.9591e-04
Validation binary_cross_entropy = 0.069644
Epoch 6016
Loss = 3.6460e-02, PNorm = 648.7598, GNorm = 5.1354, lr_0 = 9.9591e-04
Validation binary_cross_entropy = 0.061255
Epoch 6017
Loss = 3.8740e-02, PNorm = 648.8007, GNorm = 3.6901, lr_0 = 9.9591e-04
Validation binary_cross_entropy = 0.087471
Epoch 6018
Loss = 5.6426e-02, PNorm = 648.8628, GNorm = 1.8277, lr_0 = 9.9591e-04
Validation binary_cross_entropy = 0.070837
Epoch 6019
Loss = 5.7888e-02, PNorm = 648.9201, GNorm = 1.9334, lr_0 = 9.9591e-04
Loss = 1.2411e-01, PNorm = 648.9793, GNorm = 7.2697, lr_0 = 9.9591e-04
Validation binary_cross_entropy = 0.068507
Epoch 6020
Loss = 3.8502e-02, PNorm = 649.0516, GNorm = 0.7322, lr_0 = 9.9591e-04
Validation binary_cross_entropy = 0.078849
Epoch 6021
Loss = 2.6453e-02, PNorm = 649.1143, GNorm = 1.1803, lr_0 = 9.9591e-04
Validation binary_cross_entropy = 0.086451
Epoch 6022
Loss = 4.0104e-02, PNorm = 649.1710, GNorm = 1.6459, lr_0 = 9.9591e-04
Validation binary_cross_entropy = 0.104199
Epoch 6023
Loss = 7.0907e-02, PNorm = 649.2061, GNorm = 20.6400, lr_0 = 9.9591e-04
Validation binary_cross_entropy = 0.071888
Epoch 6024
Loss = 9.7771e-02, PNorm = 649.2730, GNorm = 1.2391, lr_0 = 9.9591e-04
Validation binary_cross_entropy = 0.086749
Epoch 6025
Loss = 3.2382e-02, PNorm = 649.3733, GNorm = 1.0907, lr_0 = 9.9591e-04
Validation binary_cross_entropy = 0.095379
Epoch 6026
Loss = 5.9638e-02, PNorm = 649.4241, GNorm = 0.8545, lr_0 = 9.9590e-04
Validation binary_cross_entropy = 0.070877
Epoch 6027
Loss = 1.7770e-02, PNorm = 649.4655, GNorm = 0.4489, lr_0 = 9.9590e-04
Validation binary_cross_entropy = 0.087748
Epoch 6028
Loss = 4.9161e-02, PNorm = 649.5193, GNorm = 0.5288, lr_0 = 9.9590e-04
Validation binary_cross_entropy = 0.091377
Epoch 6029
Loss = 6.2748e-03, PNorm = 649.5568, GNorm = 0.3992, lr_0 = 9.9590e-04
Loss = 3.2582e-02, PNorm = 649.5840, GNorm = 5.7988, lr_0 = 9.9590e-04
Validation binary_cross_entropy = 0.084503
Epoch 6030
Loss = 2.0152e-02, PNorm = 649.6207, GNorm = 0.4492, lr_0 = 9.9590e-04
Validation binary_cross_entropy = 0.093479
Epoch 6031
Loss = 2.6119e-02, PNorm = 649.6551, GNorm = 0.1458, lr_0 = 9.9590e-04
Validation binary_cross_entropy = 0.096075
Epoch 6032
Loss = 1.8850e-02, PNorm = 649.6871, GNorm = 1.5047, lr_0 = 9.9590e-04
Validation binary_cross_entropy = 0.107439
Epoch 6033
Loss = 1.4384e-02, PNorm = 649.7209, GNorm = 0.0173, lr_0 = 9.9590e-04
Validation binary_cross_entropy = 0.131826
Epoch 6034
Loss = 4.5466e-02, PNorm = 649.7528, GNorm = 0.9139, lr_0 = 9.9590e-04
Validation binary_cross_entropy = 0.086724
Epoch 6035
Loss = 3.3597e-02, PNorm = 649.7949, GNorm = 0.0641, lr_0 = 9.9590e-04
Validation binary_cross_entropy = 0.080229
Epoch 6036
Loss = 1.8450e-02, PNorm = 649.8343, GNorm = 0.1547, lr_0 = 9.9590e-04
Validation binary_cross_entropy = 0.103142
Epoch 6037
Loss = 5.6613e-02, PNorm = 649.8639, GNorm = 1.0998, lr_0 = 9.9590e-04
Validation binary_cross_entropy = 0.089679
Epoch 6038
Loss = 6.2742e-02, PNorm = 649.8844, GNorm = 4.8578, lr_0 = 9.9590e-04
Validation binary_cross_entropy = 0.104366
Epoch 6039
Loss = 6.7269e-03, PNorm = 649.9137, GNorm = 0.4081, lr_0 = 9.9590e-04
Loss = 1.7254e-02, PNorm = 649.9364, GNorm = 3.1959, lr_0 = 9.9589e-04
Validation binary_cross_entropy = 0.112474
Epoch 6040
Loss = 3.1881e-02, PNorm = 649.9531, GNorm = 4.8496, lr_0 = 9.9589e-04
Validation binary_cross_entropy = 0.105802
Epoch 6041
Loss = 1.2510e-02, PNorm = 649.9882, GNorm = 0.9402, lr_0 = 9.9589e-04
Validation binary_cross_entropy = 0.130612
Epoch 6042
Loss = 1.6644e-02, PNorm = 650.0160, GNorm = 1.6800, lr_0 = 9.9589e-04
Validation binary_cross_entropy = 0.116602
Epoch 6043
Loss = 6.2744e-03, PNorm = 650.0353, GNorm = 0.1786, lr_0 = 9.9589e-04
Validation binary_cross_entropy = 0.090252
Epoch 6044
Loss = 4.8504e-02, PNorm = 650.0646, GNorm = 1.0145, lr_0 = 9.9589e-04
Validation binary_cross_entropy = 0.087181
Epoch 6045
Loss = 3.2269e-02, PNorm = 650.0972, GNorm = 1.1702, lr_0 = 9.9589e-04
Validation binary_cross_entropy = 0.081913
Epoch 6046
Loss = 9.7059e-03, PNorm = 650.1260, GNorm = 0.1473, lr_0 = 9.9589e-04
Validation binary_cross_entropy = 0.089589
Epoch 6047
Loss = 1.8918e-02, PNorm = 650.1464, GNorm = 0.0377, lr_0 = 9.9589e-04
Validation binary_cross_entropy = 0.079825
Epoch 6048
Loss = 4.3436e-02, PNorm = 650.1611, GNorm = 2.3332, lr_0 = 9.9589e-04
Validation binary_cross_entropy = 0.089545
Epoch 6049
Loss = 8.8961e-02, PNorm = 650.1845, GNorm = 2.5055, lr_0 = 9.9589e-04
Loss = 2.6482e-02, PNorm = 650.2061, GNorm = 5.4138, lr_0 = 9.9589e-04
Validation binary_cross_entropy = 0.083169
Epoch 6050
Loss = 6.7995e-03, PNorm = 650.2372, GNorm = 0.0953, lr_0 = 9.9589e-04
Validation binary_cross_entropy = 0.090761
Epoch 6051
Loss = 8.8293e-02, PNorm = 650.2748, GNorm = 3.4804, lr_0 = 9.9589e-04
Validation binary_cross_entropy = 0.075921
Epoch 6052
Loss = 1.5738e-02, PNorm = 650.3311, GNorm = 0.3317, lr_0 = 9.9588e-04
Validation binary_cross_entropy = 0.075274
Epoch 6053
Loss = 4.7547e-02, PNorm = 650.3920, GNorm = 2.1268, lr_0 = 9.9588e-04
Validation binary_cross_entropy = 0.083206
Epoch 6054
Loss = 5.6060e-02, PNorm = 650.4536, GNorm = 1.4388, lr_0 = 9.9588e-04
Validation binary_cross_entropy = 0.087610
Epoch 6055
Loss = 3.5748e-02, PNorm = 650.5104, GNorm = 2.0583, lr_0 = 9.9588e-04
Validation binary_cross_entropy = 0.108419
Epoch 6056
Loss = 5.8516e-03, PNorm = 650.5599, GNorm = 0.4105, lr_0 = 9.9588e-04
Validation binary_cross_entropy = 0.122741
Epoch 6057
Loss = 3.0642e-02, PNorm = 650.5856, GNorm = 1.3302, lr_0 = 9.9588e-04
Validation binary_cross_entropy = 0.121408
Epoch 6058
Loss = 7.5938e-02, PNorm = 650.6115, GNorm = 0.0524, lr_0 = 9.9588e-04
Validation binary_cross_entropy = 0.125218
Epoch 6059
Loss = 5.0814e-02, PNorm = 650.6391, GNorm = 0.8883, lr_0 = 9.9588e-04
Loss = 1.8696e-02, PNorm = 650.6527, GNorm = 0.0520, lr_0 = 9.9588e-04
Validation binary_cross_entropy = 0.102481
Epoch 6060
Loss = 3.3168e-02, PNorm = 650.6784, GNorm = 3.1103, lr_0 = 9.9588e-04
Validation binary_cross_entropy = 0.092117
Epoch 6061
Loss = 2.7542e-02, PNorm = 650.7137, GNorm = 0.2844, lr_0 = 9.9588e-04
Validation binary_cross_entropy = 0.088417
Epoch 6062
Loss = 2.6622e-02, PNorm = 650.7485, GNorm = 0.8425, lr_0 = 9.9588e-04
Validation binary_cross_entropy = 0.074886
Epoch 6063
Loss = 2.5262e-02, PNorm = 650.8107, GNorm = 0.4355, lr_0 = 9.9588e-04
Validation binary_cross_entropy = 0.104761
Epoch 6064
Loss = 6.2537e-02, PNorm = 650.8637, GNorm = 0.9147, lr_0 = 9.9588e-04
Validation binary_cross_entropy = 0.108430
Epoch 6065
Loss = 1.6674e-02, PNorm = 650.9040, GNorm = 0.0404, lr_0 = 9.9588e-04
Validation binary_cross_entropy = 0.150729
Epoch 6066
Loss = 1.2819e-01, PNorm = 650.9323, GNorm = 5.3792, lr_0 = 9.9587e-04
Validation binary_cross_entropy = 0.087924
Epoch 6067
Loss = 1.7214e-02, PNorm = 650.9913, GNorm = 0.0939, lr_0 = 9.9587e-04
Validation binary_cross_entropy = 0.081879
Epoch 6068
Loss = 3.8750e-03, PNorm = 651.0681, GNorm = 0.2783, lr_0 = 9.9587e-04
Validation binary_cross_entropy = 0.081243
Epoch 6069
Loss = 5.4112e-04, PNorm = 651.1171, GNorm = 0.0224, lr_0 = 9.9587e-04
Loss = 1.7735e-02, PNorm = 651.1656, GNorm = 0.7030, lr_0 = 9.9587e-04
Validation binary_cross_entropy = 0.082575
Epoch 6070
Loss = 2.4994e-02, PNorm = 651.2089, GNorm = 0.1348, lr_0 = 9.9587e-04
Validation binary_cross_entropy = 0.082867
Epoch 6071
Loss = 3.2204e-02, PNorm = 651.2460, GNorm = 0.0348, lr_0 = 9.9587e-04
Validation binary_cross_entropy = 0.095018
Epoch 6072
Loss = 2.5629e-02, PNorm = 651.2674, GNorm = 0.0308, lr_0 = 9.9587e-04
Validation binary_cross_entropy = 0.076143
Epoch 6073
Loss = 6.6506e-03, PNorm = 651.2977, GNorm = 0.2199, lr_0 = 9.9587e-04
Validation binary_cross_entropy = 0.075881
Epoch 6074
Loss = 5.0717e-03, PNorm = 651.3367, GNorm = 0.5100, lr_0 = 9.9587e-04
Validation binary_cross_entropy = 0.083136
Epoch 6075
Loss = 1.5573e-02, PNorm = 651.3659, GNorm = 1.2667, lr_0 = 9.9587e-04
Validation binary_cross_entropy = 0.085994
Epoch 6076
Loss = 5.1780e-02, PNorm = 651.3971, GNorm = 0.0463, lr_0 = 9.9587e-04
Validation binary_cross_entropy = 0.084751
Epoch 6077
Loss = 2.0390e-03, PNorm = 651.4316, GNorm = 0.0453, lr_0 = 9.9587e-04
Validation binary_cross_entropy = 0.105862
Epoch 6078
Loss = 2.9775e-02, PNorm = 651.4715, GNorm = 0.6826, lr_0 = 9.9587e-04
Validation binary_cross_entropy = 0.103771
Epoch 6079
Loss = 3.1542e-04, PNorm = 651.5001, GNorm = 0.0185, lr_0 = 9.9587e-04
Loss = 1.5044e-02, PNorm = 651.5243, GNorm = 0.1005, lr_0 = 9.9586e-04
Validation binary_cross_entropy = 0.083891
Epoch 6080
Loss = 3.2051e-02, PNorm = 651.5594, GNorm = 0.8000, lr_0 = 9.9586e-04
Validation binary_cross_entropy = 0.085675
Epoch 6081
Loss = 2.3448e-02, PNorm = 651.6026, GNorm = 1.2309, lr_0 = 9.9586e-04
Validation binary_cross_entropy = 0.098293
Epoch 6082
Loss = 2.0430e-02, PNorm = 651.6451, GNorm = 0.8369, lr_0 = 9.9586e-04
Validation binary_cross_entropy = 0.182700
Epoch 6083
Loss = 3.0797e-02, PNorm = 651.6968, GNorm = 0.0635, lr_0 = 9.9586e-04
Validation binary_cross_entropy = 0.224434
Epoch 6084
Loss = 7.1338e-02, PNorm = 651.7434, GNorm = 0.8434, lr_0 = 9.9586e-04
Validation binary_cross_entropy = 0.071876
Epoch 6085
Loss = 2.2804e-02, PNorm = 651.7930, GNorm = 0.2175, lr_0 = 9.9586e-04
Validation binary_cross_entropy = 0.070446
Epoch 6086
Loss = 6.1803e-02, PNorm = 651.8551, GNorm = 2.0474, lr_0 = 9.9586e-04
Validation binary_cross_entropy = 0.077187
Epoch 6087
Loss = 6.1263e-03, PNorm = 651.9195, GNorm = 0.0578, lr_0 = 9.9586e-04
Validation binary_cross_entropy = 0.082675
Epoch 6088
Loss = 4.4260e-02, PNorm = 651.9623, GNorm = 0.0984, lr_0 = 9.9586e-04
Validation binary_cross_entropy = 0.071053
Epoch 6089
Loss = 1.1875e-02, PNorm = 652.0021, GNorm = 0.7963, lr_0 = 9.9586e-04
Loss = 2.7866e-02, PNorm = 652.0415, GNorm = 0.7954, lr_0 = 9.9586e-04
Validation binary_cross_entropy = 0.063706
Epoch 6090
Loss = 2.8991e-02, PNorm = 652.0907, GNorm = 0.1901, lr_0 = 9.9586e-04
Validation binary_cross_entropy = 0.067616
Epoch 6091
Loss = 9.5488e-03, PNorm = 652.1335, GNorm = 0.1149, lr_0 = 9.9586e-04
Validation binary_cross_entropy = 0.073456
Epoch 6092
Loss = 2.3058e-01, PNorm = 652.1890, GNorm = 1.5064, lr_0 = 9.9585e-04
Validation binary_cross_entropy = 0.061534
Epoch 6093
Loss = 2.0485e-01, PNorm = 652.2980, GNorm = 2.3021, lr_0 = 9.9585e-04
Validation binary_cross_entropy = 0.105770
Epoch 6094
Loss = 2.5878e-02, PNorm = 652.3896, GNorm = 1.5646, lr_0 = 9.9585e-04
Validation binary_cross_entropy = 0.081579
Epoch 6095
Loss = 5.1056e-02, PNorm = 652.4732, GNorm = 4.9592, lr_0 = 9.9585e-04
Validation binary_cross_entropy = 0.070204
Epoch 6096
Loss = 9.8633e-02, PNorm = 652.6387, GNorm = 3.2012, lr_0 = 9.9585e-04
Validation binary_cross_entropy = 0.161811
Epoch 6097
Loss = 1.3776e-01, PNorm = 652.7936, GNorm = 4.0615, lr_0 = 9.9585e-04
Validation binary_cross_entropy = 0.085524
Epoch 6098
Loss = 6.0968e-02, PNorm = 652.9142, GNorm = 3.5009, lr_0 = 9.9585e-04
Validation binary_cross_entropy = 0.134673
Epoch 6099
Loss = 1.0172e-01, PNorm = 653.0130, GNorm = 2.0663, lr_0 = 9.9585e-04
Loss = 9.4519e-02, PNorm = 653.0924, GNorm = 2.7113, lr_0 = 9.9585e-04
Validation binary_cross_entropy = 0.084128
Epoch 6100
Loss = 4.3226e-02, PNorm = 653.1771, GNorm = 1.1656, lr_0 = 9.9585e-04
Validation binary_cross_entropy = 0.102727
Epoch 6101
Loss = 4.9422e-02, PNorm = 653.2368, GNorm = 1.1424, lr_0 = 9.9585e-04
Validation binary_cross_entropy = 0.107479
Epoch 6102
Loss = 5.3888e-02, PNorm = 653.2778, GNorm = 1.9930, lr_0 = 9.9585e-04
Validation binary_cross_entropy = 0.082713
Epoch 6103
Loss = 4.8113e-02, PNorm = 653.3269, GNorm = 1.3739, lr_0 = 9.9585e-04
Validation binary_cross_entropy = 0.105713
Epoch 6104
Loss = 5.3250e-02, PNorm = 653.3768, GNorm = 1.4071, lr_0 = 9.9585e-04
Validation binary_cross_entropy = 0.080890
Epoch 6105
Loss = 4.9054e-02, PNorm = 653.4087, GNorm = 0.3500, lr_0 = 9.9585e-04
Validation binary_cross_entropy = 0.087002
Epoch 6106
Loss = 2.2319e-02, PNorm = 653.4602, GNorm = 1.2966, lr_0 = 9.9584e-04
Validation binary_cross_entropy = 0.115331
Epoch 6107
Loss = 2.6324e-02, PNorm = 653.5027, GNorm = 1.9562, lr_0 = 9.9584e-04
Validation binary_cross_entropy = 0.103934
Epoch 6108
Loss = 8.7758e-03, PNorm = 653.5415, GNorm = 0.5758, lr_0 = 9.9584e-04
Validation binary_cross_entropy = 0.147428
Epoch 6109
Loss = 1.7279e-02, PNorm = 653.5779, GNorm = 0.9767, lr_0 = 9.9584e-04
Loss = 9.4413e-02, PNorm = 653.6012, GNorm = 1.8518, lr_0 = 9.9584e-04
Validation binary_cross_entropy = 0.123115
Epoch 6110
Loss = 9.3518e-02, PNorm = 653.6691, GNorm = 0.8552, lr_0 = 9.9584e-04
Validation binary_cross_entropy = 0.102866
Epoch 6111
Loss = 9.0318e-02, PNorm = 653.7585, GNorm = 2.7110, lr_0 = 9.9584e-04
Validation binary_cross_entropy = 0.170795
Epoch 6112
Loss = 1.3570e-01, PNorm = 653.8188, GNorm = 4.5581, lr_0 = 9.9584e-04
Validation binary_cross_entropy = 0.097886
Epoch 6113
Loss = 4.3579e-02, PNorm = 653.9153, GNorm = 1.9845, lr_0 = 9.9584e-04
Validation binary_cross_entropy = 0.102402
Epoch 6114
Loss = 7.5953e-02, PNorm = 653.9864, GNorm = 4.2634, lr_0 = 9.9584e-04
Validation binary_cross_entropy = 0.127861
Epoch 6115
Loss = 5.8598e-02, PNorm = 654.0654, GNorm = 1.3511, lr_0 = 9.9584e-04
Validation binary_cross_entropy = 0.100078
Epoch 6116
Loss = 2.4764e-02, PNorm = 654.1289, GNorm = 3.1003, lr_0 = 9.9584e-04
Validation binary_cross_entropy = 0.122907
Epoch 6117
Loss = 3.0081e-02, PNorm = 654.1913, GNorm = 2.8993, lr_0 = 9.9584e-04
Validation binary_cross_entropy = 0.122020
Epoch 6118
Loss = 5.8187e-02, PNorm = 654.2418, GNorm = 2.7625, lr_0 = 9.9584e-04
Validation binary_cross_entropy = 0.128600
Epoch 6119
Loss = 1.1682e-01, PNorm = 654.2910, GNorm = 8.0399, lr_0 = 9.9583e-04
Loss = 3.7693e-02, PNorm = 654.3273, GNorm = 0.2758, lr_0 = 9.9583e-04
Validation binary_cross_entropy = 0.132779
Epoch 6120
Loss = 1.7267e-02, PNorm = 654.3544, GNorm = 0.5280, lr_0 = 9.9583e-04
Validation binary_cross_entropy = 0.123003
Epoch 6121
Loss = 3.6473e-02, PNorm = 654.3798, GNorm = 0.8227, lr_0 = 9.9583e-04
Validation binary_cross_entropy = 0.116484
Epoch 6122
Loss = 3.0173e-02, PNorm = 654.4084, GNorm = 4.2155, lr_0 = 9.9583e-04
Validation binary_cross_entropy = 0.161390
Epoch 6123
Loss = 8.9158e-02, PNorm = 654.4528, GNorm = 0.0673, lr_0 = 9.9583e-04
Validation binary_cross_entropy = 0.096211
Epoch 6124
Loss = 5.6327e-02, PNorm = 654.5137, GNorm = 2.7685, lr_0 = 9.9583e-04
Validation binary_cross_entropy = 0.072260
Epoch 6125
Loss = 1.0099e-01, PNorm = 654.6243, GNorm = 2.5349, lr_0 = 9.9583e-04
Validation binary_cross_entropy = 0.104010
Epoch 6126
Loss = 3.7100e-02, PNorm = 654.7016, GNorm = 0.9787, lr_0 = 9.9583e-04
Validation binary_cross_entropy = 0.092756
Epoch 6127
Loss = 6.2230e-02, PNorm = 654.7532, GNorm = 1.0842, lr_0 = 9.9583e-04
Validation binary_cross_entropy = 0.107143
Epoch 6128
Loss = 6.6129e-02, PNorm = 654.8043, GNorm = 1.1435, lr_0 = 9.9583e-04
Validation binary_cross_entropy = 0.080154
Epoch 6129
Loss = 9.9591e-03, PNorm = 654.8572, GNorm = 0.2231, lr_0 = 9.9583e-04
Loss = 3.2668e-02, PNorm = 654.8994, GNorm = 1.3044, lr_0 = 9.9583e-04
Validation binary_cross_entropy = 0.075159
Epoch 6130
Loss = 2.0065e-02, PNorm = 654.9399, GNorm = 0.3693, lr_0 = 9.9583e-04
Validation binary_cross_entropy = 0.081813
Epoch 6131
Loss = 1.3124e-02, PNorm = 654.9714, GNorm = 1.7572, lr_0 = 9.9583e-04
Validation binary_cross_entropy = 0.086946
Epoch 6132
Loss = 1.6777e-02, PNorm = 654.9999, GNorm = 0.0593, lr_0 = 9.9582e-04
Validation binary_cross_entropy = 0.091848
Epoch 6133
Loss = 3.5780e-02, PNorm = 655.0228, GNorm = 3.6366, lr_0 = 9.9582e-04
Validation binary_cross_entropy = 0.094110
Epoch 6134
Loss = 2.3626e-02, PNorm = 655.0477, GNorm = 0.0585, lr_0 = 9.9582e-04
Validation binary_cross_entropy = 0.093911
Epoch 6135
Loss = 1.5509e-03, PNorm = 655.0719, GNorm = 0.1104, lr_0 = 9.9582e-04
Validation binary_cross_entropy = 0.087784
Epoch 6136
Loss = 2.9402e-02, PNorm = 655.0923, GNorm = 0.4136, lr_0 = 9.9582e-04
Validation binary_cross_entropy = 0.085320
Epoch 6137
Loss = 7.4548e-03, PNorm = 655.1289, GNorm = 0.8617, lr_0 = 9.9582e-04
Validation binary_cross_entropy = 0.101940
Epoch 6138
Loss = 3.9211e-04, PNorm = 655.1607, GNorm = 0.0230, lr_0 = 9.9582e-04
Validation binary_cross_entropy = 0.102184
Epoch 6139
Loss = 2.8627e-03, PNorm = 655.1745, GNorm = 0.0998, lr_0 = 9.9582e-04
Loss = 3.6815e-02, PNorm = 655.1877, GNorm = 1.8191, lr_0 = 9.9582e-04
Validation binary_cross_entropy = 0.081628
Epoch 6140
Loss = 1.6078e-02, PNorm = 655.2217, GNorm = 3.4159, lr_0 = 9.9582e-04
Validation binary_cross_entropy = 0.081286
Epoch 6141
Loss = 2.3666e-02, PNorm = 655.2680, GNorm = 0.0345, lr_0 = 9.9582e-04
Validation binary_cross_entropy = 0.086565
Epoch 6142
Loss = 1.0851e-02, PNorm = 655.2934, GNorm = 1.8813, lr_0 = 9.9582e-04
Validation binary_cross_entropy = 0.083359
Epoch 6143
Loss = 3.1159e-02, PNorm = 655.3300, GNorm = 1.3745, lr_0 = 9.9582e-04
Validation binary_cross_entropy = 0.094296
Epoch 6144
Loss = 1.4773e-02, PNorm = 655.3629, GNorm = 0.0530, lr_0 = 9.9582e-04
Validation binary_cross_entropy = 0.096149
Epoch 6145
Loss = 4.1661e-02, PNorm = 655.3870, GNorm = 0.1049, lr_0 = 9.9582e-04
Validation binary_cross_entropy = 0.087209
Epoch 6146
Loss = 1.3313e-02, PNorm = 655.4188, GNorm = 0.1250, lr_0 = 9.9581e-04
Validation binary_cross_entropy = 0.095991
Epoch 6147
Loss = 3.0581e-02, PNorm = 655.4526, GNorm = 0.5475, lr_0 = 9.9581e-04
Validation binary_cross_entropy = 0.101041
Epoch 6148
Loss = 7.2429e-03, PNorm = 655.4728, GNorm = 0.1013, lr_0 = 9.9581e-04
Validation binary_cross_entropy = 0.100862
Epoch 6149
Loss = 5.3319e-03, PNorm = 655.4935, GNorm = 0.4548, lr_0 = 9.9581e-04
Loss = 2.7233e-02, PNorm = 655.5101, GNorm = 0.6598, lr_0 = 9.9581e-04
Validation binary_cross_entropy = 0.091446
Epoch 6150
Loss = 1.1932e-02, PNorm = 655.5483, GNorm = 0.1170, lr_0 = 9.9581e-04
Validation binary_cross_entropy = 0.110912
Epoch 6151
Loss = 1.9075e-02, PNorm = 655.5822, GNorm = 0.1267, lr_0 = 9.9581e-04
Validation binary_cross_entropy = 0.137180
Epoch 6152
Loss = 1.2665e-03, PNorm = 655.6071, GNorm = 0.0423, lr_0 = 9.9581e-04
Validation binary_cross_entropy = 0.195697
Epoch 6153
Loss = 4.9706e-03, PNorm = 655.6257, GNorm = 0.0216, lr_0 = 9.9581e-04
Validation binary_cross_entropy = 0.200843
Epoch 6154
Loss = 8.1315e-02, PNorm = 655.6399, GNorm = 0.1359, lr_0 = 9.9581e-04
Validation binary_cross_entropy = 0.130257
Epoch 6155
Loss = 7.9061e-02, PNorm = 655.6595, GNorm = 0.7435, lr_0 = 9.9581e-04
Validation binary_cross_entropy = 0.069972
Epoch 6156
Loss = 5.3701e-02, PNorm = 655.7141, GNorm = 0.8894, lr_0 = 9.9581e-04
Validation binary_cross_entropy = 0.078614
Epoch 6157
Loss = 1.6480e-02, PNorm = 655.7936, GNorm = 0.3013, lr_0 = 9.9581e-04
Validation binary_cross_entropy = 0.078317
Epoch 6158
Loss = 2.6161e-02, PNorm = 655.8540, GNorm = 2.0734, lr_0 = 9.9581e-04
Validation binary_cross_entropy = 0.079763
Epoch 6159
Loss = 2.0258e-03, PNorm = 655.9034, GNorm = 0.0938, lr_0 = 9.9580e-04
Loss = 2.1034e-02, PNorm = 655.9568, GNorm = 0.1259, lr_0 = 9.9580e-04
Validation binary_cross_entropy = 0.091617
Epoch 6160
Loss = 6.7798e-02, PNorm = 655.9991, GNorm = 12.6820, lr_0 = 9.9580e-04
Validation binary_cross_entropy = 0.088995
Epoch 6161
Loss = 2.5160e-02, PNorm = 656.0467, GNorm = 0.1602, lr_0 = 9.9580e-04
Validation binary_cross_entropy = 0.093354
Epoch 6162
Loss = 3.7453e-02, PNorm = 656.1030, GNorm = 1.3174, lr_0 = 9.9580e-04
Validation binary_cross_entropy = 0.086140
Epoch 6163
Loss = 2.0794e-02, PNorm = 656.1460, GNorm = 1.3558, lr_0 = 9.9580e-04
Validation binary_cross_entropy = 0.087629
Epoch 6164
Loss = 1.9530e-02, PNorm = 656.1901, GNorm = 1.3938, lr_0 = 9.9580e-04
Validation binary_cross_entropy = 0.106989
Epoch 6165
Loss = 3.6661e-02, PNorm = 656.2329, GNorm = 0.0714, lr_0 = 9.9580e-04
Validation binary_cross_entropy = 0.106746
Epoch 6166
Loss = 8.0825e-03, PNorm = 656.2842, GNorm = 0.0652, lr_0 = 9.9580e-04
Validation binary_cross_entropy = 0.081279
Epoch 6167
Loss = 1.0184e-01, PNorm = 656.3246, GNorm = 2.2723, lr_0 = 9.9580e-04
Validation binary_cross_entropy = 0.070111
Epoch 6168
Loss = 1.1614e-02, PNorm = 656.3971, GNorm = 0.5367, lr_0 = 9.9580e-04
Validation binary_cross_entropy = 0.071246
Epoch 6169
Loss = 6.4281e-03, PNorm = 656.4499, GNorm = 0.2276, lr_0 = 9.9580e-04
Loss = 8.7803e-03, PNorm = 656.5024, GNorm = 0.1802, lr_0 = 9.9580e-04
Validation binary_cross_entropy = 0.082944
Epoch 6170
Loss = 4.6459e-02, PNorm = 656.5403, GNorm = 3.3440, lr_0 = 9.9580e-04
Validation binary_cross_entropy = 0.086936
Epoch 6171
Loss = 3.1258e-02, PNorm = 656.5900, GNorm = 1.4650, lr_0 = 9.9580e-04
Validation binary_cross_entropy = 0.095366
Epoch 6172
Loss = 6.9073e-02, PNorm = 656.6535, GNorm = 0.1035, lr_0 = 9.9579e-04
Validation binary_cross_entropy = 0.099770
Epoch 6173
Loss = 1.8205e-02, PNorm = 656.6904, GNorm = 1.7407, lr_0 = 9.9579e-04
Validation binary_cross_entropy = 0.088921
Epoch 6174
Loss = 2.1405e-02, PNorm = 656.7159, GNorm = 2.9731, lr_0 = 9.9579e-04
Validation binary_cross_entropy = 0.090905
Epoch 6175
Loss = 1.4671e-02, PNorm = 656.7626, GNorm = 2.0337, lr_0 = 9.9579e-04
Validation binary_cross_entropy = 0.113586
Epoch 6176
Loss = 4.8330e-02, PNorm = 656.7982, GNorm = 1.6908, lr_0 = 9.9579e-04
Validation binary_cross_entropy = 0.088740
Epoch 6177
Loss = 2.2069e-02, PNorm = 656.8249, GNorm = 1.1719, lr_0 = 9.9579e-04
Validation binary_cross_entropy = 0.090394
Epoch 6178
Loss = 1.8908e-02, PNorm = 656.8625, GNorm = 0.0681, lr_0 = 9.9579e-04
Validation binary_cross_entropy = 0.105922
Epoch 6179
Loss = 2.0517e-03, PNorm = 656.8977, GNorm = 0.0699, lr_0 = 9.9579e-04
Loss = 1.4215e-02, PNorm = 656.9286, GNorm = 0.7279, lr_0 = 9.9579e-04
Validation binary_cross_entropy = 0.122110
Epoch 6180
Loss = 4.1820e-02, PNorm = 656.9535, GNorm = 0.9090, lr_0 = 9.9579e-04
Validation binary_cross_entropy = 0.108639
Epoch 6181
Loss = 9.2553e-03, PNorm = 656.9962, GNorm = 0.0194, lr_0 = 9.9579e-04
Validation binary_cross_entropy = 0.114894
Epoch 6182
Loss = 1.8889e-01, PNorm = 657.0438, GNorm = 3.7967, lr_0 = 9.9579e-04
Validation binary_cross_entropy = 0.070334
Epoch 6183
Loss = 3.2024e-02, PNorm = 657.1291, GNorm = 2.5774, lr_0 = 9.9579e-04
Validation binary_cross_entropy = 0.111873
Epoch 6184
Loss = 3.2774e-02, PNorm = 657.2012, GNorm = 2.2920, lr_0 = 9.9579e-04
Validation binary_cross_entropy = 0.095173
Epoch 6185
Loss = 3.0190e-02, PNorm = 657.2523, GNorm = 0.5952, lr_0 = 9.9578e-04
Validation binary_cross_entropy = 0.098988
Epoch 6186
Loss = 5.1192e-02, PNorm = 657.3011, GNorm = 2.5602, lr_0 = 9.9578e-04
Validation binary_cross_entropy = 0.082337
Epoch 6187
Loss = 2.8047e-02, PNorm = 657.3520, GNorm = 0.2457, lr_0 = 9.9578e-04
Validation binary_cross_entropy = 0.079878
Epoch 6188
Loss = 2.5287e-02, PNorm = 657.4040, GNorm = 1.4993, lr_0 = 9.9578e-04
Validation binary_cross_entropy = 0.083144
Epoch 6189
Loss = 1.0426e-02, PNorm = 657.4501, GNorm = 0.5532, lr_0 = 9.9578e-04
Loss = 2.3511e-02, PNorm = 657.4989, GNorm = 0.8506, lr_0 = 9.9578e-04
Validation binary_cross_entropy = 0.101998
Epoch 6190
Loss = 1.2748e-02, PNorm = 657.5395, GNorm = 0.0687, lr_0 = 9.9578e-04
Validation binary_cross_entropy = 0.098872
Epoch 6191
Loss = 4.3850e-02, PNorm = 657.5668, GNorm = 4.5969, lr_0 = 9.9578e-04
Validation binary_cross_entropy = 0.090883
Epoch 6192
Loss = 7.8454e-02, PNorm = 657.6072, GNorm = 3.7396, lr_0 = 9.9578e-04
Validation binary_cross_entropy = 0.094074
Epoch 6193
Loss = 1.8830e-02, PNorm = 657.6604, GNorm = 0.3754, lr_0 = 9.9578e-04
Validation binary_cross_entropy = 0.121087
Epoch 6194
Loss = 5.2678e-02, PNorm = 657.6957, GNorm = 1.1183, lr_0 = 9.9578e-04
Validation binary_cross_entropy = 0.079343
Epoch 6195
Loss = 1.3343e-02, PNorm = 657.7227, GNorm = 0.0245, lr_0 = 9.9578e-04
Validation binary_cross_entropy = 0.075581
Epoch 6196
Loss = 2.6717e-02, PNorm = 657.7705, GNorm = 0.7798, lr_0 = 9.9578e-04
Validation binary_cross_entropy = 0.107132
Epoch 6197
Loss = 7.5754e-03, PNorm = 657.8115, GNorm = 0.4421, lr_0 = 9.9578e-04
Validation binary_cross_entropy = 0.117157
Epoch 6198
Loss = 6.1282e-02, PNorm = 657.8407, GNorm = 0.9290, lr_0 = 9.9578e-04
Validation binary_cross_entropy = 0.081515
Epoch 6199
Loss = 1.3098e-01, PNorm = 657.8690, GNorm = 3.3392, lr_0 = 9.9577e-04
Loss = 6.0154e-02, PNorm = 657.9213, GNorm = 4.0370, lr_0 = 9.9577e-04
Validation binary_cross_entropy = 0.074739
Epoch 6200
Loss = 3.1746e-02, PNorm = 657.9855, GNorm = 0.2455, lr_0 = 9.9577e-04
Validation binary_cross_entropy = 0.107830
Epoch 6201
Loss = 5.7610e-03, PNorm = 658.0223, GNorm = 0.8191, lr_0 = 9.9577e-04
Validation binary_cross_entropy = 0.108095
Epoch 6202
Loss = 7.3007e-01, PNorm = 658.0918, GNorm = 10.9680, lr_0 = 9.9577e-04
Validation binary_cross_entropy = 0.100782
Epoch 6203
Loss = 3.8888e-01, PNorm = 658.4599, GNorm = 5.5806, lr_0 = 9.9577e-04
Validation binary_cross_entropy = 0.133787
Epoch 6204
Loss = 3.5432e-01, PNorm = 658.7434, GNorm = 4.3288, lr_0 = 9.9577e-04
Validation binary_cross_entropy = 0.267627
Epoch 6205
Loss = 2.9776e-01, PNorm = 658.9294, GNorm = 6.9259, lr_0 = 9.9577e-04
Validation binary_cross_entropy = 0.213492
Epoch 6206
Loss = 1.3659e-01, PNorm = 659.0680, GNorm = 1.9190, lr_0 = 9.9577e-04
Validation binary_cross_entropy = 0.203121
Epoch 6207
Loss = 1.2430e-01, PNorm = 659.2016, GNorm = 2.2199, lr_0 = 9.9577e-04
Validation binary_cross_entropy = 0.150151
Epoch 6208
Loss = 1.9499e-01, PNorm = 659.3312, GNorm = 3.4200, lr_0 = 9.9577e-04
Validation binary_cross_entropy = 0.210961
Epoch 6209
Loss = 1.6796e-01, PNorm = 659.4399, GNorm = 4.8092, lr_0 = 9.9577e-04
Loss = 1.1240e-01, PNorm = 659.5249, GNorm = 1.2873, lr_0 = 9.9577e-04
Validation binary_cross_entropy = 0.216930
Epoch 6210
Loss = 1.2193e-01, PNorm = 659.5933, GNorm = 1.1330, lr_0 = 9.9577e-04
Validation binary_cross_entropy = 0.174058
Epoch 6211
Loss = 8.9260e-02, PNorm = 659.6577, GNorm = 2.0071, lr_0 = 9.9576e-04
Validation binary_cross_entropy = 0.181315
Epoch 6212
Loss = 1.0728e-01, PNorm = 659.7235, GNorm = 4.9588, lr_0 = 9.9576e-04
Validation binary_cross_entropy = 0.141151
Epoch 6213
Loss = 1.0815e-01, PNorm = 659.7970, GNorm = 2.9323, lr_0 = 9.9576e-04
Validation binary_cross_entropy = 0.154541
Epoch 6214
Loss = 1.3271e-01, PNorm = 659.8666, GNorm = 0.1712, lr_0 = 9.9576e-04
Validation binary_cross_entropy = 0.127360
Epoch 6215
Loss = 1.1041e-01, PNorm = 659.9359, GNorm = 5.3619, lr_0 = 9.9576e-04
Validation binary_cross_entropy = 0.137093
Epoch 6216
Loss = 3.6456e-02, PNorm = 660.0064, GNorm = 2.4803, lr_0 = 9.9576e-04
Validation binary_cross_entropy = 0.313324
Epoch 6217
Loss = 2.8073e-01, PNorm = 660.0643, GNorm = 20.5266, lr_0 = 9.9576e-04
Validation binary_cross_entropy = 0.490723
Epoch 6218
Loss = 4.9488e-01, PNorm = 660.1974, GNorm = 11.6435, lr_0 = 9.9576e-04
Validation binary_cross_entropy = 0.543746
Epoch 6219
Loss = 4.9737e-01, PNorm = 660.3014, GNorm = 3.1915, lr_0 = 9.9576e-04
Loss = 4.7806e-01, PNorm = 660.3801, GNorm = 3.3580, lr_0 = 9.9576e-04
Validation binary_cross_entropy = 0.328052
Epoch 6220
Loss = 3.8819e-01, PNorm = 660.4540, GNorm = 6.9068, lr_0 = 9.9576e-04
Validation binary_cross_entropy = 0.303065
Epoch 6221
Loss = 3.0011e-01, PNorm = 660.5254, GNorm = 3.7011, lr_0 = 9.9576e-04
Validation binary_cross_entropy = 0.392297
Epoch 6222
Loss = 2.4561e-01, PNorm = 660.5911, GNorm = 4.4435, lr_0 = 9.9576e-04
Validation binary_cross_entropy = 0.178397
Epoch 6223
Loss = 1.7725e-01, PNorm = 660.6574, GNorm = 2.8863, lr_0 = 9.9576e-04
Validation binary_cross_entropy = 0.175809
Epoch 6224
Loss = 1.0202e-01, PNorm = 660.7177, GNorm = 2.1038, lr_0 = 9.9576e-04
Validation binary_cross_entropy = 0.215183
Epoch 6225
Loss = 1.4072e-01, PNorm = 660.7807, GNorm = 1.8680, lr_0 = 9.9575e-04
Validation binary_cross_entropy = 0.157564
Epoch 6226
Loss = 1.6440e-01, PNorm = 660.8452, GNorm = 2.7966, lr_0 = 9.9575e-04
Validation binary_cross_entropy = 0.151231
Epoch 6227
Loss = 8.7186e-02, PNorm = 660.9203, GNorm = 1.9263, lr_0 = 9.9575e-04
Validation binary_cross_entropy = 0.106958
Epoch 6228
Loss = 9.4826e-02, PNorm = 660.9842, GNorm = 1.7215, lr_0 = 9.9575e-04
Validation binary_cross_entropy = 0.204577
Epoch 6229
Loss = 4.6969e-02, PNorm = 661.0586, GNorm = 3.3037, lr_0 = 9.9575e-04
Loss = 1.1619e-01, PNorm = 661.1152, GNorm = 2.5575, lr_0 = 9.9575e-04
Validation binary_cross_entropy = 0.113204
Epoch 6230
Loss = 6.9110e-02, PNorm = 661.1811, GNorm = 1.0378, lr_0 = 9.9575e-04
Validation binary_cross_entropy = 0.165395
Epoch 6231
Loss = 1.0516e-01, PNorm = 661.2373, GNorm = 2.3766, lr_0 = 9.9575e-04
Validation binary_cross_entropy = 0.113820
Epoch 6232
Loss = 1.2544e-01, PNorm = 661.3011, GNorm = 1.4471, lr_0 = 9.9575e-04
Validation binary_cross_entropy = 0.133929
Epoch 6233
Loss = 8.3241e-02, PNorm = 661.3553, GNorm = 3.4699, lr_0 = 9.9575e-04
Validation binary_cross_entropy = 0.112964
Epoch 6234
Loss = 1.1681e-01, PNorm = 661.4098, GNorm = 1.3361, lr_0 = 9.9575e-04
Validation binary_cross_entropy = 0.124432
Epoch 6235
Loss = 4.6527e-02, PNorm = 661.4529, GNorm = 0.7729, lr_0 = 9.9575e-04
Validation binary_cross_entropy = 0.091948
Epoch 6236
Loss = 1.1864e-01, PNorm = 661.4983, GNorm = 0.4885, lr_0 = 9.9575e-04
Validation binary_cross_entropy = 0.101825
Epoch 6237
Loss = 9.5372e-02, PNorm = 661.5558, GNorm = 1.6917, lr_0 = 9.9575e-04
Validation binary_cross_entropy = 0.105647
Epoch 6238
Loss = 4.9537e-02, PNorm = 661.6050, GNorm = 1.1780, lr_0 = 9.9575e-04
Validation binary_cross_entropy = 0.117259
Epoch 6239
Loss = 5.4214e-02, PNorm = 661.6499, GNorm = 2.1354, lr_0 = 9.9574e-04
Loss = 7.2282e-02, PNorm = 661.6969, GNorm = 2.4360, lr_0 = 9.9574e-04
Validation binary_cross_entropy = 0.122656
Epoch 6240
Loss = 6.3909e-02, PNorm = 661.7474, GNorm = 1.2732, lr_0 = 9.9574e-04
Validation binary_cross_entropy = 0.117234
Epoch 6241
Loss = 3.4088e-02, PNorm = 661.7927, GNorm = 1.8970, lr_0 = 9.9574e-04
Validation binary_cross_entropy = 0.114203
Epoch 6242
Loss = 4.9348e-02, PNorm = 661.8256, GNorm = 6.2385, lr_0 = 9.9574e-04
Validation binary_cross_entropy = 0.112656
Epoch 6243
Loss = 7.0022e-02, PNorm = 661.8680, GNorm = 3.4922, lr_0 = 9.9574e-04
Validation binary_cross_entropy = 0.115424
Epoch 6244
Loss = 7.1212e-02, PNorm = 661.9101, GNorm = 2.8161, lr_0 = 9.9574e-04
Validation binary_cross_entropy = 0.108312
Epoch 6245
Loss = 7.8082e-02, PNorm = 661.9551, GNorm = 0.5903, lr_0 = 9.9574e-04
Validation binary_cross_entropy = 0.088719
Epoch 6246
Loss = 3.5804e-02, PNorm = 661.9961, GNorm = 1.9734, lr_0 = 9.9574e-04
Validation binary_cross_entropy = 0.090190
Epoch 6247
Loss = 8.1258e-02, PNorm = 662.0493, GNorm = 0.9173, lr_0 = 9.9574e-04
Validation binary_cross_entropy = 0.098654
Epoch 6248
Loss = 5.4577e-02, PNorm = 662.1081, GNorm = 2.1326, lr_0 = 9.9574e-04
Validation binary_cross_entropy = 0.092484
Epoch 6249
Loss = 3.3641e-02, PNorm = 662.1606, GNorm = 1.5115, lr_0 = 9.9574e-04
Loss = 5.9635e-02, PNorm = 662.2201, GNorm = 5.7944, lr_0 = 9.9574e-04
Validation binary_cross_entropy = 0.105303
Epoch 6250
Loss = 5.6913e-02, PNorm = 662.2879, GNorm = 2.0858, lr_0 = 9.9574e-04
Validation binary_cross_entropy = 0.133409
Epoch 6251
Loss = 5.7195e-02, PNorm = 662.3381, GNorm = 0.2181, lr_0 = 9.9573e-04
Validation binary_cross_entropy = 0.115817
Epoch 6252
Loss = 4.8524e-02, PNorm = 662.3880, GNorm = 0.5387, lr_0 = 9.9573e-04
Validation binary_cross_entropy = 0.126034
Epoch 6253
Loss = 4.0914e-02, PNorm = 662.4291, GNorm = 1.7033, lr_0 = 9.9573e-04
Validation binary_cross_entropy = 0.131329
Epoch 6254
Loss = 5.9791e-02, PNorm = 662.4791, GNorm = 2.6085, lr_0 = 9.9573e-04
Validation binary_cross_entropy = 0.140150
Epoch 6255
Loss = 5.1469e-02, PNorm = 662.5214, GNorm = 0.9263, lr_0 = 9.9573e-04
Validation binary_cross_entropy = 0.148341
Epoch 6256
Loss = 7.4749e-02, PNorm = 662.5548, GNorm = 1.9201, lr_0 = 9.9573e-04
Validation binary_cross_entropy = 0.120336
Epoch 6257
Loss = 7.1408e-02, PNorm = 662.5879, GNorm = 1.4331, lr_0 = 9.9573e-04
Validation binary_cross_entropy = 0.117027
Epoch 6258
Loss = 1.1633e-01, PNorm = 662.6272, GNorm = 0.3108, lr_0 = 9.9573e-04
Validation binary_cross_entropy = 0.160329
Epoch 6259
Loss = 3.1307e-02, PNorm = 662.6644, GNorm = 1.3924, lr_0 = 9.9573e-04
Loss = 4.4616e-02, PNorm = 662.6901, GNorm = 0.8383, lr_0 = 9.9573e-04
Validation binary_cross_entropy = 0.148723
Epoch 6260
Loss = 4.8359e-02, PNorm = 662.7143, GNorm = 1.7225, lr_0 = 9.9573e-04
Validation binary_cross_entropy = 0.111531
Epoch 6261
Loss = 5.4443e-02, PNorm = 662.7461, GNorm = 1.3218, lr_0 = 9.9573e-04
Validation binary_cross_entropy = 0.095849
Epoch 6262
Loss = 2.9383e-02, PNorm = 662.7951, GNorm = 0.5839, lr_0 = 9.9573e-04
Validation binary_cross_entropy = 0.120796
Epoch 6263
Loss = 2.9623e-02, PNorm = 662.8416, GNorm = 1.4412, lr_0 = 9.9573e-04
Validation binary_cross_entropy = 0.134602
Epoch 6264
Loss = 3.3895e-02, PNorm = 662.8706, GNorm = 1.2369, lr_0 = 9.9573e-04
Validation binary_cross_entropy = 0.124203
Epoch 6265
Loss = 1.9320e-02, PNorm = 662.9043, GNorm = 0.2336, lr_0 = 9.9572e-04
Validation binary_cross_entropy = 0.127314
Epoch 6266
Loss = 1.2755e-01, PNorm = 662.9414, GNorm = 4.3867, lr_0 = 9.9572e-04
Validation binary_cross_entropy = 0.108586
Epoch 6267
Loss = 3.0391e-02, PNorm = 662.9867, GNorm = 0.3158, lr_0 = 9.9572e-04
Validation binary_cross_entropy = 0.105917
Epoch 6268
Loss = 1.2772e-01, PNorm = 663.0259, GNorm = 8.0818, lr_0 = 9.9572e-04
Validation binary_cross_entropy = 0.089835
Epoch 6269
Loss = 3.6807e-02, PNorm = 663.0700, GNorm = 1.4898, lr_0 = 9.9572e-04
Loss = 5.7201e-02, PNorm = 663.1145, GNorm = 17.2369, lr_0 = 9.9572e-04
Validation binary_cross_entropy = 0.117817
Epoch 6270
Loss = 4.4513e-02, PNorm = 663.1594, GNorm = 1.8918, lr_0 = 9.9572e-04
Validation binary_cross_entropy = 0.139231
Epoch 6271
Loss = 4.3855e-02, PNorm = 663.1891, GNorm = 0.7268, lr_0 = 9.9572e-04
Validation binary_cross_entropy = 0.102094
Epoch 6272
Loss = 5.4908e-02, PNorm = 663.2206, GNorm = 12.3102, lr_0 = 9.9572e-04
Validation binary_cross_entropy = 0.103552
Epoch 6273
Loss = 2.0498e-02, PNorm = 663.2605, GNorm = 0.4403, lr_0 = 9.9572e-04
Validation binary_cross_entropy = 0.185039
Epoch 6274
Loss = 6.3046e-02, PNorm = 663.3003, GNorm = 7.0859, lr_0 = 9.9572e-04
Validation binary_cross_entropy = 0.148777
Epoch 6275
Loss = 1.0716e-01, PNorm = 663.3406, GNorm = 0.4208, lr_0 = 9.9572e-04
Validation binary_cross_entropy = 0.103150
Epoch 6276
Loss = 5.3585e-02, PNorm = 663.3869, GNorm = 1.9293, lr_0 = 9.9572e-04
Validation binary_cross_entropy = 0.113339
Epoch 6277
Loss = 2.5128e-02, PNorm = 663.4349, GNorm = 1.1914, lr_0 = 9.9572e-04
Validation binary_cross_entropy = 0.122617
Epoch 6278
Loss = 4.0475e-02, PNorm = 663.4692, GNorm = 1.7827, lr_0 = 9.9571e-04
Validation binary_cross_entropy = 0.105199
Epoch 6279
Loss = 3.0557e-02, PNorm = 663.4923, GNorm = 1.5202, lr_0 = 9.9571e-04
Loss = 4.9693e-02, PNorm = 663.5187, GNorm = 0.1815, lr_0 = 9.9571e-04
Validation binary_cross_entropy = 0.107469
Epoch 6280
Loss = 2.7039e-02, PNorm = 663.5518, GNorm = 0.0735, lr_0 = 9.9571e-04
Validation binary_cross_entropy = 0.112180
Epoch 6281
Loss = 3.9737e-02, PNorm = 663.5865, GNorm = 0.1790, lr_0 = 9.9571e-04
Validation binary_cross_entropy = 0.134789
Epoch 6282
Loss = 1.8425e-01, PNorm = 663.6213, GNorm = 1.9993, lr_0 = 9.9571e-04
Validation binary_cross_entropy = 0.110686
Epoch 6283
Loss = 1.5355e-01, PNorm = 663.7347, GNorm = 0.4699, lr_0 = 9.9571e-04
Validation binary_cross_entropy = 0.204878
Epoch 6284
Loss = 6.6582e-02, PNorm = 663.8231, GNorm = 2.0191, lr_0 = 9.9571e-04
Validation binary_cross_entropy = 0.124207
Epoch 6285
Loss = 4.6709e-02, PNorm = 663.8840, GNorm = 1.5175, lr_0 = 9.9571e-04
Validation binary_cross_entropy = 0.149974
Epoch 6286
Loss = 4.4609e-02, PNorm = 663.9343, GNorm = 1.9435, lr_0 = 9.9571e-04
Validation binary_cross_entropy = 0.147778
Epoch 6287
Loss = 8.3464e-02, PNorm = 663.9662, GNorm = 2.1375, lr_0 = 9.9571e-04
Validation binary_cross_entropy = 0.152746
Epoch 6288
Loss = 1.2725e-02, PNorm = 664.0093, GNorm = 0.5118, lr_0 = 9.9571e-04
Validation binary_cross_entropy = 0.176151
Epoch 6289
Loss = 8.2782e-02, PNorm = 664.0476, GNorm = 2.6816, lr_0 = 9.9571e-04
Loss = 4.0259e-02, PNorm = 664.0793, GNorm = 0.1174, lr_0 = 9.9571e-04
Validation binary_cross_entropy = 0.170767
Epoch 6290
Loss = 6.5309e-02, PNorm = 664.1066, GNorm = 5.1309, lr_0 = 9.9571e-04
Validation binary_cross_entropy = 0.159296
Epoch 6291
Loss = 9.4116e-02, PNorm = 664.1485, GNorm = 1.2691, lr_0 = 9.9570e-04
Validation binary_cross_entropy = 0.142237
Epoch 6292
Loss = 4.4765e-02, PNorm = 664.1905, GNorm = 3.6159, lr_0 = 9.9570e-04
Validation binary_cross_entropy = 0.139794
Epoch 6293
Loss = 3.4479e-02, PNorm = 664.2303, GNorm = 1.2457, lr_0 = 9.9570e-04
Validation binary_cross_entropy = 0.133724
Epoch 6294
Loss = 1.4017e-02, PNorm = 664.2620, GNorm = 0.7330, lr_0 = 9.9570e-04
Validation binary_cross_entropy = 0.160693
Epoch 6295
Loss = 1.8915e-02, PNorm = 664.3041, GNorm = 0.4231, lr_0 = 9.9570e-04
Validation binary_cross_entropy = 0.237809
Epoch 6296
Loss = 2.9591e-02, PNorm = 664.3290, GNorm = 1.9679, lr_0 = 9.9570e-04
Validation binary_cross_entropy = 0.154013
Epoch 6297
Loss = 2.8737e-02, PNorm = 664.3501, GNorm = 1.6455, lr_0 = 9.9570e-04
Validation binary_cross_entropy = 0.109245
Epoch 6298
Loss = 5.2064e-02, PNorm = 664.3844, GNorm = 1.0990, lr_0 = 9.9570e-04
Validation binary_cross_entropy = 0.125626
Epoch 6299
Loss = 5.2125e-02, PNorm = 664.4266, GNorm = 2.1551, lr_0 = 9.9570e-04
Loss = 6.7541e-02, PNorm = 664.4617, GNorm = 1.7665, lr_0 = 9.9570e-04
Validation binary_cross_entropy = 0.133476
Epoch 6300
Loss = 3.4022e-02, PNorm = 664.4965, GNorm = 0.1169, lr_0 = 9.9570e-04
Validation binary_cross_entropy = 0.154918
Epoch 6301
Loss = 7.0069e-02, PNorm = 664.5146, GNorm = 6.8740, lr_0 = 9.9570e-04
Validation binary_cross_entropy = 0.112487
Epoch 6302
Loss = 4.3813e-02, PNorm = 664.5666, GNorm = 1.2129, lr_0 = 9.9570e-04
Validation binary_cross_entropy = 0.137214
Epoch 6303
Loss = 5.9753e-02, PNorm = 664.6141, GNorm = 0.1486, lr_0 = 9.9570e-04
Validation binary_cross_entropy = 0.124282
Epoch 6304
Loss = 3.0905e-02, PNorm = 664.6470, GNorm = 0.6565, lr_0 = 9.9570e-04
Validation binary_cross_entropy = 0.113069
Epoch 6305
Loss = 6.4261e-02, PNorm = 664.6798, GNorm = 1.1959, lr_0 = 9.9569e-04
Validation binary_cross_entropy = 0.122733
Epoch 6306
Loss = 1.4532e-02, PNorm = 664.7159, GNorm = 0.0446, lr_0 = 9.9569e-04
Validation binary_cross_entropy = 0.122487
Epoch 6307
Loss = 1.3810e-02, PNorm = 664.7380, GNorm = 0.1982, lr_0 = 9.9569e-04
Validation binary_cross_entropy = 0.122189
Epoch 6308
Loss = 2.5561e-02, PNorm = 664.7600, GNorm = 0.8205, lr_0 = 9.9569e-04
Validation binary_cross_entropy = 0.121125
Epoch 6309
Loss = 1.8731e-01, PNorm = 664.7779, GNorm = 3.7172, lr_0 = 9.9569e-04
Loss = 3.6199e-02, PNorm = 664.7968, GNorm = 2.0348, lr_0 = 9.9569e-04
Validation binary_cross_entropy = 0.105740
Epoch 6310
Loss = 2.2571e-02, PNorm = 664.8334, GNorm = 0.2520, lr_0 = 9.9569e-04
Validation binary_cross_entropy = 0.136973
Epoch 6311
Loss = 3.2480e-02, PNorm = 664.8615, GNorm = 2.0692, lr_0 = 9.9569e-04
Validation binary_cross_entropy = 0.110584
Epoch 6312
Loss = 1.5769e-02, PNorm = 664.8887, GNorm = 0.9903, lr_0 = 9.9569e-04
Validation binary_cross_entropy = 0.118353
Epoch 6313
Loss = 1.2211e-02, PNorm = 664.9149, GNorm = 1.1833, lr_0 = 9.9569e-04
Validation binary_cross_entropy = 0.134733
Epoch 6314
Loss = 1.4489e-02, PNorm = 664.9364, GNorm = 0.7281, lr_0 = 9.9569e-04
Validation binary_cross_entropy = 0.128716
Epoch 6315
Loss = 1.4696e-02, PNorm = 664.9638, GNorm = 0.1663, lr_0 = 9.9569e-04
Validation binary_cross_entropy = 0.129353
Epoch 6316
Loss = 2.6370e-02, PNorm = 664.9870, GNorm = 5.0703, lr_0 = 9.9569e-04
Validation binary_cross_entropy = 0.154249
Epoch 6317
Loss = 5.4451e-03, PNorm = 665.0230, GNorm = 0.2589, lr_0 = 9.9569e-04
Validation binary_cross_entropy = 0.184465
Epoch 6318
Loss = 8.0237e-01, PNorm = 665.0488, GNorm = 1.6263, lr_0 = 9.9568e-04
Validation binary_cross_entropy = 0.113139
Epoch 6319
Loss = 3.4954e-02, PNorm = 665.0859, GNorm = 2.2502, lr_0 = 9.9568e-04
Loss = 7.6482e-02, PNorm = 665.1370, GNorm = 2.2003, lr_0 = 9.9568e-04
Validation binary_cross_entropy = 0.144846
Epoch 6320
Loss = 6.3754e-02, PNorm = 665.1693, GNorm = 1.4058, lr_0 = 9.9568e-04
Validation binary_cross_entropy = 0.098146
Epoch 6321
Loss = 9.3248e-02, PNorm = 665.2091, GNorm = 2.0503, lr_0 = 9.9568e-04
Validation binary_cross_entropy = 0.106729
Epoch 6322
Loss = 2.6641e-02, PNorm = 665.2558, GNorm = 1.7260, lr_0 = 9.9568e-04
Validation binary_cross_entropy = 0.109588
Epoch 6323
Loss = 2.4112e-02, PNorm = 665.2851, GNorm = 1.3534, lr_0 = 9.9568e-04
Validation binary_cross_entropy = 0.101571
Epoch 6324
Loss = 3.1110e-02, PNorm = 665.3182, GNorm = 0.0717, lr_0 = 9.9568e-04
Validation binary_cross_entropy = 0.128889
Epoch 6325
Loss = 4.1663e-02, PNorm = 665.3470, GNorm = 0.4660, lr_0 = 9.9568e-04
Validation binary_cross_entropy = 0.122652
Epoch 6326
Loss = 9.9142e-03, PNorm = 665.3808, GNorm = 0.8484, lr_0 = 9.9568e-04
Validation binary_cross_entropy = 0.114660
Epoch 6327
Loss = 2.4230e-02, PNorm = 665.4059, GNorm = 1.6040, lr_0 = 9.9568e-04
Validation binary_cross_entropy = 0.113544
Epoch 6328
Loss = 8.4494e-03, PNorm = 665.4315, GNorm = 0.1764, lr_0 = 9.9568e-04
Validation binary_cross_entropy = 0.145810
Epoch 6329
Loss = 1.1392e-02, PNorm = 665.4679, GNorm = 0.8624, lr_0 = 9.9568e-04
Loss = 2.8454e-02, PNorm = 665.4928, GNorm = 0.1214, lr_0 = 9.9568e-04
Validation binary_cross_entropy = 0.138161
Epoch 6330
Loss = 2.4764e-02, PNorm = 665.5183, GNorm = 0.5311, lr_0 = 9.9568e-04
Validation binary_cross_entropy = 0.140818
Epoch 6331
Loss = 3.7720e-02, PNorm = 665.5445, GNorm = 0.8014, lr_0 = 9.9567e-04
Validation binary_cross_entropy = 0.128085
Epoch 6332
Loss = 1.8651e-02, PNorm = 665.5684, GNorm = 0.7868, lr_0 = 9.9567e-04
Validation binary_cross_entropy = 0.115625
Epoch 6333
Loss = 3.4359e-02, PNorm = 665.6082, GNorm = 0.5576, lr_0 = 9.9567e-04
Validation binary_cross_entropy = 0.129540
Epoch 6334
Loss = 2.3610e-02, PNorm = 665.6409, GNorm = 1.3106, lr_0 = 9.9567e-04
Validation binary_cross_entropy = 0.123205
Epoch 6335
Loss = 1.2286e-02, PNorm = 665.6708, GNorm = 0.3130, lr_0 = 9.9567e-04
Validation binary_cross_entropy = 0.114374
Epoch 6336
Loss = 6.0071e-02, PNorm = 665.7140, GNorm = 4.4867, lr_0 = 9.9567e-04
Validation binary_cross_entropy = 0.126949
Epoch 6337
Loss = 9.0127e-03, PNorm = 665.7703, GNorm = 0.3033, lr_0 = 9.9567e-04
Validation binary_cross_entropy = 0.124712
Epoch 6338
Loss = 5.8528e-02, PNorm = 665.8055, GNorm = 0.8697, lr_0 = 9.9567e-04
Validation binary_cross_entropy = 0.109741
Epoch 6339
Loss = 2.9158e-02, PNorm = 665.8373, GNorm = 0.9477, lr_0 = 9.9567e-04
Loss = 1.2732e-02, PNorm = 665.8721, GNorm = 0.4496, lr_0 = 9.9567e-04
Validation binary_cross_entropy = 0.122819
Epoch 6340
Loss = 2.5175e-02, PNorm = 665.8942, GNorm = 0.7291, lr_0 = 9.9567e-04
Validation binary_cross_entropy = 0.121261
Epoch 6341
Loss = 3.4089e-02, PNorm = 665.9204, GNorm = 2.0430, lr_0 = 9.9567e-04
Validation binary_cross_entropy = 0.128971
Epoch 6342
Loss = 3.5016e-02, PNorm = 665.9473, GNorm = 1.2998, lr_0 = 9.9567e-04
Validation binary_cross_entropy = 0.117282
Epoch 6343
Loss = 1.4834e-02, PNorm = 665.9732, GNorm = 0.5078, lr_0 = 9.9567e-04
Validation binary_cross_entropy = 0.111393
Epoch 6344
Loss = 2.0330e-02, PNorm = 665.9961, GNorm = 2.5062, lr_0 = 9.9566e-04
Validation binary_cross_entropy = 0.117472
Epoch 6345
Loss = 2.0328e-02, PNorm = 666.0234, GNorm = 1.3924, lr_0 = 9.9566e-04
Validation binary_cross_entropy = 0.125553
Epoch 6346
Loss = 4.6208e-02, PNorm = 666.0568, GNorm = 2.4669, lr_0 = 9.9566e-04
Validation binary_cross_entropy = 0.105576
Epoch 6347
Loss = 2.9665e-02, PNorm = 666.0860, GNorm = 1.0968, lr_0 = 9.9566e-04
Validation binary_cross_entropy = 0.102904
Epoch 6348
Loss = 1.8170e-02, PNorm = 666.1283, GNorm = 1.0941, lr_0 = 9.9566e-04
Validation binary_cross_entropy = 0.130156
Epoch 6349
Loss = 3.8544e-01, PNorm = 666.1704, GNorm = 8.0857, lr_0 = 9.9566e-04
Loss = 2.2137e-02, PNorm = 666.2062, GNorm = 0.6937, lr_0 = 9.9566e-04
Validation binary_cross_entropy = 0.137237
Epoch 6350
Loss = 2.5087e-02, PNorm = 666.2467, GNorm = 0.0748, lr_0 = 9.9566e-04
Validation binary_cross_entropy = 0.192741
Epoch 6351
Loss = 4.9449e-02, PNorm = 666.2939, GNorm = 0.1719, lr_0 = 9.9566e-04
Validation binary_cross_entropy = 0.083858
Epoch 6352
Loss = 2.2474e-02, PNorm = 666.3549, GNorm = 1.4775, lr_0 = 9.9566e-04
Validation binary_cross_entropy = 0.087591
Epoch 6353
Loss = 1.4474e-02, PNorm = 666.4184, GNorm = 0.1841, lr_0 = 9.9566e-04
Validation binary_cross_entropy = 0.114518
Epoch 6354
Loss = 1.4280e-02, PNorm = 666.4602, GNorm = 2.5685, lr_0 = 9.9566e-04
Validation binary_cross_entropy = 0.126103
Epoch 6355
Loss = 7.2139e-03, PNorm = 666.4806, GNorm = 1.6434, lr_0 = 9.9566e-04
Validation binary_cross_entropy = 0.139702
Epoch 6356
Loss = 4.5705e-02, PNorm = 666.5025, GNorm = 2.5044, lr_0 = 9.9566e-04
Validation binary_cross_entropy = 0.115753
Epoch 6357
Loss = 4.3034e-02, PNorm = 666.5185, GNorm = 0.4312, lr_0 = 9.9566e-04
Validation binary_cross_entropy = 0.097600
Epoch 6358
Loss = 1.2888e-02, PNorm = 666.5453, GNorm = 1.1893, lr_0 = 9.9565e-04
Validation binary_cross_entropy = 0.108863
Epoch 6359
Loss = 5.7514e-02, PNorm = 666.5862, GNorm = 2.2165, lr_0 = 9.9565e-04
Loss = 3.7386e-02, PNorm = 666.6116, GNorm = 0.6080, lr_0 = 9.9565e-04
Validation binary_cross_entropy = 0.100906
Epoch 6360
Loss = 6.8149e-02, PNorm = 666.6376, GNorm = 5.5821, lr_0 = 9.9565e-04
Validation binary_cross_entropy = 0.100056
Epoch 6361
Loss = 2.6748e-02, PNorm = 666.6664, GNorm = 0.2854, lr_0 = 9.9565e-04
Validation binary_cross_entropy = 0.100329
Epoch 6362
Loss = 3.0763e-02, PNorm = 666.6952, GNorm = 0.0909, lr_0 = 9.9565e-04
Validation binary_cross_entropy = 0.099788
Epoch 6363
Loss = 1.3790e-02, PNorm = 666.7208, GNorm = 0.3369, lr_0 = 9.9565e-04
Validation binary_cross_entropy = 0.108191
Epoch 6364
Loss = 1.3116e-02, PNorm = 666.7545, GNorm = 1.8361, lr_0 = 9.9565e-04
Validation binary_cross_entropy = 0.184561
Epoch 6365
Loss = 2.0925e-02, PNorm = 666.7899, GNorm = 0.0154, lr_0 = 9.9565e-04
Validation binary_cross_entropy = 0.177873
Epoch 6366
Loss = 1.8203e-02, PNorm = 666.8058, GNorm = 0.2705, lr_0 = 9.9565e-04
Validation binary_cross_entropy = 0.146896
Epoch 6367
Loss = 2.1747e-02, PNorm = 666.8249, GNorm = 0.0624, lr_0 = 9.9565e-04
Validation binary_cross_entropy = 0.164017
Epoch 6368
Loss = 1.9775e-02, PNorm = 666.8579, GNorm = 1.4771, lr_0 = 9.9565e-04
Validation binary_cross_entropy = 0.203339
Epoch 6369
Loss = 3.2157e-02, PNorm = 666.8981, GNorm = 4.5547, lr_0 = 9.9565e-04
Loss = 2.4339e-02, PNorm = 666.9256, GNorm = 0.0327, lr_0 = 9.9565e-04
Validation binary_cross_entropy = 0.188264
Epoch 6370
Loss = 3.5027e-02, PNorm = 666.9440, GNorm = 2.7906, lr_0 = 9.9565e-04
Validation binary_cross_entropy = 0.116798
Epoch 6371
Loss = 2.1546e-02, PNorm = 666.9859, GNorm = 0.1297, lr_0 = 9.9564e-04
Validation binary_cross_entropy = 0.131737
Epoch 6372
Loss = 4.3483e-02, PNorm = 667.0246, GNorm = 1.7304, lr_0 = 9.9564e-04
Validation binary_cross_entropy = 0.123849
Epoch 6373
Loss = 4.8804e-02, PNorm = 667.0551, GNorm = 2.3719, lr_0 = 9.9564e-04
Validation binary_cross_entropy = 0.112726
Epoch 6374
Loss = 2.6267e-02, PNorm = 667.0797, GNorm = 1.0503, lr_0 = 9.9564e-04
Validation binary_cross_entropy = 0.111746
Epoch 6375
Loss = 4.3262e-02, PNorm = 667.1189, GNorm = 0.5935, lr_0 = 9.9564e-04
Validation binary_cross_entropy = 0.126516
Epoch 6376
Loss = 1.8056e-02, PNorm = 667.1660, GNorm = 0.3251, lr_0 = 9.9564e-04
Validation binary_cross_entropy = 0.188588
Epoch 6377
Loss = 1.6062e-01, PNorm = 667.2083, GNorm = 8.5278, lr_0 = 9.9564e-04
Validation binary_cross_entropy = 0.092450
Epoch 6378
Loss = 1.4372e-02, PNorm = 667.2460, GNorm = 0.2518, lr_0 = 9.9564e-04
Validation binary_cross_entropy = 0.098249
Epoch 6379
Loss = 2.2145e-02, PNorm = 667.3035, GNorm = 1.0230, lr_0 = 9.9564e-04
Loss = 3.9635e-02, PNorm = 667.3506, GNorm = 0.2386, lr_0 = 9.9564e-04
Validation binary_cross_entropy = 0.105844
Epoch 6380
Loss = 1.2606e-02, PNorm = 667.3863, GNorm = 0.1526, lr_0 = 9.9564e-04
Validation binary_cross_entropy = 0.109182
Epoch 6381
Loss = 4.5616e-02, PNorm = 667.4121, GNorm = 4.1568, lr_0 = 9.9564e-04
Validation binary_cross_entropy = 0.103495
Epoch 6382
Loss = 1.0163e-02, PNorm = 667.4386, GNorm = 0.1067, lr_0 = 9.9564e-04
Validation binary_cross_entropy = 0.098813
Epoch 6383
Loss = 3.8611e-02, PNorm = 667.4861, GNorm = 0.3863, lr_0 = 9.9564e-04
Validation binary_cross_entropy = 0.124201
Epoch 6384
Loss = 1.0619e-02, PNorm = 667.5337, GNorm = 0.4188, lr_0 = 9.9563e-04
Validation binary_cross_entropy = 0.134962
Epoch 6385
Loss = 1.3912e-02, PNorm = 667.5643, GNorm = 1.4572, lr_0 = 9.9563e-04
Validation binary_cross_entropy = 0.129419
Epoch 6386
Loss = 1.2430e-02, PNorm = 667.5870, GNorm = 0.9467, lr_0 = 9.9563e-04
Validation binary_cross_entropy = 0.130530
Epoch 6387
Loss = 6.6298e-04, PNorm = 667.6030, GNorm = 0.1585, lr_0 = 9.9563e-04
Validation binary_cross_entropy = 0.111431
Epoch 6388
Loss = 1.0037e-03, PNorm = 667.6292, GNorm = 0.0600, lr_0 = 9.9563e-04
Validation binary_cross_entropy = 0.147235
Epoch 6389
Loss = 2.6850e-03, PNorm = 667.6722, GNorm = 0.3334, lr_0 = 9.9563e-04
Loss = 4.7096e-02, PNorm = 667.7093, GNorm = 5.5938, lr_0 = 9.9563e-04
Validation binary_cross_entropy = 0.113458
Epoch 6390
Loss = 5.4895e-02, PNorm = 667.7524, GNorm = 2.7408, lr_0 = 9.9563e-04
Validation binary_cross_entropy = 0.103351
Epoch 6391
Loss = 1.4900e-02, PNorm = 667.8118, GNorm = 0.9541, lr_0 = 9.9563e-04
Validation binary_cross_entropy = 0.117144
Epoch 6392
Loss = 1.1730e-01, PNorm = 667.8437, GNorm = 2.8147, lr_0 = 9.9563e-04
Validation binary_cross_entropy = 0.097168
Epoch 6393
Loss = 2.8197e-02, PNorm = 667.8769, GNorm = 1.1807, lr_0 = 9.9563e-04
Validation binary_cross_entropy = 0.094589
Epoch 6394
Loss = 3.6266e-02, PNorm = 667.9126, GNorm = 1.6779, lr_0 = 9.9563e-04
Validation binary_cross_entropy = 0.099168
Epoch 6395
Loss = 6.6147e-02, PNorm = 667.9468, GNorm = 2.8408, lr_0 = 9.9563e-04
Validation binary_cross_entropy = 0.095407
Epoch 6396
Loss = 2.2165e-02, PNorm = 667.9762, GNorm = 1.6628, lr_0 = 9.9563e-04
Validation binary_cross_entropy = 0.104344
Epoch 6397
Loss = 4.2504e-03, PNorm = 668.0047, GNorm = 0.3280, lr_0 = 9.9563e-04
Validation binary_cross_entropy = 0.104012
Epoch 6398
Loss = 2.5473e-03, PNorm = 668.0361, GNorm = 0.1924, lr_0 = 9.9562e-04
Validation binary_cross_entropy = 0.105781
Epoch 6399
Loss = 8.3358e-02, PNorm = 668.0840, GNorm = 4.0408, lr_0 = 9.9562e-04
Loss = 4.0831e-02, PNorm = 668.1218, GNorm = 0.4529, lr_0 = 9.9562e-04
Validation binary_cross_entropy = 0.112048
Epoch 6400
Loss = 2.2036e-02, PNorm = 668.1464, GNorm = 0.9734, lr_0 = 9.9562e-04
Validation binary_cross_entropy = 0.105030
Epoch 6401
Loss = 1.1447e-02, PNorm = 668.1730, GNorm = 2.7494, lr_0 = 9.9562e-04
Validation binary_cross_entropy = 0.103093
Epoch 6402
Loss = 4.7021e-02, PNorm = 668.2061, GNorm = 1.9774, lr_0 = 9.9562e-04
Validation binary_cross_entropy = 0.102303
Epoch 6403
Loss = 1.2779e-02, PNorm = 668.2446, GNorm = 0.8108, lr_0 = 9.9562e-04
Validation binary_cross_entropy = 0.113000
Epoch 6404
Loss = 1.0514e-02, PNorm = 668.2742, GNorm = 0.3560, lr_0 = 9.9562e-04
Validation binary_cross_entropy = 0.119393
Epoch 6405
Loss = 6.3302e-02, PNorm = 668.2938, GNorm = 2.3756, lr_0 = 9.9562e-04
Validation binary_cross_entropy = 0.105972
Epoch 6406
Loss = 6.5843e-03, PNorm = 668.3240, GNorm = 0.1608, lr_0 = 9.9562e-04
Validation binary_cross_entropy = 0.111571
Epoch 6407
Loss = 8.2097e-03, PNorm = 668.3558, GNorm = 0.1524, lr_0 = 9.9562e-04
Validation binary_cross_entropy = 0.121958
Epoch 6408
Loss = 3.4051e-03, PNorm = 668.3786, GNorm = 0.1278, lr_0 = 9.9562e-04
Validation binary_cross_entropy = 0.118653
Epoch 6409
Loss = 2.1489e-02, PNorm = 668.4063, GNorm = 1.3866, lr_0 = 9.9562e-04
Loss = 1.4613e-02, PNorm = 668.4368, GNorm = 4.0081, lr_0 = 9.9562e-04
Validation binary_cross_entropy = 0.126221
Epoch 6410
Loss = 3.9190e-02, PNorm = 668.4618, GNorm = 3.5691, lr_0 = 9.9561e-04
Validation binary_cross_entropy = 0.117160
Epoch 6411
Loss = 3.7092e-02, PNorm = 668.5017, GNorm = 1.9792, lr_0 = 9.9561e-04
Validation binary_cross_entropy = 0.118123
Epoch 6412
Loss = 3.2160e-02, PNorm = 668.5384, GNorm = 0.4299, lr_0 = 9.9561e-04
Validation binary_cross_entropy = 0.105036
Epoch 6413
Loss = 1.7368e-02, PNorm = 668.5784, GNorm = 0.3472, lr_0 = 9.9561e-04
Validation binary_cross_entropy = 0.102147
Epoch 6414
Loss = 2.2368e-02, PNorm = 668.6234, GNorm = 0.3968, lr_0 = 9.9561e-04
Validation binary_cross_entropy = 0.115807
Epoch 6415
Loss = 1.5094e-02, PNorm = 668.6582, GNorm = 1.6141, lr_0 = 9.9561e-04
Validation binary_cross_entropy = 0.125411
Epoch 6416
Loss = 9.1087e-03, PNorm = 668.6837, GNorm = 0.2355, lr_0 = 9.9561e-04
Validation binary_cross_entropy = 0.115040
Epoch 6417
Loss = 4.5830e-02, PNorm = 668.7046, GNorm = 0.0573, lr_0 = 9.9561e-04
Validation binary_cross_entropy = 0.108331
Epoch 6418
Loss = 4.2849e-03, PNorm = 668.7328, GNorm = 0.3730, lr_0 = 9.9561e-04
Validation binary_cross_entropy = 0.108410
Epoch 6419
Loss = 3.4762e-03, PNorm = 668.7648, GNorm = 0.4856, lr_0 = 9.9561e-04
Loss = 6.0719e-03, PNorm = 668.8009, GNorm = 1.8299, lr_0 = 9.9561e-04
Validation binary_cross_entropy = 0.115743
Epoch 6420
Loss = 2.5208e-02, PNorm = 668.8278, GNorm = 0.1729, lr_0 = 9.9561e-04
Validation binary_cross_entropy = 0.109236
Epoch 6421
Loss = 4.0699e-02, PNorm = 668.8690, GNorm = 0.4802, lr_0 = 9.9561e-04
Validation binary_cross_entropy = 0.114651
Epoch 6422
Loss = 7.7183e-03, PNorm = 668.9041, GNorm = 0.0757, lr_0 = 9.9561e-04
Validation binary_cross_entropy = 0.120151
Epoch 6423
Loss = 3.8003e-03, PNorm = 668.9347, GNorm = 0.1269, lr_0 = 9.9561e-04
Validation binary_cross_entropy = 0.104870
Epoch 6424
Loss = 6.0224e-02, PNorm = 668.9634, GNorm = 1.1756, lr_0 = 9.9560e-04
Validation binary_cross_entropy = 0.081683
Epoch 6425
Loss = 5.6473e-02, PNorm = 669.0536, GNorm = 1.3479, lr_0 = 9.9560e-04
Validation binary_cross_entropy = 0.127556
Epoch 6426
Loss = 1.9463e-02, PNorm = 669.1258, GNorm = 1.4527, lr_0 = 9.9560e-04
Validation binary_cross_entropy = 0.114352
Epoch 6427
Loss = 1.8505e-02, PNorm = 669.1641, GNorm = 0.5617, lr_0 = 9.9560e-04
Validation binary_cross_entropy = 0.111199
Epoch 6428
Loss = 8.9602e-02, PNorm = 669.2147, GNorm = 7.8442, lr_0 = 9.9560e-04
Validation binary_cross_entropy = 0.122142
Epoch 6429
Loss = 2.9801e-03, PNorm = 669.2576, GNorm = 0.1406, lr_0 = 9.9560e-04
Loss = 2.1306e-02, PNorm = 669.2913, GNorm = 1.8922, lr_0 = 9.9560e-04
Validation binary_cross_entropy = 0.103227
Epoch 6430
Loss = 3.8099e-02, PNorm = 669.3290, GNorm = 0.3562, lr_0 = 9.9560e-04
Validation binary_cross_entropy = 0.110765
Epoch 6431
Loss = 1.7569e-02, PNorm = 669.3576, GNorm = 0.1884, lr_0 = 9.9560e-04
Validation binary_cross_entropy = 0.106331
Epoch 6432
Loss = 3.1136e-02, PNorm = 669.3866, GNorm = 0.7709, lr_0 = 9.9560e-04
Validation binary_cross_entropy = 0.087865
Epoch 6433
Loss = 1.8827e-01, PNorm = 669.5351, GNorm = 19.6276, lr_0 = 9.9560e-04
Validation binary_cross_entropy = 0.374364
Epoch 6434
Loss = 2.1599e-01, PNorm = 669.7226, GNorm = 2.3375, lr_0 = 9.9560e-04
Validation binary_cross_entropy = 0.084261
Epoch 6435
Loss = 1.0005e-01, PNorm = 669.8411, GNorm = 2.1182, lr_0 = 9.9560e-04
Validation binary_cross_entropy = 0.114641
Epoch 6436
Loss = 3.4063e-02, PNorm = 669.9301, GNorm = 1.8877, lr_0 = 9.9560e-04
Validation binary_cross_entropy = 0.112598
Epoch 6437
Loss = 5.9848e-02, PNorm = 669.9941, GNorm = 0.4025, lr_0 = 9.9559e-04
Validation binary_cross_entropy = 0.083218
Epoch 6438
Loss = 1.6676e-02, PNorm = 670.0474, GNorm = 0.7293, lr_0 = 9.9559e-04
Validation binary_cross_entropy = 0.092443
Epoch 6439
Loss = 1.8666e-02, PNorm = 670.0931, GNorm = 0.6775, lr_0 = 9.9559e-04
Loss = 5.7661e-02, PNorm = 670.1276, GNorm = 1.3535, lr_0 = 9.9559e-04
Validation binary_cross_entropy = 0.089391
Epoch 6440
Loss = 3.2064e-02, PNorm = 670.1658, GNorm = 3.4259, lr_0 = 9.9559e-04
Validation binary_cross_entropy = 0.096858
Epoch 6441
Loss = 6.3715e-02, PNorm = 670.2013, GNorm = 6.6406, lr_0 = 9.9559e-04
Validation binary_cross_entropy = 0.086045
Epoch 6442
Loss = 1.4379e-02, PNorm = 670.2406, GNorm = 0.2782, lr_0 = 9.9559e-04
Validation binary_cross_entropy = 0.094623
Epoch 6443
Loss = 5.9659e-02, PNorm = 670.2989, GNorm = 0.2777, lr_0 = 9.9559e-04
Validation binary_cross_entropy = 0.231690
Epoch 6444
Loss = 4.0323e-02, PNorm = 670.3462, GNorm = 2.3060, lr_0 = 9.9559e-04
Validation binary_cross_entropy = 0.109153
Epoch 6445
Loss = 4.1516e-02, PNorm = 670.3807, GNorm = 0.0948, lr_0 = 9.9559e-04
Validation binary_cross_entropy = 0.097585
Epoch 6446
Loss = 1.5909e-02, PNorm = 670.4332, GNorm = 0.1198, lr_0 = 9.9559e-04
Validation binary_cross_entropy = 0.111633
Epoch 6447
Loss = 2.2534e-02, PNorm = 670.4734, GNorm = 1.1716, lr_0 = 9.9559e-04
Validation binary_cross_entropy = 0.103460
Epoch 6448
Loss = 6.8477e-03, PNorm = 670.4973, GNorm = 0.1508, lr_0 = 9.9559e-04
Validation binary_cross_entropy = 0.123810
Epoch 6449
Loss = 4.6962e-03, PNorm = 670.5274, GNorm = 0.6017, lr_0 = 9.9559e-04
Loss = 3.4798e-02, PNorm = 670.5660, GNorm = 0.3106, lr_0 = 9.9559e-04
Validation binary_cross_entropy = 0.107752
Epoch 6450
Loss = 1.2338e-02, PNorm = 670.6152, GNorm = 0.8033, lr_0 = 9.9558e-04
Validation binary_cross_entropy = 0.112220
Epoch 6451
Loss = 8.6169e-03, PNorm = 670.6572, GNorm = 0.4565, lr_0 = 9.9558e-04
Validation binary_cross_entropy = 0.123854
Epoch 6452
Loss = 6.3077e-02, PNorm = 670.6851, GNorm = 0.4990, lr_0 = 9.9558e-04
Validation binary_cross_entropy = 0.089664
Epoch 6453
Loss = 9.4745e-02, PNorm = 670.7223, GNorm = 2.6160, lr_0 = 9.9558e-04
Validation binary_cross_entropy = 0.087323
Epoch 6454
Loss = 2.6238e-02, PNorm = 670.7678, GNorm = 1.5169, lr_0 = 9.9558e-04
Validation binary_cross_entropy = 0.086101
Epoch 6455
Loss = 7.3709e-03, PNorm = 670.7995, GNorm = 0.1834, lr_0 = 9.9558e-04
Validation binary_cross_entropy = 0.078938
Epoch 6456
Loss = 1.6240e-02, PNorm = 670.8241, GNorm = 0.1721, lr_0 = 9.9558e-04
Validation binary_cross_entropy = 0.079136
Epoch 6457
Loss = 2.0214e-02, PNorm = 670.8555, GNorm = 2.4240, lr_0 = 9.9558e-04
Validation binary_cross_entropy = 0.095433
Epoch 6458
Loss = 2.7445e-02, PNorm = 670.8960, GNorm = 0.3171, lr_0 = 9.9558e-04
Validation binary_cross_entropy = 0.099923
Epoch 6459
Loss = 1.7777e-01, PNorm = 670.9296, GNorm = 3.1763, lr_0 = 9.9558e-04
Loss = 1.8458e-02, PNorm = 670.9676, GNorm = 1.9957, lr_0 = 9.9558e-04
Validation binary_cross_entropy = 0.094895
Epoch 6460
Loss = 1.2398e-02, PNorm = 671.0099, GNorm = 0.1108, lr_0 = 9.9558e-04
Validation binary_cross_entropy = 0.115573
Epoch 6461
Loss = 3.7265e-02, PNorm = 671.0360, GNorm = 1.1106, lr_0 = 9.9558e-04
Validation binary_cross_entropy = 0.110746
Epoch 6462
Loss = 5.2626e-03, PNorm = 671.0603, GNorm = 0.0721, lr_0 = 9.9558e-04
Validation binary_cross_entropy = 0.114340
Epoch 6463
Loss = 9.2884e-02, PNorm = 671.0808, GNorm = 1.3770, lr_0 = 9.9558e-04
Validation binary_cross_entropy = 0.136945
Epoch 6464
Loss = 1.3646e-02, PNorm = 671.1273, GNorm = 0.0149, lr_0 = 9.9557e-04
Validation binary_cross_entropy = 0.103497
Epoch 6465
Loss = 1.3779e-02, PNorm = 671.1674, GNorm = 0.9998, lr_0 = 9.9557e-04
Validation binary_cross_entropy = 0.099463
Epoch 6466
Loss = 1.1644e-02, PNorm = 671.2021, GNorm = 1.2502, lr_0 = 9.9557e-04
Validation binary_cross_entropy = 0.105193
Epoch 6467
Loss = 1.9384e-02, PNorm = 671.2323, GNorm = 3.1627, lr_0 = 9.9557e-04
Validation binary_cross_entropy = 0.092234
Epoch 6468
Loss = 3.0186e-02, PNorm = 671.2728, GNorm = 0.7266, lr_0 = 9.9557e-04
Validation binary_cross_entropy = 0.080498
Epoch 6469
Loss = 4.9601e-03, PNorm = 671.3267, GNorm = 0.2832, lr_0 = 9.9557e-04
Loss = 4.4912e-02, PNorm = 671.3755, GNorm = 3.8321, lr_0 = 9.9557e-04
Validation binary_cross_entropy = 0.082854
Epoch 6470
Loss = 2.2521e-02, PNorm = 671.4329, GNorm = 0.1218, lr_0 = 9.9557e-04
Validation binary_cross_entropy = 0.109061
Epoch 6471
Loss = 6.2522e-02, PNorm = 671.4820, GNorm = 3.6357, lr_0 = 9.9557e-04
Validation binary_cross_entropy = 0.130538
Epoch 6472
Loss = 2.7936e-02, PNorm = 671.5091, GNorm = 0.5316, lr_0 = 9.9557e-04
Validation binary_cross_entropy = 0.095352
Epoch 6473
Loss = 1.3753e-02, PNorm = 671.5425, GNorm = 0.6822, lr_0 = 9.9557e-04
Validation binary_cross_entropy = 0.095201
Epoch 6474
Loss = 1.6775e-02, PNorm = 671.5780, GNorm = 0.6236, lr_0 = 9.9557e-04
Validation binary_cross_entropy = 0.114868
Epoch 6475
Loss = 9.3048e-03, PNorm = 671.6078, GNorm = 0.1364, lr_0 = 9.9557e-04
Validation binary_cross_entropy = 0.110648
Epoch 6476
Loss = 1.2518e-02, PNorm = 671.6336, GNorm = 1.0434, lr_0 = 9.9557e-04
Validation binary_cross_entropy = 0.105673
Epoch 6477
Loss = 1.4848e-01, PNorm = 671.6536, GNorm = 5.7297, lr_0 = 9.9556e-04
Validation binary_cross_entropy = 0.083627
Epoch 6478
Loss = 2.6092e-02, PNorm = 671.6762, GNorm = 1.0988, lr_0 = 9.9556e-04
Validation binary_cross_entropy = 0.097484
Epoch 6479
Loss = 5.5769e-02, PNorm = 671.7326, GNorm = 1.8979, lr_0 = 9.9556e-04
Loss = 3.2015e-02, PNorm = 671.7736, GNorm = 0.9216, lr_0 = 9.9556e-04
Validation binary_cross_entropy = 0.095778
Epoch 6480
Loss = 2.6838e-02, PNorm = 671.8022, GNorm = 0.2003, lr_0 = 9.9556e-04
Validation binary_cross_entropy = 0.083175
Epoch 6481
Loss = 4.3730e-02, PNorm = 671.8320, GNorm = 1.0715, lr_0 = 9.9556e-04
Validation binary_cross_entropy = 0.083631
Epoch 6482
Loss = 1.0894e-02, PNorm = 671.8879, GNorm = 0.2249, lr_0 = 9.9556e-04
Validation binary_cross_entropy = 0.092764
Epoch 6483
Loss = 6.9627e-02, PNorm = 671.9187, GNorm = 1.6633, lr_0 = 9.9556e-04
Validation binary_cross_entropy = 0.085306
Epoch 6484
Loss = 2.1084e-02, PNorm = 671.9461, GNorm = 3.5564, lr_0 = 9.9556e-04
Validation binary_cross_entropy = 0.083982
Epoch 6485
Loss = 1.1192e-02, PNorm = 671.9829, GNorm = 1.8754, lr_0 = 9.9556e-04
Validation binary_cross_entropy = 0.095734
Epoch 6486
Loss = 1.1371e-02, PNorm = 672.0199, GNorm = 1.1982, lr_0 = 9.9556e-04
Validation binary_cross_entropy = 0.097714
Epoch 6487
Loss = 1.1068e-02, PNorm = 672.0521, GNorm = 1.3891, lr_0 = 9.9556e-04
Validation binary_cross_entropy = 0.109840
Epoch 6488
Loss = 3.1970e-02, PNorm = 672.0889, GNorm = 1.7275, lr_0 = 9.9556e-04
Validation binary_cross_entropy = 0.099399
Epoch 6489
Loss = 1.0061e-02, PNorm = 672.1253, GNorm = 0.6927, lr_0 = 9.9556e-04
Loss = 8.7351e-03, PNorm = 672.1557, GNorm = 2.0860, lr_0 = 9.9556e-04
Validation binary_cross_entropy = 0.105915
Epoch 6490
Loss = 1.9781e-02, PNorm = 672.1892, GNorm = 0.0911, lr_0 = 9.9555e-04
Validation binary_cross_entropy = 0.100354
Epoch 6491
Loss = 1.9762e-02, PNorm = 672.2378, GNorm = 0.0793, lr_0 = 9.9555e-04
Validation binary_cross_entropy = 0.106797
Epoch 6492
Loss = 2.7353e-02, PNorm = 672.2830, GNorm = 0.2323, lr_0 = 9.9555e-04
Validation binary_cross_entropy = 0.113075
Epoch 6493
Loss = 7.0257e-02, PNorm = 672.3240, GNorm = 5.6049, lr_0 = 9.9555e-04
Validation binary_cross_entropy = 0.098870
Epoch 6494
Loss = 5.3389e-02, PNorm = 672.3640, GNorm = 3.4235, lr_0 = 9.9555e-04
Validation binary_cross_entropy = 0.098019
Epoch 6495
Loss = 3.1564e-02, PNorm = 672.4426, GNorm = 0.6005, lr_0 = 9.9555e-04
Validation binary_cross_entropy = 0.116455
Epoch 6496
Loss = 1.5639e-02, PNorm = 672.5016, GNorm = 1.1901, lr_0 = 9.9555e-04
Validation binary_cross_entropy = 0.100518
Epoch 6497
Loss = 1.1262e-02, PNorm = 672.5418, GNorm = 0.1797, lr_0 = 9.9555e-04
Validation binary_cross_entropy = 0.100570
Epoch 6498
Loss = 1.3056e-02, PNorm = 672.5760, GNorm = 1.4608, lr_0 = 9.9555e-04
Validation binary_cross_entropy = 0.097225
Epoch 6499
Loss = 2.0464e-03, PNorm = 672.6112, GNorm = 0.2015, lr_0 = 9.9555e-04
Loss = 3.8138e-02, PNorm = 672.6561, GNorm = 0.3015, lr_0 = 9.9555e-04
Validation binary_cross_entropy = 0.089946
Epoch 6500
Loss = 2.6798e-02, PNorm = 672.7048, GNorm = 0.0458, lr_0 = 9.9555e-04
Validation binary_cross_entropy = 0.097389
Epoch 6501
Loss = 1.6859e-02, PNorm = 672.7438, GNorm = 0.1895, lr_0 = 9.9555e-04
Validation binary_cross_entropy = 0.087857
Epoch 6502
Loss = 8.3599e-03, PNorm = 672.7742, GNorm = 0.2287, lr_0 = 9.9555e-04
Validation binary_cross_entropy = 0.088056
Epoch 6503
Loss = 3.6952e-02, PNorm = 672.8086, GNorm = 0.4309, lr_0 = 9.9554e-04
Validation binary_cross_entropy = 0.091571
Epoch 6504
Loss = 5.0394e-03, PNorm = 672.8450, GNorm = 0.3114, lr_0 = 9.9554e-04
Validation binary_cross_entropy = 0.098993
Epoch 6505
Loss = 1.1431e-02, PNorm = 672.8832, GNorm = 0.3068, lr_0 = 9.9554e-04
Validation binary_cross_entropy = 0.105918
Epoch 6506
Loss = 2.1451e-01, PNorm = 672.9119, GNorm = 0.5706, lr_0 = 9.9554e-04
Validation binary_cross_entropy = 0.080824
Epoch 6507
Loss = 1.9534e-02, PNorm = 672.9765, GNorm = 0.4725, lr_0 = 9.9554e-04
Validation binary_cross_entropy = 0.090631
Epoch 6508
Loss = 9.0104e-02, PNorm = 673.0333, GNorm = 0.8494, lr_0 = 9.9554e-04
Validation binary_cross_entropy = 0.094105
Epoch 6509
Loss = 1.1893e-02, PNorm = 673.0720, GNorm = 3.1630, lr_0 = 9.9554e-04
Loss = 2.2873e-02, PNorm = 673.1068, GNorm = 0.3696, lr_0 = 9.9554e-04
Validation binary_cross_entropy = 0.094276
Epoch 6510
Loss = 5.8522e-02, PNorm = 673.1408, GNorm = 1.1389, lr_0 = 9.9554e-04
Validation binary_cross_entropy = 0.088822
Epoch 6511
Loss = 1.9532e-02, PNorm = 673.1784, GNorm = 0.8373, lr_0 = 9.9554e-04
Validation binary_cross_entropy = 0.091614
Epoch 6512
Loss = 1.8622e-02, PNorm = 673.2328, GNorm = 0.8162, lr_0 = 9.9554e-04
Validation binary_cross_entropy = 0.107018
Epoch 6513
Loss = 4.6005e-02, PNorm = 673.2647, GNorm = 1.7526, lr_0 = 9.9554e-04
Validation binary_cross_entropy = 0.098197
Epoch 6514
Loss = 7.3143e-02, PNorm = 673.2935, GNorm = 2.7007, lr_0 = 9.9554e-04
Validation binary_cross_entropy = 0.088744
Epoch 6515
Loss = 2.6255e-02, PNorm = 673.3316, GNorm = 1.1696, lr_0 = 9.9554e-04
Validation binary_cross_entropy = 0.095195
Epoch 6516
Loss = 8.2511e-02, PNorm = 673.3875, GNorm = 11.3460, lr_0 = 9.9554e-04
Validation binary_cross_entropy = 0.096853
Epoch 6517
Loss = 1.8703e-02, PNorm = 673.4304, GNorm = 1.6612, lr_0 = 9.9553e-04
Validation binary_cross_entropy = 0.100298
Epoch 6518
Loss = 9.3082e-02, PNorm = 673.4714, GNorm = 3.6654, lr_0 = 9.9553e-04
Validation binary_cross_entropy = 0.126867
Epoch 6519
Loss = 1.7279e-02, PNorm = 673.5139, GNorm = 1.0364, lr_0 = 9.9553e-04
Loss = 1.3224e-02, PNorm = 673.5369, GNorm = 1.0723, lr_0 = 9.9553e-04
Validation binary_cross_entropy = 0.111621
Epoch 6520
Loss = 1.4097e-02, PNorm = 673.5603, GNorm = 0.4086, lr_0 = 9.9553e-04
Validation binary_cross_entropy = 0.106969
Epoch 6521
Loss = 1.8275e-02, PNorm = 673.5987, GNorm = 0.2529, lr_0 = 9.9553e-04
Validation binary_cross_entropy = 0.104628
Epoch 6522
Loss = 3.2179e-02, PNorm = 673.6519, GNorm = 0.1943, lr_0 = 9.9553e-04
Validation binary_cross_entropy = 0.100941
Epoch 6523
Loss = 4.7418e-02, PNorm = 673.7035, GNorm = 2.8158, lr_0 = 9.9553e-04
Validation binary_cross_entropy = 0.103739
Epoch 6524
Loss = 9.1698e-03, PNorm = 673.7430, GNorm = 0.1602, lr_0 = 9.9553e-04
Validation binary_cross_entropy = 0.096619
Epoch 6525
Loss = 1.9102e-02, PNorm = 673.7848, GNorm = 0.2506, lr_0 = 9.9553e-04
Validation binary_cross_entropy = 0.105947
Epoch 6526
Loss = 1.4752e-02, PNorm = 673.8315, GNorm = 1.2753, lr_0 = 9.9553e-04
Validation binary_cross_entropy = 0.113678
Epoch 6527
Loss = 3.5099e-02, PNorm = 673.8625, GNorm = 0.1469, lr_0 = 9.9553e-04
Validation binary_cross_entropy = 0.118106
Epoch 6528
Loss = 1.4877e-02, PNorm = 673.9091, GNorm = 0.1627, lr_0 = 9.9553e-04
Validation binary_cross_entropy = 0.110316
Epoch 6529
Loss = 6.2393e-04, PNorm = 673.9555, GNorm = 0.0592, lr_0 = 9.9553e-04
Loss = 2.3516e-02, PNorm = 673.9975, GNorm = 0.1693, lr_0 = 9.9553e-04
Validation binary_cross_entropy = 0.103776
Epoch 6530
Loss = 2.1185e-02, PNorm = 674.0422, GNorm = 0.9960, lr_0 = 9.9552e-04
Validation binary_cross_entropy = 0.101694
Epoch 6531
Loss = 3.1848e-02, PNorm = 674.0903, GNorm = 0.3923, lr_0 = 9.9552e-04
Validation binary_cross_entropy = 0.164913
Epoch 6532
Loss = 5.1303e-02, PNorm = 674.1333, GNorm = 0.8306, lr_0 = 9.9552e-04
Validation binary_cross_entropy = 0.105429
Epoch 6533
Loss = 1.1362e-02, PNorm = 674.1676, GNorm = 0.1437, lr_0 = 9.9552e-04
Validation binary_cross_entropy = 0.106218
Epoch 6534
Loss = 4.2897e-02, PNorm = 674.2102, GNorm = 0.4261, lr_0 = 9.9552e-04
Validation binary_cross_entropy = 0.129688
Epoch 6535
Loss = 8.6247e-03, PNorm = 674.2530, GNorm = 0.0895, lr_0 = 9.9552e-04
Validation binary_cross_entropy = 0.149775
Epoch 6536
Loss = 3.0911e-02, PNorm = 674.2825, GNorm = 3.3002, lr_0 = 9.9552e-04
Validation binary_cross_entropy = 0.115850
Epoch 6537
Loss = 5.6660e-02, PNorm = 674.3149, GNorm = 0.4953, lr_0 = 9.9552e-04
Validation binary_cross_entropy = 0.122339
Epoch 6538
Loss = 3.0355e-02, PNorm = 674.3980, GNorm = 2.2426, lr_0 = 9.9552e-04
Validation binary_cross_entropy = 0.162229
Epoch 6539
Loss = 9.1668e-03, PNorm = 674.4709, GNorm = 0.9842, lr_0 = 9.9552e-04
Loss = 1.7246e-02, PNorm = 674.5208, GNorm = 0.1970, lr_0 = 9.9552e-04
Validation binary_cross_entropy = 0.172608
Epoch 6540
Loss = 5.8654e-02, PNorm = 674.5532, GNorm = 4.3463, lr_0 = 9.9552e-04
Validation binary_cross_entropy = 0.089492
Epoch 6541
Loss = 6.0768e-02, PNorm = 674.6386, GNorm = 0.2482, lr_0 = 9.9552e-04
Validation binary_cross_entropy = 0.119962
Epoch 6542
Loss = 7.4361e-02, PNorm = 674.7237, GNorm = 4.3204, lr_0 = 9.9552e-04
Validation binary_cross_entropy = 0.121186
Epoch 6543
Loss = 2.0878e-02, PNorm = 674.7833, GNorm = 1.2038, lr_0 = 9.9551e-04
Validation binary_cross_entropy = 0.104842
Epoch 6544
Loss = 1.7691e-02, PNorm = 674.8246, GNorm = 0.4692, lr_0 = 9.9551e-04
Validation binary_cross_entropy = 0.098445
Epoch 6545
Loss = 4.5167e-03, PNorm = 674.8670, GNorm = 0.2798, lr_0 = 9.9551e-04
Validation binary_cross_entropy = 0.123053
Epoch 6546
Loss = 2.0737e-02, PNorm = 674.9103, GNorm = 0.0098, lr_0 = 9.9551e-04
Validation binary_cross_entropy = 0.143744
Epoch 6547
Loss = 9.1908e-03, PNorm = 674.9392, GNorm = 0.2861, lr_0 = 9.9551e-04
Validation binary_cross_entropy = 0.145379
Epoch 6548
Loss = 3.6253e-03, PNorm = 674.9571, GNorm = 0.5164, lr_0 = 9.9551e-04
Validation binary_cross_entropy = 0.145179
Epoch 6549
Loss = 9.8053e-05, PNorm = 674.9783, GNorm = 0.0098, lr_0 = 9.9551e-04
Loss = 1.9470e-02, PNorm = 674.9951, GNorm = 0.2998, lr_0 = 9.9551e-04
Validation binary_cross_entropy = 0.144508
Epoch 6550
Loss = 2.7521e-02, PNorm = 675.0232, GNorm = 2.2867, lr_0 = 9.9551e-04
Validation binary_cross_entropy = 0.140722
Epoch 6551
Loss = 4.2037e-02, PNorm = 675.0471, GNorm = 0.3692, lr_0 = 9.9551e-04
Validation binary_cross_entropy = 0.108006
Epoch 6552
Loss = 1.4756e-02, PNorm = 675.0835, GNorm = 0.1100, lr_0 = 9.9551e-04
Validation binary_cross_entropy = 0.109456
Epoch 6553
Loss = 7.4401e-03, PNorm = 675.1182, GNorm = 0.1425, lr_0 = 9.9551e-04
Validation binary_cross_entropy = 0.114022
Epoch 6554
Loss = 1.3842e-02, PNorm = 675.1481, GNorm = 0.0467, lr_0 = 9.9551e-04
Validation binary_cross_entropy = 0.116512
Epoch 6555
Loss = 3.2412e-02, PNorm = 675.1763, GNorm = 0.5755, lr_0 = 9.9551e-04
Validation binary_cross_entropy = 0.112319
Epoch 6556
Loss = 2.9828e-02, PNorm = 675.2065, GNorm = 0.1839, lr_0 = 9.9551e-04
Validation binary_cross_entropy = 0.122387
Epoch 6557
Loss = 1.0047e-02, PNorm = 675.2500, GNorm = 1.4364, lr_0 = 9.9550e-04
Validation binary_cross_entropy = 0.083248
Epoch 6558
Loss = 3.2384e-02, PNorm = 675.2914, GNorm = 0.5820, lr_0 = 9.9550e-04
Validation binary_cross_entropy = 0.094555
Epoch 6559
Loss = 3.3083e-02, PNorm = 675.3876, GNorm = 2.8309, lr_0 = 9.9550e-04
Loss = 4.8271e-02, PNorm = 675.4563, GNorm = 2.2156, lr_0 = 9.9550e-04
Validation binary_cross_entropy = 0.101571
Epoch 6560
Loss = 4.1877e-02, PNorm = 675.5109, GNorm = 0.2961, lr_0 = 9.9550e-04
Validation binary_cross_entropy = 0.083018
Epoch 6561
Loss = 3.0898e-02, PNorm = 675.5567, GNorm = 0.2392, lr_0 = 9.9550e-04
Validation binary_cross_entropy = 0.090137
Epoch 6562
Loss = 4.0611e-02, PNorm = 675.5930, GNorm = 2.1185, lr_0 = 9.9550e-04
Validation binary_cross_entropy = 0.087484
Epoch 6563
Loss = 2.4656e-02, PNorm = 675.6331, GNorm = 0.8212, lr_0 = 9.9550e-04
Validation binary_cross_entropy = 0.107014
Epoch 6564
Loss = 6.5118e-02, PNorm = 675.6597, GNorm = 1.0900, lr_0 = 9.9550e-04
Validation binary_cross_entropy = 0.079114
Epoch 6565
Loss = 1.9082e-02, PNorm = 675.6868, GNorm = 0.1208, lr_0 = 9.9550e-04
Validation binary_cross_entropy = 0.079398
Epoch 6566
Loss = 2.8945e-02, PNorm = 675.7202, GNorm = 0.5060, lr_0 = 9.9550e-04
Validation binary_cross_entropy = 0.083046
Epoch 6567
Loss = 9.8671e-03, PNorm = 675.7624, GNorm = 0.1806, lr_0 = 9.9550e-04
Validation binary_cross_entropy = 0.093774
Epoch 6568
Loss = 1.6052e-02, PNorm = 675.7939, GNorm = 0.5439, lr_0 = 9.9550e-04
Validation binary_cross_entropy = 0.069153
Epoch 6569
Loss = 2.3838e-02, PNorm = 675.8238, GNorm = 1.4594, lr_0 = 9.9550e-04
Loss = 5.7016e-02, PNorm = 675.8666, GNorm = 1.0372, lr_0 = 9.9549e-04
Validation binary_cross_entropy = 0.081483
Epoch 6570
Loss = 5.0912e-02, PNorm = 675.8987, GNorm = 1.0727, lr_0 = 9.9549e-04
Validation binary_cross_entropy = 0.069778
Epoch 6571
Loss = 3.7443e-02, PNorm = 675.9333, GNorm = 0.2428, lr_0 = 9.9549e-04
Validation binary_cross_entropy = 0.061819
Epoch 6572
Loss = 4.2794e-02, PNorm = 675.9904, GNorm = 0.3802, lr_0 = 9.9549e-04
Validation binary_cross_entropy = 0.081486
Epoch 6573
Loss = 5.5419e-02, PNorm = 676.0284, GNorm = 3.3788, lr_0 = 9.9549e-04
Validation binary_cross_entropy = 0.075879
Epoch 6574
Loss = 1.4300e-02, PNorm = 676.0544, GNorm = 0.2887, lr_0 = 9.9549e-04
Validation binary_cross_entropy = 0.071923
Epoch 6575
Loss = 1.2571e-02, PNorm = 676.0839, GNorm = 0.1827, lr_0 = 9.9549e-04
Validation binary_cross_entropy = 0.074766
Epoch 6576
Loss = 3.3354e-02, PNorm = 676.1194, GNorm = 2.9708, lr_0 = 9.9549e-04
Validation binary_cross_entropy = 0.086867
Epoch 6577
Loss = 1.0263e-02, PNorm = 676.1621, GNorm = 0.1456, lr_0 = 9.9549e-04
Validation binary_cross_entropy = 0.079661
Epoch 6578
Loss = 3.9260e-03, PNorm = 676.1862, GNorm = 0.3028, lr_0 = 9.9549e-04
Validation binary_cross_entropy = 0.077706
Epoch 6579
Loss = 6.7432e-03, PNorm = 676.2120, GNorm = 0.3209, lr_0 = 9.9549e-04
Loss = 4.8607e-03, PNorm = 676.2390, GNorm = 0.0750, lr_0 = 9.9549e-04
Validation binary_cross_entropy = 0.093342
Epoch 6580
Loss = 3.6848e-02, PNorm = 676.2611, GNorm = 0.1051, lr_0 = 9.9549e-04
Validation binary_cross_entropy = 0.080550
Epoch 6581
Loss = 2.0928e-02, PNorm = 676.2893, GNorm = 1.4889, lr_0 = 9.9549e-04
Validation binary_cross_entropy = 0.075382
Epoch 6582
Loss = 1.3915e-02, PNorm = 676.3255, GNorm = 0.5026, lr_0 = 9.9549e-04
Validation binary_cross_entropy = 0.089720
Epoch 6583
Loss = 1.3649e-02, PNorm = 676.3633, GNorm = 0.5535, lr_0 = 9.9548e-04
Validation binary_cross_entropy = 0.110776
Epoch 6584
Loss = 1.0591e-01, PNorm = 676.3848, GNorm = 0.1175, lr_0 = 9.9548e-04
Validation binary_cross_entropy = 0.070118
Epoch 6585
Loss = 1.3488e-02, PNorm = 676.4382, GNorm = 0.1441, lr_0 = 9.9548e-04
Validation binary_cross_entropy = 0.090531
Epoch 6586
Loss = 5.1667e-02, PNorm = 676.4931, GNorm = 0.1178, lr_0 = 9.9548e-04
Validation binary_cross_entropy = 0.080046
Epoch 6587
Loss = 5.0718e-03, PNorm = 676.5269, GNorm = 0.2466, lr_0 = 9.9548e-04
Validation binary_cross_entropy = 0.096399
Epoch 6588
Loss = 1.9161e-02, PNorm = 676.5733, GNorm = 1.2660, lr_0 = 9.9548e-04
Validation binary_cross_entropy = 0.082546
Epoch 6589
Loss = 7.0445e-04, PNorm = 676.6016, GNorm = 0.0318, lr_0 = 9.9548e-04
Loss = 4.4938e-02, PNorm = 676.6306, GNorm = 5.6546, lr_0 = 9.9548e-04
Validation binary_cross_entropy = 0.076871
Epoch 6590
Loss = 2.8541e-02, PNorm = 676.6741, GNorm = 0.3134, lr_0 = 9.9548e-04
Validation binary_cross_entropy = 0.076906
Epoch 6591
Loss = 3.5576e-02, PNorm = 676.7226, GNorm = 2.6295, lr_0 = 9.9548e-04
Validation binary_cross_entropy = 0.088401
Epoch 6592
Loss = 1.5298e-02, PNorm = 676.7552, GNorm = 0.8602, lr_0 = 9.9548e-04
Validation binary_cross_entropy = 0.086546
Epoch 6593
Loss = 2.7810e-02, PNorm = 676.7832, GNorm = 1.3624, lr_0 = 9.9548e-04
Validation binary_cross_entropy = 0.100109
Epoch 6594
Loss = 5.2990e-02, PNorm = 676.8389, GNorm = 7.6320, lr_0 = 9.9548e-04
Validation binary_cross_entropy = 0.114691
Epoch 6595
Loss = 3.0843e-02, PNorm = 676.8768, GNorm = 1.3640, lr_0 = 9.9548e-04
Validation binary_cross_entropy = 0.096168
Epoch 6596
Loss = 4.7377e-03, PNorm = 676.9245, GNorm = 0.5093, lr_0 = 9.9548e-04
Validation binary_cross_entropy = 0.108922
Epoch 6597
Loss = 8.6761e-03, PNorm = 676.9659, GNorm = 1.7441, lr_0 = 9.9547e-04
Validation binary_cross_entropy = 0.114889
Epoch 6598
Loss = 8.3187e-02, PNorm = 677.0012, GNorm = 3.8283, lr_0 = 9.9547e-04
Validation binary_cross_entropy = 0.104635
Epoch 6599
Loss = 1.7092e-03, PNorm = 677.0367, GNorm = 0.0600, lr_0 = 9.9547e-04
Loss = 3.0301e-02, PNorm = 677.0677, GNorm = 4.2733, lr_0 = 9.9547e-04
Validation binary_cross_entropy = 0.103970
Epoch 6600
Loss = 3.3030e-02, PNorm = 677.0928, GNorm = 1.7625, lr_0 = 9.9547e-04
Validation binary_cross_entropy = 0.091383
Epoch 6601
Loss = 2.4959e-02, PNorm = 677.1231, GNorm = 0.1945, lr_0 = 9.9547e-04
Validation binary_cross_entropy = 0.088366
Epoch 6602
Loss = 2.1214e-02, PNorm = 677.1680, GNorm = 0.0917, lr_0 = 9.9547e-04
Validation binary_cross_entropy = 0.089074
Epoch 6603
Loss = 6.0298e-02, PNorm = 677.2101, GNorm = 0.0591, lr_0 = 9.9547e-04
Validation binary_cross_entropy = 0.097549
Epoch 6604
Loss = 3.4818e-02, PNorm = 677.2434, GNorm = 0.3337, lr_0 = 9.9547e-04
Validation binary_cross_entropy = 0.097122
Epoch 6605
Loss = 4.0847e-02, PNorm = 677.2742, GNorm = 3.8506, lr_0 = 9.9547e-04
Validation binary_cross_entropy = 0.105045
Epoch 6606
Loss = 5.3785e-02, PNorm = 677.3084, GNorm = 2.8143, lr_0 = 9.9547e-04
Validation binary_cross_entropy = 0.089457
Epoch 6607
Loss = 9.2713e-03, PNorm = 677.3386, GNorm = 0.1742, lr_0 = 9.9547e-04
Validation binary_cross_entropy = 0.088039
Epoch 6608
Loss = 1.0481e-02, PNorm = 677.3889, GNorm = 0.4518, lr_0 = 9.9547e-04
Validation binary_cross_entropy = 0.097292
Epoch 6609
Loss = 5.9267e-03, PNorm = 677.4373, GNorm = 0.1573, lr_0 = 9.9547e-04
Loss = 1.6625e-02, PNorm = 677.4782, GNorm = 1.2400, lr_0 = 9.9546e-04
Validation binary_cross_entropy = 0.109741
Epoch 6610
Loss = 2.1103e-02, PNorm = 677.5100, GNorm = 0.0898, lr_0 = 9.9546e-04
Validation binary_cross_entropy = 0.118972
Epoch 6611
Loss = 1.1595e-02, PNorm = 677.5423, GNorm = 0.1006, lr_0 = 9.9546e-04
Validation binary_cross_entropy = 0.126688
Epoch 6612
Loss = 7.9289e-03, PNorm = 677.5729, GNorm = 1.0265, lr_0 = 9.9546e-04
Validation binary_cross_entropy = 0.132607
Epoch 6613
Loss = 3.1259e-02, PNorm = 677.6054, GNorm = 1.0278, lr_0 = 9.9546e-04
Validation binary_cross_entropy = 0.139269
Epoch 6614
Loss = 1.4014e-02, PNorm = 677.6342, GNorm = 0.1155, lr_0 = 9.9546e-04
Validation binary_cross_entropy = 0.132925
Epoch 6615
Loss = 4.7875e-03, PNorm = 677.6599, GNorm = 0.2972, lr_0 = 9.9546e-04
Validation binary_cross_entropy = 0.136891
Epoch 6616
Loss = 9.9473e-02, PNorm = 677.6744, GNorm = 0.2001, lr_0 = 9.9546e-04
Validation binary_cross_entropy = 0.117475
Epoch 6617
Loss = 8.1292e-02, PNorm = 677.6960, GNorm = 0.6823, lr_0 = 9.9546e-04
Validation binary_cross_entropy = 0.109379
Epoch 6618
Loss = 4.0938e-02, PNorm = 677.7484, GNorm = 2.7689, lr_0 = 9.9546e-04
Validation binary_cross_entropy = 0.112276
Epoch 6619
Loss = 1.5903e-02, PNorm = 677.7977, GNorm = 0.9981, lr_0 = 9.9546e-04
Loss = 2.5392e-02, PNorm = 677.8373, GNorm = 0.0494, lr_0 = 9.9546e-04
Validation binary_cross_entropy = 0.110419
Epoch 6620
Loss = 2.4567e-02, PNorm = 677.8832, GNorm = 0.1898, lr_0 = 9.9546e-04
Validation binary_cross_entropy = 0.120159
Epoch 6621
Loss = 2.2801e-03, PNorm = 677.9172, GNorm = 0.1108, lr_0 = 9.9546e-04
Validation binary_cross_entropy = 0.179737
Epoch 6622
Loss = 6.1023e-02, PNorm = 677.9371, GNorm = 0.3968, lr_0 = 9.9546e-04
Validation binary_cross_entropy = 0.109191
Epoch 6623
Loss = 2.1405e-02, PNorm = 677.9869, GNorm = 0.2966, lr_0 = 9.9545e-04
Validation binary_cross_entropy = 0.113499
Epoch 6624
Loss = 1.3278e-01, PNorm = 678.0385, GNorm = 2.0258, lr_0 = 9.9545e-04
Validation binary_cross_entropy = 0.100207
Epoch 6625
Loss = 4.1045e-02, PNorm = 678.0893, GNorm = 6.5149, lr_0 = 9.9545e-04
Validation binary_cross_entropy = 0.111489
Epoch 6626
Loss = 1.0309e-01, PNorm = 678.1397, GNorm = 5.3816, lr_0 = 9.9545e-04
Validation binary_cross_entropy = 0.073533
Epoch 6627
Loss = 3.9574e-02, PNorm = 678.1814, GNorm = 1.0401, lr_0 = 9.9545e-04
Validation binary_cross_entropy = 0.094711
Epoch 6628
Loss = 2.0909e-02, PNorm = 678.2467, GNorm = 0.3513, lr_0 = 9.9545e-04
Validation binary_cross_entropy = 0.081784
Epoch 6629
Loss = 1.1779e-02, PNorm = 678.3045, GNorm = 0.8309, lr_0 = 9.9545e-04
Loss = 4.4273e-02, PNorm = 678.3597, GNorm = 2.6722, lr_0 = 9.9545e-04
Validation binary_cross_entropy = 0.083343
Epoch 6630
Loss = 2.4516e-02, PNorm = 678.4039, GNorm = 0.3069, lr_0 = 9.9545e-04
Validation binary_cross_entropy = 0.083254
Epoch 6631
Loss = 1.7838e-02, PNorm = 678.4409, GNorm = 4.0722, lr_0 = 9.9545e-04
Validation binary_cross_entropy = 0.088253
Epoch 6632
Loss = 3.5115e-02, PNorm = 678.4749, GNorm = 0.3565, lr_0 = 9.9545e-04
Validation binary_cross_entropy = 0.081908
Epoch 6633
Loss = 2.0214e-02, PNorm = 678.5076, GNorm = 1.5086, lr_0 = 9.9545e-04
Validation binary_cross_entropy = 0.089602
Epoch 6634
Loss = 8.9781e-03, PNorm = 678.5559, GNorm = 0.0909, lr_0 = 9.9545e-04
Validation binary_cross_entropy = 0.097700
Epoch 6635
Loss = 2.1823e-02, PNorm = 678.6058, GNorm = 0.0407, lr_0 = 9.9545e-04
Validation binary_cross_entropy = 0.124436
Epoch 6636
Loss = 3.4637e-03, PNorm = 678.6481, GNorm = 1.2398, lr_0 = 9.9544e-04
Validation binary_cross_entropy = 0.101953
Epoch 6637
Loss = 8.8949e-02, PNorm = 678.6696, GNorm = 0.0295, lr_0 = 9.9544e-04
Validation binary_cross_entropy = 0.093708
Epoch 6638
Loss = 9.0923e-02, PNorm = 678.6902, GNorm = 1.2110, lr_0 = 9.9544e-04
Validation binary_cross_entropy = 0.087522
Epoch 6639
Loss = 4.1406e-03, PNorm = 678.7452, GNorm = 0.1999, lr_0 = 9.9544e-04
Loss = 2.8621e-02, PNorm = 678.8220, GNorm = 1.5143, lr_0 = 9.9544e-04
Validation binary_cross_entropy = 0.137444
Epoch 6640
Loss = 2.6091e+00, PNorm = 678.9363, GNorm = 10.3057, lr_0 = 9.9544e-04
Validation binary_cross_entropy = 0.064304
Epoch 6641
Loss = 2.6913e-01, PNorm = 679.2191, GNorm = 6.6605, lr_0 = 9.9544e-04
Validation binary_cross_entropy = 0.221712
Epoch 6642
Loss = 2.4464e-01, PNorm = 679.4154, GNorm = 2.2963, lr_0 = 9.9544e-04
Validation binary_cross_entropy = 0.437494
Epoch 6643
Loss = 2.6917e-01, PNorm = 679.5596, GNorm = 11.5167, lr_0 = 9.9544e-04
Validation binary_cross_entropy = 0.118298
Epoch 6644
Loss = 1.0988e-01, PNorm = 679.6840, GNorm = 2.4264, lr_0 = 9.9544e-04
Validation binary_cross_entropy = 0.113094
Epoch 6645
Loss = 2.5516e-01, PNorm = 679.7834, GNorm = 7.8842, lr_0 = 9.9544e-04
Validation binary_cross_entropy = 0.508629
Epoch 6646
Loss = 3.2129e-01, PNorm = 679.9231, GNorm = 18.3714, lr_0 = 9.9544e-04
Validation binary_cross_entropy = 0.228699
Epoch 6647
Loss = 2.8764e-01, PNorm = 680.0415, GNorm = 6.4100, lr_0 = 9.9544e-04
Validation binary_cross_entropy = 0.196288
Epoch 6648
Loss = 2.0658e-01, PNorm = 680.1458, GNorm = 5.0705, lr_0 = 9.9544e-04
Validation binary_cross_entropy = 0.154250
Epoch 6649
Loss = 9.7549e-03, PNorm = 680.2426, GNorm = 0.6568, lr_0 = 9.9544e-04
Loss = 1.1907e-01, PNorm = 680.3362, GNorm = 4.7263, lr_0 = 9.9543e-04
Validation binary_cross_entropy = 0.223411
Epoch 6650
Loss = 1.0536e-01, PNorm = 680.4150, GNorm = 2.2558, lr_0 = 9.9543e-04
Validation binary_cross_entropy = 0.087931
Epoch 6651
Loss = 5.1901e-02, PNorm = 680.5035, GNorm = 1.0206, lr_0 = 9.9543e-04
Validation binary_cross_entropy = 0.154825
Epoch 6652
Loss = 8.3747e-02, PNorm = 680.5657, GNorm = 1.9865, lr_0 = 9.9543e-04
Validation binary_cross_entropy = 0.108296
Epoch 6653
Loss = 9.2270e-02, PNorm = 680.6183, GNorm = 0.7480, lr_0 = 9.9543e-04
Validation binary_cross_entropy = 0.101865
Epoch 6654
Loss = 4.1151e-02, PNorm = 680.6849, GNorm = 1.8588, lr_0 = 9.9543e-04
Validation binary_cross_entropy = 0.121571
Epoch 6655
Loss = 5.6547e-02, PNorm = 680.7318, GNorm = 0.5955, lr_0 = 9.9543e-04
Validation binary_cross_entropy = 0.086445
Epoch 6656
Loss = 8.8977e-02, PNorm = 680.8056, GNorm = 1.0978, lr_0 = 9.9543e-04
Validation binary_cross_entropy = 0.121566
Epoch 6657
Loss = 7.6817e-02, PNorm = 680.8782, GNorm = 1.4556, lr_0 = 9.9543e-04
Validation binary_cross_entropy = 0.116568
Epoch 6658
Loss = 1.3615e-02, PNorm = 680.9297, GNorm = 0.3813, lr_0 = 9.9543e-04
Validation binary_cross_entropy = 0.100692
Epoch 6659
Loss = 1.3100e-01, PNorm = 680.9745, GNorm = 3.6811, lr_0 = 9.9543e-04
Loss = 1.0492e-01, PNorm = 681.0361, GNorm = 0.8995, lr_0 = 9.9543e-04
Validation binary_cross_entropy = 0.098490
Epoch 6660
Loss = 5.5496e-02, PNorm = 681.1052, GNorm = 3.4926, lr_0 = 9.9543e-04
Validation binary_cross_entropy = 0.099062
Epoch 6661
Loss = 7.6892e-02, PNorm = 681.1577, GNorm = 0.0458, lr_0 = 9.9543e-04
Validation binary_cross_entropy = 0.094649
Epoch 6662
Loss = 5.9169e-02, PNorm = 681.2288, GNorm = 1.4237, lr_0 = 9.9542e-04
Validation binary_cross_entropy = 0.131959
Epoch 6663
Loss = 4.2108e-02, PNorm = 681.2726, GNorm = 0.1426, lr_0 = 9.9542e-04
Validation binary_cross_entropy = 0.091534
Epoch 6664
Loss = 2.3537e-02, PNorm = 681.3242, GNorm = 0.9040, lr_0 = 9.9542e-04
Validation binary_cross_entropy = 0.092542
Epoch 6665
Loss = 3.6352e-02, PNorm = 681.3764, GNorm = 0.5312, lr_0 = 9.9542e-04
Validation binary_cross_entropy = 0.130053
Epoch 6666
Loss = 4.7792e-02, PNorm = 681.4457, GNorm = 5.7036, lr_0 = 9.9542e-04
Validation binary_cross_entropy = 0.123654
Epoch 6667
Loss = 9.7952e-02, PNorm = 681.5239, GNorm = 1.9201, lr_0 = 9.9542e-04
Validation binary_cross_entropy = 0.107032
Epoch 6668
Loss = 3.3695e-02, PNorm = 681.5836, GNorm = 3.1561, lr_0 = 9.9542e-04
Validation binary_cross_entropy = 0.127156
Epoch 6669
Loss = 3.7231e-02, PNorm = 681.6416, GNorm = 1.6252, lr_0 = 9.9542e-04
Loss = 6.1065e-02, PNorm = 681.6792, GNorm = 3.3205, lr_0 = 9.9542e-04
Validation binary_cross_entropy = 0.122222
Epoch 6670
Loss = 4.7905e-02, PNorm = 681.7181, GNorm = 0.2223, lr_0 = 9.9542e-04
Validation binary_cross_entropy = 0.084917
Epoch 6671
Loss = 8.0944e-02, PNorm = 681.7798, GNorm = 3.5999, lr_0 = 9.9542e-04
Validation binary_cross_entropy = 0.093844
Epoch 6672
Loss = 2.6331e-02, PNorm = 681.8319, GNorm = 1.4295, lr_0 = 9.9542e-04
Validation binary_cross_entropy = 0.095022
Epoch 6673
Loss = 4.8621e-02, PNorm = 681.8973, GNorm = 0.5399, lr_0 = 9.9542e-04
Validation binary_cross_entropy = 0.112676
Epoch 6674
Loss = 5.7198e-02, PNorm = 681.9794, GNorm = 0.5667, lr_0 = 9.9542e-04
Validation binary_cross_entropy = 0.100631
Epoch 6675
Loss = 4.9989e-02, PNorm = 682.0266, GNorm = 1.3441, lr_0 = 9.9542e-04
Validation binary_cross_entropy = 0.080283
Epoch 6676
Loss = 2.4136e-02, PNorm = 682.1009, GNorm = 1.1317, lr_0 = 9.9541e-04
Validation binary_cross_entropy = 0.101998
Epoch 6677
Loss = 1.0139e-01, PNorm = 682.1738, GNorm = 2.2038, lr_0 = 9.9541e-04
Validation binary_cross_entropy = 0.095731
Epoch 6678
Loss = 4.5100e-02, PNorm = 682.2320, GNorm = 2.5843, lr_0 = 9.9541e-04
Validation binary_cross_entropy = 0.114403
Epoch 6679
Loss = 4.6731e-03, PNorm = 682.2896, GNorm = 0.1271, lr_0 = 9.9541e-04
Loss = 6.0006e-02, PNorm = 682.3406, GNorm = 0.9746, lr_0 = 9.9541e-04
Validation binary_cross_entropy = 0.099532
Epoch 6680
Loss = 1.6389e-02, PNorm = 682.4020, GNorm = 1.0636, lr_0 = 9.9541e-04
Validation binary_cross_entropy = 0.092993
Epoch 6681
Loss = 1.0449e-01, PNorm = 682.4639, GNorm = 6.0534, lr_0 = 9.9541e-04
Validation binary_cross_entropy = 0.082396
Epoch 6682
Loss = 4.7972e-02, PNorm = 682.5324, GNorm = 8.4378, lr_0 = 9.9541e-04
Validation binary_cross_entropy = 0.077304
Epoch 6683
Loss = 2.5401e-02, PNorm = 682.6120, GNorm = 5.1519, lr_0 = 9.9541e-04
Validation binary_cross_entropy = 0.097614
Epoch 6684
Loss = 3.6771e-02, PNorm = 682.6708, GNorm = 1.7267, lr_0 = 9.9541e-04
Validation binary_cross_entropy = 0.111923
Epoch 6685
Loss = 5.4299e-02, PNorm = 682.6973, GNorm = 3.0285, lr_0 = 9.9541e-04
Validation binary_cross_entropy = 0.080133
Epoch 6686
Loss = 7.4360e-03, PNorm = 682.7329, GNorm = 0.3759, lr_0 = 9.9541e-04
Validation binary_cross_entropy = 0.081184
Epoch 6687
Loss = 1.8741e-02, PNorm = 682.7750, GNorm = 1.6350, lr_0 = 9.9541e-04
Validation binary_cross_entropy = 0.100620
Epoch 6688
Loss = 2.2096e-02, PNorm = 682.8385, GNorm = 1.3555, lr_0 = 9.9541e-04
Validation binary_cross_entropy = 0.090976
Epoch 6689
Loss = 1.6761e-02, PNorm = 682.8769, GNorm = 1.4473, lr_0 = 9.9541e-04
Loss = 2.7148e-02, PNorm = 682.9184, GNorm = 0.0906, lr_0 = 9.9540e-04
Validation binary_cross_entropy = 0.129941
Epoch 6690
Loss = 5.9792e-02, PNorm = 682.9603, GNorm = 0.5656, lr_0 = 9.9540e-04
Validation binary_cross_entropy = 0.085171
Epoch 6691
Loss = 8.3149e-02, PNorm = 683.0436, GNorm = 0.4331, lr_0 = 9.9540e-04
Validation binary_cross_entropy = 0.089796
Epoch 6692
Loss = 5.0999e-02, PNorm = 683.1348, GNorm = 0.1274, lr_0 = 9.9540e-04
Validation binary_cross_entropy = 0.104099
Epoch 6693
Loss = 1.6465e-02, PNorm = 683.1983, GNorm = 1.7467, lr_0 = 9.9540e-04
Validation binary_cross_entropy = 0.148475
Epoch 6694
Loss = 1.2897e-02, PNorm = 683.2557, GNorm = 3.4215, lr_0 = 9.9540e-04
Validation binary_cross_entropy = 0.125112
Epoch 6695
Loss = 3.8434e-03, PNorm = 683.3026, GNorm = 0.0945, lr_0 = 9.9540e-04
Validation binary_cross_entropy = 0.095163
Epoch 6696
Loss = 8.4523e-03, PNorm = 683.3539, GNorm = 1.2548, lr_0 = 9.9540e-04
Validation binary_cross_entropy = 0.095431
Epoch 6697
Loss = 5.5758e-03, PNorm = 683.4230, GNorm = 0.1489, lr_0 = 9.9540e-04
Validation binary_cross_entropy = 0.095673
Epoch 6698
Loss = 8.4980e-02, PNorm = 683.4792, GNorm = 3.1650, lr_0 = 9.9540e-04
Validation binary_cross_entropy = 0.110740
Epoch 6699
Loss = 2.4847e-02, PNorm = 683.5356, GNorm = 2.2903, lr_0 = 9.9540e-04
Loss = 1.0159e-01, PNorm = 683.5671, GNorm = 1.4015, lr_0 = 9.9540e-04
Validation binary_cross_entropy = 0.093502
Epoch 6700
Loss = 4.5231e-02, PNorm = 683.6173, GNorm = 0.7300, lr_0 = 9.9540e-04
Validation binary_cross_entropy = 0.114417
Epoch 6701
Loss = 5.1662e-02, PNorm = 683.6520, GNorm = 3.2973, lr_0 = 9.9540e-04
Validation binary_cross_entropy = 0.078122
Epoch 6702
Loss = 6.5485e-02, PNorm = 683.6999, GNorm = 2.7117, lr_0 = 9.9539e-04
Validation binary_cross_entropy = 0.088229
Epoch 6703
Loss = 2.0315e-02, PNorm = 683.7434, GNorm = 0.3178, lr_0 = 9.9539e-04
Validation binary_cross_entropy = 0.117823
Epoch 6704
Loss = 1.2054e-02, PNorm = 683.7735, GNorm = 1.6274, lr_0 = 9.9539e-04
Validation binary_cross_entropy = 0.107069
Epoch 6705
Loss = 2.0164e-02, PNorm = 683.8025, GNorm = 1.5292, lr_0 = 9.9539e-04
Validation binary_cross_entropy = 0.167682
Epoch 6706
Loss = 5.8234e-02, PNorm = 683.8313, GNorm = 0.0556, lr_0 = 9.9539e-04
Validation binary_cross_entropy = 0.088787
Epoch 6707
Loss = 5.5167e-02, PNorm = 683.8623, GNorm = 1.1674, lr_0 = 9.9539e-04
Validation binary_cross_entropy = 0.085735
Epoch 6708
Loss = 9.3806e-03, PNorm = 683.9204, GNorm = 0.4643, lr_0 = 9.9539e-04
Validation binary_cross_entropy = 0.078294
Epoch 6709
Loss = 3.3817e-02, PNorm = 683.9710, GNorm = 1.9722, lr_0 = 9.9539e-04
Loss = 4.4656e-02, PNorm = 684.0398, GNorm = 2.6184, lr_0 = 9.9539e-04
Validation binary_cross_entropy = 0.091735
Epoch 6710
Loss = 8.6658e-02, PNorm = 684.0916, GNorm = 1.9780, lr_0 = 9.9539e-04
Validation binary_cross_entropy = 0.109539
Epoch 6711
Loss = 8.2414e-02, PNorm = 684.1329, GNorm = 4.6458, lr_0 = 9.9539e-04
Validation binary_cross_entropy = 0.088692
Epoch 6712
Loss = 3.4040e-02, PNorm = 684.1781, GNorm = 0.9190, lr_0 = 9.9539e-04
Validation binary_cross_entropy = 0.100381
Epoch 6713
Loss = 1.3136e-02, PNorm = 684.2198, GNorm = 0.1027, lr_0 = 9.9539e-04
Validation binary_cross_entropy = 0.105379
Epoch 6714
Loss = 2.5290e-02, PNorm = 684.2477, GNorm = 3.0868, lr_0 = 9.9539e-04
Validation binary_cross_entropy = 0.104627
Epoch 6715
Loss = 4.3543e-02, PNorm = 684.2868, GNorm = 0.0455, lr_0 = 9.9539e-04
Validation binary_cross_entropy = 0.136996
Epoch 6716
Loss = 1.5704e-01, PNorm = 684.3393, GNorm = 3.4277, lr_0 = 9.9538e-04
Validation binary_cross_entropy = 0.093454
Epoch 6717
Loss = 6.2046e-03, PNorm = 684.3946, GNorm = 0.2261, lr_0 = 9.9538e-04
Validation binary_cross_entropy = 0.117399
Epoch 6718
Loss = 1.0778e-02, PNorm = 684.4605, GNorm = 0.0655, lr_0 = 9.9538e-04
Validation binary_cross_entropy = 0.095152
Epoch 6719
Loss = 3.2667e-02, PNorm = 684.4902, GNorm = 1.4970, lr_0 = 9.9538e-04
Loss = 3.2117e-02, PNorm = 684.5304, GNorm = 1.9519, lr_0 = 9.9538e-04
Validation binary_cross_entropy = 0.097932
Epoch 6720
Loss = 3.8325e-02, PNorm = 684.5637, GNorm = 2.2843, lr_0 = 9.9538e-04
Validation binary_cross_entropy = 0.087185
Epoch 6721
Loss = 1.6352e-02, PNorm = 684.6068, GNorm = 0.1118, lr_0 = 9.9538e-04
Validation binary_cross_entropy = 0.103619
Epoch 6722
Loss = 4.1529e-02, PNorm = 684.6461, GNorm = 1.4374, lr_0 = 9.9538e-04
Validation binary_cross_entropy = 0.100145
Epoch 6723
Loss = 1.1610e-02, PNorm = 684.6752, GNorm = 1.1720, lr_0 = 9.9538e-04
Validation binary_cross_entropy = 0.106574
Epoch 6724
Loss = 3.3233e-02, PNorm = 684.7086, GNorm = 1.8936, lr_0 = 9.9538e-04
Validation binary_cross_entropy = 0.105807
Epoch 6725
Loss = 1.4643e-02, PNorm = 684.7330, GNorm = 0.7664, lr_0 = 9.9538e-04
Validation binary_cross_entropy = 0.109784
Epoch 6726
Loss = 5.9047e-02, PNorm = 684.7619, GNorm = 2.4782, lr_0 = 9.9538e-04
Validation binary_cross_entropy = 0.110435
Epoch 6727
Loss = 4.6188e-02, PNorm = 684.7867, GNorm = 7.1791, lr_0 = 9.9538e-04
Validation binary_cross_entropy = 0.103445
Epoch 6728
Loss = 1.7592e-02, PNorm = 684.8084, GNorm = 1.4606, lr_0 = 9.9538e-04
Validation binary_cross_entropy = 0.099699
Epoch 6729
Loss = 2.6309e-03, PNorm = 684.8369, GNorm = 0.1113, lr_0 = 9.9537e-04
Loss = 1.3507e-02, PNorm = 684.8679, GNorm = 0.5950, lr_0 = 9.9537e-04
Validation binary_cross_entropy = 0.113844
Epoch 6730
Loss = 1.2754e-01, PNorm = 684.8915, GNorm = 3.8343, lr_0 = 9.9537e-04
Validation binary_cross_entropy = 0.121248
Epoch 6731
Loss = 2.6224e-02, PNorm = 684.9262, GNorm = 0.4704, lr_0 = 9.9537e-04
Validation binary_cross_entropy = 0.110124
Epoch 6732
Loss = 9.6162e-02, PNorm = 684.9551, GNorm = 1.8958, lr_0 = 9.9537e-04
Validation binary_cross_entropy = 0.083933
Epoch 6733
Loss = 7.2372e-03, PNorm = 685.0003, GNorm = 0.2339, lr_0 = 9.9537e-04
Validation binary_cross_entropy = 0.140992
Epoch 6734
Loss = 3.5440e-02, PNorm = 685.0350, GNorm = 0.0791, lr_0 = 9.9537e-04
Validation binary_cross_entropy = 0.084973
Epoch 6735
Loss = 1.1195e-01, PNorm = 685.0788, GNorm = 1.6568, lr_0 = 9.9537e-04
Validation binary_cross_entropy = 0.112479
Epoch 6736
Loss = 6.9734e-02, PNorm = 685.1231, GNorm = 0.1632, lr_0 = 9.9537e-04
Validation binary_cross_entropy = 0.114073
Epoch 6737
Loss = 1.8472e-02, PNorm = 685.1729, GNorm = 0.0853, lr_0 = 9.9537e-04
Validation binary_cross_entropy = 0.120610
Epoch 6738
Loss = 1.1089e-01, PNorm = 685.2049, GNorm = 0.2772, lr_0 = 9.9537e-04
Validation binary_cross_entropy = 0.119266
Epoch 6739
Loss = 2.3427e-02, PNorm = 685.2338, GNorm = 1.1301, lr_0 = 9.9537e-04
Loss = 5.8051e-02, PNorm = 685.2774, GNorm = 3.4013, lr_0 = 9.9537e-04
Validation binary_cross_entropy = 0.094254
Epoch 6740
Loss = 9.7773e-03, PNorm = 685.3295, GNorm = 0.4828, lr_0 = 9.9537e-04
Validation binary_cross_entropy = 0.109279
Epoch 6741
Loss = 7.8120e-02, PNorm = 685.3788, GNorm = 0.0537, lr_0 = 9.9537e-04
Validation binary_cross_entropy = 0.189687
Epoch 6742
Loss = 8.8075e-02, PNorm = 685.4239, GNorm = 1.0005, lr_0 = 9.9536e-04
Validation binary_cross_entropy = 0.112978
Epoch 6743
Loss = 1.6020e-01, PNorm = 685.4852, GNorm = 1.8365, lr_0 = 9.9536e-04
Validation binary_cross_entropy = 0.119355
Epoch 6744
Loss = 2.8488e-02, PNorm = 685.5717, GNorm = 5.7025, lr_0 = 9.9536e-04
Validation binary_cross_entropy = 0.125416
Epoch 6745
Loss = 1.8565e-02, PNorm = 685.6335, GNorm = 0.2259, lr_0 = 9.9536e-04
Validation binary_cross_entropy = 0.116830
Epoch 6746
Loss = 1.6330e-02, PNorm = 685.6787, GNorm = 0.0764, lr_0 = 9.9536e-04
Validation binary_cross_entropy = 0.101919
Epoch 6747
Loss = 4.2706e-02, PNorm = 685.7299, GNorm = 2.5589, lr_0 = 9.9536e-04
Validation binary_cross_entropy = 0.121129
Epoch 6748
Loss = 1.2324e-01, PNorm = 685.8780, GNorm = 1.6564, lr_0 = 9.9536e-04
Validation binary_cross_entropy = 0.197342
Epoch 6749
Loss = 2.9534e-01, PNorm = 686.1629, GNorm = 9.4554, lr_0 = 9.9536e-04
Loss = 2.6350e-01, PNorm = 686.3629, GNorm = 5.8978, lr_0 = 9.9536e-04
Validation binary_cross_entropy = 0.158247
Epoch 6750
Loss = 2.1687e-01, PNorm = 686.5067, GNorm = 1.9484, lr_0 = 9.9536e-04
Validation binary_cross_entropy = 0.142792
Epoch 6751
Loss = 2.3279e-01, PNorm = 686.6160, GNorm = 2.0660, lr_0 = 9.9536e-04
Validation binary_cross_entropy = 0.133319
Epoch 6752
Loss = 1.7447e-01, PNorm = 686.7080, GNorm = 4.0226, lr_0 = 9.9536e-04
Validation binary_cross_entropy = 0.116226
Epoch 6753
Loss = 1.6025e-01, PNorm = 686.7923, GNorm = 0.9129, lr_0 = 9.9536e-04
Validation binary_cross_entropy = 0.103718
Epoch 6754
Loss = 1.5318e-01, PNorm = 686.8665, GNorm = 2.9767, lr_0 = 9.9536e-04
Validation binary_cross_entropy = 0.126252
Epoch 6755
Loss = 1.0465e-01, PNorm = 686.9290, GNorm = 2.2697, lr_0 = 9.9536e-04
Validation binary_cross_entropy = 0.120205
Epoch 6756
Loss = 7.7659e-02, PNorm = 686.9948, GNorm = 1.3789, lr_0 = 9.9535e-04
Validation binary_cross_entropy = 0.128479
Epoch 6757
Loss = 1.0775e-01, PNorm = 687.0536, GNorm = 5.0986, lr_0 = 9.9535e-04
Validation binary_cross_entropy = 0.115179
Epoch 6758
Loss = 1.3248e-01, PNorm = 687.1114, GNorm = 4.6140, lr_0 = 9.9535e-04
Validation binary_cross_entropy = 0.124380
Epoch 6759
Loss = 2.3029e-01, PNorm = 687.1755, GNorm = 2.8105, lr_0 = 9.9535e-04
Loss = 9.5333e-02, PNorm = 687.2229, GNorm = 1.8768, lr_0 = 9.9535e-04
Validation binary_cross_entropy = 0.121836
Epoch 6760
Loss = 7.7904e-02, PNorm = 687.2797, GNorm = 2.0702, lr_0 = 9.9535e-04
Validation binary_cross_entropy = 0.100872
Epoch 6761
Loss = 5.7787e-02, PNorm = 687.3417, GNorm = 0.7832, lr_0 = 9.9535e-04
Validation binary_cross_entropy = 0.113483
Epoch 6762
Loss = 3.8154e-02, PNorm = 687.3941, GNorm = 2.0168, lr_0 = 9.9535e-04
Validation binary_cross_entropy = 0.119487
Epoch 6763
Loss = 4.1527e-02, PNorm = 687.4365, GNorm = 2.4374, lr_0 = 9.9535e-04
Validation binary_cross_entropy = 0.121142
Epoch 6764
Loss = 6.6447e-02, PNorm = 687.4815, GNorm = 5.3519, lr_0 = 9.9535e-04
Validation binary_cross_entropy = 0.249080
Epoch 6765
Loss = 5.4492e-02, PNorm = 687.5262, GNorm = 0.3224, lr_0 = 9.9535e-04
Validation binary_cross_entropy = 0.270591
Epoch 6766
Loss = 9.4384e-02, PNorm = 687.5573, GNorm = 1.9684, lr_0 = 9.9535e-04
Validation binary_cross_entropy = 0.117175
Epoch 6767
Loss = 9.3508e-02, PNorm = 687.5952, GNorm = 5.1175, lr_0 = 9.9535e-04
Validation binary_cross_entropy = 0.139866
Epoch 6768
Loss = 8.1121e-02, PNorm = 687.6526, GNorm = 3.0020, lr_0 = 9.9535e-04
Validation binary_cross_entropy = 0.080909
Epoch 6769
Loss = 1.1636e-01, PNorm = 687.7019, GNorm = 4.3489, lr_0 = 9.9534e-04
Loss = 6.5275e-02, PNorm = 687.7634, GNorm = 2.2143, lr_0 = 9.9534e-04
Validation binary_cross_entropy = 0.099170
Epoch 6770
Loss = 1.0097e-01, PNorm = 687.8168, GNorm = 2.3526, lr_0 = 9.9534e-04
Validation binary_cross_entropy = 0.111924
Epoch 6771
Loss = 7.6787e-02, PNorm = 687.8786, GNorm = 1.8216, lr_0 = 9.9534e-04
Validation binary_cross_entropy = 0.083309
Epoch 6772
Loss = 1.3071e-01, PNorm = 687.9518, GNorm = 1.7194, lr_0 = 9.9534e-04
Validation binary_cross_entropy = 0.112717
Epoch 6773
Loss = 4.4727e-02, PNorm = 687.9997, GNorm = 0.4441, lr_0 = 9.9534e-04
Validation binary_cross_entropy = 0.102533
Epoch 6774
Loss = 1.0911e-01, PNorm = 688.0410, GNorm = 1.0121, lr_0 = 9.9534e-04
Validation binary_cross_entropy = 0.106434
Epoch 6775
Loss = 6.9446e-02, PNorm = 688.0855, GNorm = 0.7842, lr_0 = 9.9534e-04
Validation binary_cross_entropy = 0.089156
Epoch 6776
Loss = 1.2793e-02, PNorm = 688.1274, GNorm = 1.4300, lr_0 = 9.9534e-04
Validation binary_cross_entropy = 0.136951
Epoch 6777
Loss = 5.9908e-02, PNorm = 688.1757, GNorm = 5.0769, lr_0 = 9.9534e-04
Validation binary_cross_entropy = 0.100178
Epoch 6778
Loss = 2.3844e-01, PNorm = 688.2250, GNorm = 6.1493, lr_0 = 9.9534e-04
Validation binary_cross_entropy = 0.345704
Epoch 6779
Loss = 1.1139e-01, PNorm = 688.3106, GNorm = 1.5564, lr_0 = 9.9534e-04
Loss = 5.5642e-02, PNorm = 688.3651, GNorm = 1.1493, lr_0 = 9.9534e-04
Validation binary_cross_entropy = 0.339592
Epoch 6780
Loss = 3.1289e-02, PNorm = 688.4072, GNorm = 4.4975, lr_0 = 9.9534e-04
Validation binary_cross_entropy = 0.279656
Epoch 6781
Loss = 2.0153e-01, PNorm = 688.4711, GNorm = 1.0146, lr_0 = 9.9534e-04
Validation binary_cross_entropy = 0.104425
Epoch 6782
Loss = 9.6257e-02, PNorm = 688.5838, GNorm = 4.8904, lr_0 = 9.9533e-04
Validation binary_cross_entropy = 0.099612
Epoch 6783
Loss = 7.7552e-02, PNorm = 688.6587, GNorm = 2.4175, lr_0 = 9.9533e-04
Validation binary_cross_entropy = 0.086963
Epoch 6784
Loss = 4.5575e-02, PNorm = 688.7127, GNorm = 1.4629, lr_0 = 9.9533e-04
Validation binary_cross_entropy = 0.087206
Epoch 6785
Loss = 7.4709e-02, PNorm = 688.7535, GNorm = 1.5077, lr_0 = 9.9533e-04
Validation binary_cross_entropy = 0.073124
Epoch 6786
Loss = 9.5716e-02, PNorm = 688.7929, GNorm = 4.0314, lr_0 = 9.9533e-04
Validation binary_cross_entropy = 0.096473
Epoch 6787
Loss = 1.5949e-01, PNorm = 688.8529, GNorm = 11.5297, lr_0 = 9.9533e-04
Validation binary_cross_entropy = 0.117474
Epoch 6788
Loss = 4.2637e-02, PNorm = 688.9084, GNorm = 1.3305, lr_0 = 9.9533e-04
Validation binary_cross_entropy = 0.117351
Epoch 6789
Loss = 6.8394e-02, PNorm = 688.9490, GNorm = 1.7454, lr_0 = 9.9533e-04
Loss = 1.0047e-01, PNorm = 688.9839, GNorm = 1.0389, lr_0 = 9.9533e-04
Validation binary_cross_entropy = 0.148047
Epoch 6790
Loss = 3.0325e-02, PNorm = 689.0131, GNorm = 1.1375, lr_0 = 9.9533e-04
Validation binary_cross_entropy = 0.093010
Epoch 6791
Loss = 3.4635e-02, PNorm = 689.0522, GNorm = 0.1732, lr_0 = 9.9533e-04
Validation binary_cross_entropy = 0.107279
Epoch 6792
Loss = 5.5365e-02, PNorm = 689.0856, GNorm = 0.0953, lr_0 = 9.9533e-04
Validation binary_cross_entropy = 0.117036
Epoch 6793
Loss = 6.5696e-02, PNorm = 689.1115, GNorm = 2.8153, lr_0 = 9.9533e-04
Validation binary_cross_entropy = 0.097235
Epoch 6794
Loss = 5.7114e-02, PNorm = 689.1479, GNorm = 2.6851, lr_0 = 9.9533e-04
Validation binary_cross_entropy = 0.135280
Epoch 6795
Loss = 2.2925e-02, PNorm = 689.1941, GNorm = 2.5999, lr_0 = 9.9532e-04
Validation binary_cross_entropy = 0.093076
Epoch 6796
Loss = 1.0014e-01, PNorm = 689.2403, GNorm = 7.7258, lr_0 = 9.9532e-04
Validation binary_cross_entropy = 0.098822
Epoch 6797
Loss = 8.9569e-03, PNorm = 689.3087, GNorm = 0.2571, lr_0 = 9.9532e-04
Validation binary_cross_entropy = 0.129517
Epoch 6798
Loss = 2.0302e-02, PNorm = 689.3585, GNorm = 1.3479, lr_0 = 9.9532e-04
Validation binary_cross_entropy = 0.099378
Epoch 6799
Loss = 4.4835e-03, PNorm = 689.3880, GNorm = 0.2877, lr_0 = 9.9532e-04
Loss = 8.4350e-02, PNorm = 689.4247, GNorm = 0.1667, lr_0 = 9.9532e-04
Validation binary_cross_entropy = 0.086324
Epoch 6800
Loss = 1.0704e-01, PNorm = 689.4737, GNorm = 1.8443, lr_0 = 9.9532e-04
Validation binary_cross_entropy = 0.081334
Epoch 6801
Loss = 4.3427e-02, PNorm = 689.5205, GNorm = 7.4471, lr_0 = 9.9532e-04
Validation binary_cross_entropy = 0.087409
Epoch 6802
Loss = 5.7322e-02, PNorm = 689.5678, GNorm = 0.5734, lr_0 = 9.9532e-04
Validation binary_cross_entropy = 0.100669
Epoch 6803
Loss = 3.1617e-02, PNorm = 689.6082, GNorm = 2.4621, lr_0 = 9.9532e-04
Validation binary_cross_entropy = 0.096684
Epoch 6804
Loss = 8.1854e-02, PNorm = 689.6286, GNorm = 5.7938, lr_0 = 9.9532e-04
Validation binary_cross_entropy = 0.081380
Epoch 6805
Loss = 4.4010e-02, PNorm = 689.6559, GNorm = 0.9518, lr_0 = 9.9532e-04
Validation binary_cross_entropy = 0.080733
Epoch 6806
Loss = 6.5334e-02, PNorm = 689.7053, GNorm = 1.0806, lr_0 = 9.9532e-04
Validation binary_cross_entropy = 0.115808
Epoch 6807
Loss = 2.0632e-02, PNorm = 689.7466, GNorm = 1.3384, lr_0 = 9.9532e-04
Validation binary_cross_entropy = 0.097803
Epoch 6808
Loss = 4.5763e-02, PNorm = 689.7677, GNorm = 3.2861, lr_0 = 9.9532e-04
Validation binary_cross_entropy = 0.094686
Epoch 6809
Loss = 1.2062e-03, PNorm = 689.7883, GNorm = 0.0979, lr_0 = 9.9531e-04
Loss = 1.7153e-01, PNorm = 689.8087, GNorm = 20.0032, lr_0 = 9.9531e-04
Validation binary_cross_entropy = 0.091136
Epoch 6810
Loss = 3.4107e-02, PNorm = 689.8357, GNorm = 3.9983, lr_0 = 9.9531e-04
Validation binary_cross_entropy = 0.070314
Epoch 6811
Loss = 5.0131e-02, PNorm = 689.8774, GNorm = 0.5439, lr_0 = 9.9531e-04
Validation binary_cross_entropy = 0.083329
Epoch 6812
Loss = 2.0373e-01, PNorm = 689.9176, GNorm = 6.6845, lr_0 = 9.9531e-04
Validation binary_cross_entropy = 0.085220
Epoch 6813
Loss = 6.7140e-02, PNorm = 689.9547, GNorm = 0.3046, lr_0 = 9.9531e-04
Validation binary_cross_entropy = 0.076305
Epoch 6814
Loss = 2.0331e-02, PNorm = 689.9904, GNorm = 2.5180, lr_0 = 9.9531e-04
Validation binary_cross_entropy = 0.076134
Epoch 6815
Loss = 3.6991e-02, PNorm = 690.0226, GNorm = 4.4038, lr_0 = 9.9531e-04
Validation binary_cross_entropy = 0.082416
Epoch 6816
Loss = 1.7449e-02, PNorm = 690.0487, GNorm = 0.6996, lr_0 = 9.9531e-04
Validation binary_cross_entropy = 0.076228
Epoch 6817
Loss = 2.2315e-02, PNorm = 690.0744, GNorm = 0.5734, lr_0 = 9.9531e-04
Validation binary_cross_entropy = 0.082385
Epoch 6818
Loss = 6.0997e-03, PNorm = 690.1035, GNorm = 0.4217, lr_0 = 9.9531e-04
Validation binary_cross_entropy = 0.094284
Epoch 6819
Loss = 3.5666e-03, PNorm = 690.1334, GNorm = 0.1269, lr_0 = 9.9531e-04
Loss = 1.7083e-02, PNorm = 690.1614, GNorm = 0.1048, lr_0 = 9.9531e-04
Validation binary_cross_entropy = 0.103094
Epoch 6820
Loss = 8.8143e-02, PNorm = 690.1846, GNorm = 9.0439, lr_0 = 9.9531e-04
Validation binary_cross_entropy = 0.100380
Epoch 6821
Loss = 6.7030e-02, PNorm = 690.2253, GNorm = 3.3809, lr_0 = 9.9531e-04
Validation binary_cross_entropy = 0.108796
Epoch 6822
Loss = 5.8052e-02, PNorm = 690.2647, GNorm = 2.0551, lr_0 = 9.9530e-04
Validation binary_cross_entropy = 0.101160
Epoch 6823
Loss = 3.3843e-02, PNorm = 690.2957, GNorm = 0.2043, lr_0 = 9.9530e-04
Validation binary_cross_entropy = 0.090131
Epoch 6824
Loss = 4.5600e-02, PNorm = 690.3227, GNorm = 0.1019, lr_0 = 9.9530e-04
Validation binary_cross_entropy = 0.091687
Epoch 6825
Loss = 2.3555e-02, PNorm = 690.3514, GNorm = 0.1399, lr_0 = 9.9530e-04
Validation binary_cross_entropy = 0.083408
Epoch 6826
Loss = 5.0954e-02, PNorm = 690.3832, GNorm = 14.5635, lr_0 = 9.9530e-04
Validation binary_cross_entropy = 0.092683
Epoch 6827
Loss = 2.3593e-02, PNorm = 690.4185, GNorm = 1.3037, lr_0 = 9.9530e-04
Validation binary_cross_entropy = 0.089017
Epoch 6828
Loss = 2.4524e-02, PNorm = 690.4399, GNorm = 1.6508, lr_0 = 9.9530e-04
Validation binary_cross_entropy = 0.093322
Epoch 6829
Loss = 1.1808e-02, PNorm = 690.4575, GNorm = 0.8225, lr_0 = 9.9530e-04
Loss = 1.0807e-01, PNorm = 690.4759, GNorm = 8.0180, lr_0 = 9.9530e-04
Validation binary_cross_entropy = 0.084825
Epoch 6830
Loss = 6.8230e-02, PNorm = 690.5071, GNorm = 0.1790, lr_0 = 9.9530e-04
Validation binary_cross_entropy = 0.133544
Epoch 6831
Loss = 4.6190e-02, PNorm = 690.5633, GNorm = 1.1386, lr_0 = 9.9530e-04
Validation binary_cross_entropy = 0.071997
Epoch 6832
Loss = 7.0077e-02, PNorm = 690.6235, GNorm = 8.9329, lr_0 = 9.9530e-04
Validation binary_cross_entropy = 0.089790
Epoch 6833
Loss = 3.1094e-02, PNorm = 690.6693, GNorm = 0.2994, lr_0 = 9.9530e-04
Validation binary_cross_entropy = 0.085543
Epoch 6834
Loss = 2.7441e-02, PNorm = 690.7002, GNorm = 0.1941, lr_0 = 9.9530e-04
Validation binary_cross_entropy = 0.082406
Epoch 6835
Loss = 7.3466e-02, PNorm = 690.7270, GNorm = 1.1727, lr_0 = 9.9529e-04
Validation binary_cross_entropy = 0.086845
Epoch 6836
Loss = 1.1294e-01, PNorm = 690.7611, GNorm = 5.3497, lr_0 = 9.9529e-04
Validation binary_cross_entropy = 0.090091
Epoch 6837
Loss = 3.5449e-02, PNorm = 690.7922, GNorm = 3.0968, lr_0 = 9.9529e-04
Validation binary_cross_entropy = 0.099729
Epoch 6838
Loss = 2.2799e-02, PNorm = 690.8294, GNorm = 2.0283, lr_0 = 9.9529e-04
Validation binary_cross_entropy = 0.091443
Epoch 6839
Loss = 5.8758e-02, PNorm = 690.8562, GNorm = 4.1796, lr_0 = 9.9529e-04
Loss = 2.1589e-02, PNorm = 690.8822, GNorm = 0.2543, lr_0 = 9.9529e-04
Validation binary_cross_entropy = 0.109028
Epoch 6840
Loss = 3.5779e-02, PNorm = 690.9098, GNorm = 4.3916, lr_0 = 9.9529e-04
Validation binary_cross_entropy = 0.096471
Epoch 6841
Loss = 2.8746e-02, PNorm = 690.9566, GNorm = 2.4444, lr_0 = 9.9529e-04
Validation binary_cross_entropy = 0.119182
Epoch 6842
Loss = 5.0751e-02, PNorm = 690.9935, GNorm = 1.4803, lr_0 = 9.9529e-04
Validation binary_cross_entropy = 0.087172
Epoch 6843
Loss = 1.3847e-02, PNorm = 691.0288, GNorm = 0.4488, lr_0 = 9.9529e-04
Validation binary_cross_entropy = 0.088911
Epoch 6844
Loss = 2.6818e-02, PNorm = 691.0803, GNorm = 0.3630, lr_0 = 9.9529e-04
Validation binary_cross_entropy = 0.141666
Epoch 6845
Loss = 2.5946e-01, PNorm = 691.1307, GNorm = 4.5963, lr_0 = 9.9529e-04
Validation binary_cross_entropy = 0.101693
Epoch 6846
Loss = 1.6465e-02, PNorm = 691.1747, GNorm = 0.5152, lr_0 = 9.9529e-04
Validation binary_cross_entropy = 0.104666
Epoch 6847
Loss = 6.3972e-02, PNorm = 691.2231, GNorm = 1.3495, lr_0 = 9.9529e-04
Validation binary_cross_entropy = 0.119609
Epoch 6848
Loss = 4.7044e-02, PNorm = 691.2746, GNorm = 0.1932, lr_0 = 9.9529e-04
Validation binary_cross_entropy = 0.140879
Epoch 6849
Loss = 4.0887e-03, PNorm = 691.3256, GNorm = 0.2193, lr_0 = 9.9528e-04
Loss = 2.6344e-02, PNorm = 691.3592, GNorm = 1.6678, lr_0 = 9.9528e-04
Validation binary_cross_entropy = 0.169848
Epoch 6850
Loss = 3.7170e-02, PNorm = 691.3769, GNorm = 10.3731, lr_0 = 9.9528e-04
Validation binary_cross_entropy = 0.272688
Epoch 6851
Loss = 2.3208e-02, PNorm = 691.4089, GNorm = 0.5440, lr_0 = 9.9528e-04
Validation binary_cross_entropy = 0.290311
Epoch 6852
Loss = 6.7723e-02, PNorm = 691.4619, GNorm = 0.1497, lr_0 = 9.9528e-04
Validation binary_cross_entropy = 0.191005
Epoch 6853
Loss = 1.3193e-01, PNorm = 691.5085, GNorm = 5.1885, lr_0 = 9.9528e-04
Validation binary_cross_entropy = 0.146222
Epoch 6854
Loss = 6.0229e-02, PNorm = 691.5565, GNorm = 1.6518, lr_0 = 9.9528e-04
Validation binary_cross_entropy = 0.129619
Epoch 6855
Loss = 8.8362e-02, PNorm = 691.6070, GNorm = 11.7424, lr_0 = 9.9528e-04
Validation binary_cross_entropy = 0.234367
Epoch 6856
Loss = 4.6063e-02, PNorm = 691.6770, GNorm = 0.9685, lr_0 = 9.9528e-04
Validation binary_cross_entropy = 0.090823
Epoch 6857
Loss = 8.3763e-02, PNorm = 691.7648, GNorm = 3.3335, lr_0 = 9.9528e-04
Validation binary_cross_entropy = 0.134072
Epoch 6858
Loss = 5.9027e-02, PNorm = 691.8816, GNorm = 2.9434, lr_0 = 9.9528e-04
Validation binary_cross_entropy = 0.099015
Epoch 6859
Loss = 2.1672e-02, PNorm = 691.9558, GNorm = 1.6872, lr_0 = 9.9528e-04
Loss = 2.0921e-02, PNorm = 692.0199, GNorm = 0.5718, lr_0 = 9.9528e-04
Validation binary_cross_entropy = 0.102721
Epoch 6860
Loss = 3.2388e-02, PNorm = 692.0718, GNorm = 1.0863, lr_0 = 9.9528e-04
Validation binary_cross_entropy = 0.104763
Epoch 6861
Loss = 9.5864e-02, PNorm = 692.1216, GNorm = 1.4107, lr_0 = 9.9527e-04
Validation binary_cross_entropy = 0.087404
Epoch 6862
Loss = 4.2681e-02, PNorm = 692.1769, GNorm = 1.4004, lr_0 = 9.9527e-04
Validation binary_cross_entropy = 0.095914
Epoch 6863
Loss = 4.7299e-02, PNorm = 692.2193, GNorm = 0.7237, lr_0 = 9.9527e-04
Validation binary_cross_entropy = 0.096309
Epoch 6864
Loss = 3.8471e-02, PNorm = 692.2496, GNorm = 4.9672, lr_0 = 9.9527e-04
Validation binary_cross_entropy = 0.094332
Epoch 6865
Loss = 2.9324e-02, PNorm = 692.2759, GNorm = 1.7835, lr_0 = 9.9527e-04
Validation binary_cross_entropy = 0.088392
Epoch 6866
Loss = 1.3748e-02, PNorm = 692.3021, GNorm = 1.8699, lr_0 = 9.9527e-04
Validation binary_cross_entropy = 0.095222
Epoch 6867
Loss = 1.5130e-01, PNorm = 692.3412, GNorm = 3.0823, lr_0 = 9.9527e-04
Validation binary_cross_entropy = 0.097777
Epoch 6868
Loss = 1.9014e-02, PNorm = 692.3831, GNorm = 2.2989, lr_0 = 9.9527e-04
Validation binary_cross_entropy = 0.091753
Epoch 6869
Loss = 1.4828e-02, PNorm = 692.4166, GNorm = 0.7961, lr_0 = 9.9527e-04
Loss = 4.5517e-02, PNorm = 692.4495, GNorm = 3.4332, lr_0 = 9.9527e-04
Validation binary_cross_entropy = 0.099469
Epoch 6870
Loss = 1.0530e-02, PNorm = 692.4816, GNorm = 0.1411, lr_0 = 9.9527e-04
Validation binary_cross_entropy = 0.103806
Epoch 6871
Loss = 1.2975e-01, PNorm = 692.5195, GNorm = 0.9021, lr_0 = 9.9527e-04
Validation binary_cross_entropy = 0.117394
Epoch 6872
Loss = 2.6701e-02, PNorm = 692.5547, GNorm = 0.5201, lr_0 = 9.9527e-04
Validation binary_cross_entropy = 0.097973
Epoch 6873
Loss = 6.0593e-02, PNorm = 692.5841, GNorm = 0.3237, lr_0 = 9.9527e-04
Validation binary_cross_entropy = 0.107383
Epoch 6874
Loss = 3.0188e-02, PNorm = 692.6182, GNorm = 0.2836, lr_0 = 9.9527e-04
Validation binary_cross_entropy = 0.105282
Epoch 6875
Loss = 1.5678e-02, PNorm = 692.6540, GNorm = 1.4744, lr_0 = 9.9526e-04
Validation binary_cross_entropy = 0.140021
Epoch 6876
Loss = 9.1787e-03, PNorm = 692.6895, GNorm = 0.9285, lr_0 = 9.9526e-04
Validation binary_cross_entropy = 0.147721
Epoch 6877
Loss = 1.0370e-02, PNorm = 692.7150, GNorm = 2.0462, lr_0 = 9.9526e-04
Validation binary_cross_entropy = 0.108578
Epoch 6878
Loss = 1.2140e-02, PNorm = 692.7426, GNorm = 0.8779, lr_0 = 9.9526e-04
Validation binary_cross_entropy = 0.118744
Epoch 6879
Loss = 6.5673e-03, PNorm = 692.7794, GNorm = 0.2663, lr_0 = 9.9526e-04
Loss = 3.8666e-02, PNorm = 692.8145, GNorm = 1.2075, lr_0 = 9.9526e-04
Validation binary_cross_entropy = 0.124890
Epoch 6880
Loss = 6.4391e-02, PNorm = 692.8370, GNorm = 0.7050, lr_0 = 9.9526e-04
Validation binary_cross_entropy = 0.109648
Epoch 6881
Loss = 1.2572e-02, PNorm = 692.8715, GNorm = 0.2659, lr_0 = 9.9526e-04
Validation binary_cross_entropy = 0.103590
Epoch 6882
Loss = 1.4594e-01, PNorm = 692.9420, GNorm = 0.8523, lr_0 = 9.9526e-04
Validation binary_cross_entropy = 0.128443
Epoch 6883
Loss = 9.5741e-02, PNorm = 693.0252, GNorm = 2.9567, lr_0 = 9.9526e-04
Validation binary_cross_entropy = 0.092704
Epoch 6884
Loss = 7.7819e-02, PNorm = 693.0828, GNorm = 1.0563, lr_0 = 9.9526e-04
Validation binary_cross_entropy = 0.091850
Epoch 6885
Loss = 3.5692e-02, PNorm = 693.1316, GNorm = 0.2935, lr_0 = 9.9526e-04
Validation binary_cross_entropy = 0.093431
Epoch 6886
Loss = 4.2073e-02, PNorm = 693.1691, GNorm = 4.5750, lr_0 = 9.9526e-04
Validation binary_cross_entropy = 0.097191
Epoch 6887
Loss = 1.9920e-02, PNorm = 693.2018, GNorm = 0.7014, lr_0 = 9.9526e-04
Validation binary_cross_entropy = 0.113539
Epoch 6888
Loss = 9.5698e-02, PNorm = 693.2394, GNorm = 4.6251, lr_0 = 9.9526e-04
Validation binary_cross_entropy = 0.109539
Epoch 6889
Loss = 6.3744e-02, PNorm = 693.2690, GNorm = 1.0867, lr_0 = 9.9525e-04
Loss = 3.5247e-02, PNorm = 693.3050, GNorm = 0.1040, lr_0 = 9.9525e-04
Validation binary_cross_entropy = 0.083356
Epoch 6890
Loss = 2.0112e-02, PNorm = 693.3424, GNorm = 2.4409, lr_0 = 9.9525e-04
Validation binary_cross_entropy = 0.088649
Epoch 6891
Loss = 8.5301e-02, PNorm = 693.3747, GNorm = 1.5079, lr_0 = 9.9525e-04
Validation binary_cross_entropy = 0.112409
Epoch 6892
Loss = 6.1982e-02, PNorm = 693.4070, GNorm = 2.5679, lr_0 = 9.9525e-04
Validation binary_cross_entropy = 0.114047
Epoch 6893
Loss = 1.3440e-02, PNorm = 693.4390, GNorm = 1.2195, lr_0 = 9.9525e-04
Validation binary_cross_entropy = 0.097953
Epoch 6894
Loss = 4.7973e-02, PNorm = 693.4731, GNorm = 2.8918, lr_0 = 9.9525e-04
Validation binary_cross_entropy = 0.098551
Epoch 6895
Loss = 7.6519e-03, PNorm = 693.5098, GNorm = 0.4411, lr_0 = 9.9525e-04
Validation binary_cross_entropy = 0.096742
Epoch 6896
Loss = 3.4827e-03, PNorm = 693.5354, GNorm = 0.2503, lr_0 = 9.9525e-04
Validation binary_cross_entropy = 0.118247
Epoch 6897
Loss = 1.3689e-02, PNorm = 693.5683, GNorm = 0.9943, lr_0 = 9.9525e-04
Validation binary_cross_entropy = 0.128109
Epoch 6898
Loss = 2.8344e-01, PNorm = 693.5928, GNorm = 0.1062, lr_0 = 9.9525e-04
Validation binary_cross_entropy = 0.079236
Epoch 6899
Loss = 9.4803e-03, PNorm = 693.6176, GNorm = 0.3224, lr_0 = 9.9525e-04
Loss = 3.7474e-02, PNorm = 693.6537, GNorm = 0.7788, lr_0 = 9.9525e-04
Validation binary_cross_entropy = 0.094007
Epoch 6900
Loss = 3.1767e-02, PNorm = 693.6938, GNorm = 0.0101, lr_0 = 9.9525e-04
Validation binary_cross_entropy = 0.107528
Epoch 6901
Loss = 2.3133e-02, PNorm = 693.7231, GNorm = 2.3638, lr_0 = 9.9524e-04
Validation binary_cross_entropy = 0.116061
Epoch 6902
Loss = 4.9156e-02, PNorm = 693.7458, GNorm = 0.3441, lr_0 = 9.9524e-04
Validation binary_cross_entropy = 0.107406
Epoch 6903
Loss = 5.6891e-02, PNorm = 693.7762, GNorm = 1.1312, lr_0 = 9.9524e-04
Validation binary_cross_entropy = 0.135044
Epoch 6904
Loss = 1.1727e-02, PNorm = 693.8065, GNorm = 2.0372, lr_0 = 9.9524e-04
Validation binary_cross_entropy = 0.284937
Epoch 6905
Loss = 2.1037e-01, PNorm = 693.8366, GNorm = 0.0223, lr_0 = 9.9524e-04
Validation binary_cross_entropy = 0.143798
Epoch 6906
Loss = 4.3830e-02, PNorm = 693.8679, GNorm = 2.5525, lr_0 = 9.9524e-04
Validation binary_cross_entropy = 0.160626
Epoch 6907
Loss = 2.1503e-02, PNorm = 693.9051, GNorm = 0.5339, lr_0 = 9.9524e-04
Validation binary_cross_entropy = 0.123751
Epoch 6908
Loss = 3.6349e-02, PNorm = 693.9759, GNorm = 0.7444, lr_0 = 9.9524e-04
Validation binary_cross_entropy = 0.175104
Epoch 6909
Loss = 1.1612e-01, PNorm = 694.0531, GNorm = 1.6202, lr_0 = 9.9524e-04
Loss = 9.9830e-02, PNorm = 694.1236, GNorm = 0.0253, lr_0 = 9.9524e-04
Validation binary_cross_entropy = 0.144322
Epoch 6910
Loss = 4.4723e-02, PNorm = 694.1834, GNorm = 0.0763, lr_0 = 9.9524e-04
Validation binary_cross_entropy = 0.136729
Epoch 6911
Loss = 4.0512e-02, PNorm = 694.2328, GNorm = 0.1392, lr_0 = 9.9524e-04
Validation binary_cross_entropy = 0.135918
Epoch 6912
Loss = 2.9080e+00, PNorm = 694.2824, GNorm = 597.0567, lr_0 = 9.9524e-04
Validation binary_cross_entropy = 0.089325
Epoch 6913
Loss = 6.9654e-01, PNorm = 694.4427, GNorm = 6.0880, lr_0 = 9.9524e-04
Validation binary_cross_entropy = 0.309356
Epoch 6914
Loss = 1.3818e-01, PNorm = 694.6108, GNorm = 4.6725, lr_0 = 9.9524e-04
Validation binary_cross_entropy = 0.115664
Epoch 6915
Loss = 4.9796e-02, PNorm = 694.7197, GNorm = 4.6777, lr_0 = 9.9523e-04
Validation binary_cross_entropy = 0.140482
Epoch 6916
Loss = 7.2968e-02, PNorm = 694.7974, GNorm = 4.9989, lr_0 = 9.9523e-04
Validation binary_cross_entropy = 0.124151
Epoch 6917
Loss = 2.0520e-01, PNorm = 694.8544, GNorm = 10.9883, lr_0 = 9.9523e-04
Validation binary_cross_entropy = 0.146835
Epoch 6918
Loss = 1.4249e-01, PNorm = 694.9149, GNorm = 5.6986, lr_0 = 9.9523e-04
Validation binary_cross_entropy = 0.090573
Epoch 6919
Loss = 1.7953e-02, PNorm = 694.9698, GNorm = 0.7199, lr_0 = 9.9523e-04
Loss = 1.7057e-01, PNorm = 695.0409, GNorm = 1.4523, lr_0 = 9.9523e-04
Validation binary_cross_entropy = 0.155656
Epoch 6920
Loss = 8.8343e-02, PNorm = 695.0962, GNorm = 4.6709, lr_0 = 9.9523e-04
Validation binary_cross_entropy = 0.104786
Epoch 6921
Loss = 9.0859e-02, PNorm = 695.1403, GNorm = 0.2436, lr_0 = 9.9523e-04
Validation binary_cross_entropy = 0.086715
Epoch 6922
Loss = 5.4403e-02, PNorm = 695.1904, GNorm = 1.9254, lr_0 = 9.9523e-04
Validation binary_cross_entropy = 0.094643
Epoch 6923
Loss = 9.4355e-02, PNorm = 695.2370, GNorm = 3.4823, lr_0 = 9.9523e-04
Validation binary_cross_entropy = 0.088607
Epoch 6924
Loss = 7.6412e-02, PNorm = 695.2837, GNorm = 1.6774, lr_0 = 9.9523e-04
Validation binary_cross_entropy = 0.097811
Epoch 6925
Loss = 2.1113e-02, PNorm = 695.3196, GNorm = 0.9238, lr_0 = 9.9523e-04
Validation binary_cross_entropy = 0.122324
Epoch 6926
Loss = 2.7759e-02, PNorm = 695.3551, GNorm = 1.3990, lr_0 = 9.9523e-04
Validation binary_cross_entropy = 0.164119
Epoch 6927
Loss = 6.7996e-02, PNorm = 695.3804, GNorm = 4.6705, lr_0 = 9.9523e-04
Validation binary_cross_entropy = 0.137793
Epoch 6928
Loss = 1.6634e-01, PNorm = 695.4167, GNorm = 12.2876, lr_0 = 9.9522e-04
Validation binary_cross_entropy = 0.096638
Epoch 6929
Loss = 2.9269e-01, PNorm = 695.4659, GNorm = 15.3023, lr_0 = 9.9522e-04
Loss = 5.3816e-02, PNorm = 695.5461, GNorm = 0.4875, lr_0 = 9.9522e-04
Validation binary_cross_entropy = 0.095595
Epoch 6930
Loss = 3.8100e-02, PNorm = 695.6205, GNorm = 0.6547, lr_0 = 9.9522e-04
Validation binary_cross_entropy = 0.110218
Epoch 6931
Loss = 2.5672e-02, PNorm = 695.6717, GNorm = 1.6900, lr_0 = 9.9522e-04
Validation binary_cross_entropy = 0.104884
Epoch 6932
Loss = 5.7376e-02, PNorm = 695.7072, GNorm = 0.8756, lr_0 = 9.9522e-04
Validation binary_cross_entropy = 0.107186
Epoch 6933
Loss = 3.7728e-02, PNorm = 695.7355, GNorm = 0.0429, lr_0 = 9.9522e-04
Validation binary_cross_entropy = 0.127397
Epoch 6934
Loss = 3.2646e-02, PNorm = 695.7557, GNorm = 4.8258, lr_0 = 9.9522e-04
Validation binary_cross_entropy = 0.171683
Epoch 6935
Loss = 8.4940e-02, PNorm = 695.7857, GNorm = 3.7852, lr_0 = 9.9522e-04
Validation binary_cross_entropy = 0.125912
Epoch 6936
Loss = 1.0437e-01, PNorm = 695.8120, GNorm = 21.4152, lr_0 = 9.9522e-04
Validation binary_cross_entropy = 0.112508
Epoch 6937
Loss = 7.9204e-02, PNorm = 695.8537, GNorm = 3.4978, lr_0 = 9.9522e-04
Validation binary_cross_entropy = 0.142767
Epoch 6938
Loss = 9.1193e-03, PNorm = 695.8943, GNorm = 1.1633, lr_0 = 9.9522e-04
Validation binary_cross_entropy = 0.090767
Epoch 6939
Loss = 3.2182e-02, PNorm = 695.9292, GNorm = 1.6706, lr_0 = 9.9522e-04
Loss = 1.8843e-02, PNorm = 695.9695, GNorm = 0.3299, lr_0 = 9.9522e-04
Validation binary_cross_entropy = 0.088422
Epoch 6940
Loss = 3.7736e-02, PNorm = 696.0016, GNorm = 1.3307, lr_0 = 9.9522e-04
Validation binary_cross_entropy = 0.090589
Epoch 6941
Loss = 4.5241e-02, PNorm = 696.0300, GNorm = 0.1130, lr_0 = 9.9521e-04
Validation binary_cross_entropy = 0.101143
Epoch 6942
Loss = 4.7068e-02, PNorm = 696.0654, GNorm = 0.0632, lr_0 = 9.9521e-04
Validation binary_cross_entropy = 0.154622
Epoch 6943
Loss = 1.1324e-01, PNorm = 696.0951, GNorm = 0.0864, lr_0 = 9.9521e-04
Validation binary_cross_entropy = 0.176373
Epoch 6944
Loss = 4.4923e-02, PNorm = 696.1348, GNorm = 3.9075, lr_0 = 9.9521e-04
Validation binary_cross_entropy = 0.192852
Epoch 6945
Loss = 9.3675e-02, PNorm = 696.1825, GNorm = 2.6609, lr_0 = 9.9521e-04
Validation binary_cross_entropy = 0.132291
Epoch 6946
Loss = 6.3946e-02, PNorm = 696.2393, GNorm = 3.7214, lr_0 = 9.9521e-04
Validation binary_cross_entropy = 0.154630
Epoch 6947
Loss = 3.4169e-02, PNorm = 696.2935, GNorm = 0.3165, lr_0 = 9.9521e-04
Validation binary_cross_entropy = 0.164758
Epoch 6948
Loss = 2.7350e-02, PNorm = 696.3342, GNorm = 6.2751, lr_0 = 9.9521e-04
Validation binary_cross_entropy = 0.191934
Epoch 6949
Loss = 1.1822e-02, PNorm = 696.3675, GNorm = 0.7720, lr_0 = 9.9521e-04
Loss = 2.9825e-02, PNorm = 696.3985, GNorm = 0.2217, lr_0 = 9.9521e-04
Validation binary_cross_entropy = 0.184358
Epoch 6950
Loss = 3.4059e-02, PNorm = 696.4261, GNorm = 0.4092, lr_0 = 9.9521e-04
Validation binary_cross_entropy = 0.222011
Epoch 6951
Loss = 5.4001e-02, PNorm = 696.4513, GNorm = 4.8259, lr_0 = 9.9521e-04
Validation binary_cross_entropy = 0.205129
Epoch 6952
Loss = 7.4228e-02, PNorm = 696.4802, GNorm = 1.6300, lr_0 = 9.9521e-04
Validation binary_cross_entropy = 0.156597
Epoch 6953
Loss = 2.4418e-02, PNorm = 696.5171, GNorm = 1.8234, lr_0 = 9.9521e-04
Validation binary_cross_entropy = 0.196766
Epoch 6954
Loss = 5.4962e-02, PNorm = 696.5632, GNorm = 0.6364, lr_0 = 9.9521e-04
Validation binary_cross_entropy = 0.116944
Epoch 6955
Loss = 6.2964e-02, PNorm = 696.6148, GNorm = 2.1617, lr_0 = 9.9520e-04
Validation binary_cross_entropy = 0.161730
Epoch 6956
Loss = 1.0663e-02, PNorm = 696.6626, GNorm = 0.4890, lr_0 = 9.9520e-04
Validation binary_cross_entropy = 0.189441
Epoch 6957
Loss = 8.3097e-03, PNorm = 696.6939, GNorm = 0.0278, lr_0 = 9.9520e-04
Validation binary_cross_entropy = 0.160110
Epoch 6958
Loss = 3.5907e-02, PNorm = 696.7267, GNorm = 1.1532, lr_0 = 9.9520e-04
Validation binary_cross_entropy = 0.166657
Epoch 6959
Loss = 1.4039e-02, PNorm = 696.7642, GNorm = 0.5514, lr_0 = 9.9520e-04
Loss = 5.9627e-02, PNorm = 696.7978, GNorm = 1.5066, lr_0 = 9.9520e-04
Validation binary_cross_entropy = 0.171857
Epoch 6960
Loss = 1.8367e-02, PNorm = 696.8289, GNorm = 0.1595, lr_0 = 9.9520e-04
Validation binary_cross_entropy = 0.160986
Epoch 6961
Loss = 5.5160e-01, PNorm = 696.8669, GNorm = 1.3301, lr_0 = 9.9520e-04
Validation binary_cross_entropy = 0.124724
Epoch 6962
Loss = 1.0076e-01, PNorm = 696.9136, GNorm = 0.3804, lr_0 = 9.9520e-04
Validation binary_cross_entropy = 0.099987
Epoch 6963
Loss = 6.9572e-02, PNorm = 696.9553, GNorm = 4.2310, lr_0 = 9.9520e-04
Validation binary_cross_entropy = 0.113049
Epoch 6964
Loss = 3.8700e-02, PNorm = 696.9929, GNorm = 2.0166, lr_0 = 9.9520e-04
Validation binary_cross_entropy = 0.137639
Epoch 6965
Loss = 7.9801e-02, PNorm = 697.0249, GNorm = 1.5661, lr_0 = 9.9520e-04
Validation binary_cross_entropy = 0.115800
Epoch 6966
Loss = 1.0342e-02, PNorm = 697.0525, GNorm = 0.5638, lr_0 = 9.9520e-04
Validation binary_cross_entropy = 0.164154
Epoch 6967
Loss = 1.0701e-01, PNorm = 697.0844, GNorm = 3.7684, lr_0 = 9.9520e-04
Validation binary_cross_entropy = 0.137591
Epoch 6968
Loss = 1.6661e-02, PNorm = 697.1175, GNorm = 0.8642, lr_0 = 9.9519e-04
Validation binary_cross_entropy = 0.124378
Epoch 6969
Loss = 3.1575e-02, PNorm = 697.1540, GNorm = 1.6513, lr_0 = 9.9519e-04
Loss = 8.3624e-02, PNorm = 697.1977, GNorm = 1.1879, lr_0 = 9.9519e-04
Validation binary_cross_entropy = 0.128100
Epoch 6970
Loss = 4.1817e-02, PNorm = 697.2346, GNorm = 7.3920, lr_0 = 9.9519e-04
Validation binary_cross_entropy = 0.129220
Epoch 6971
Loss = 5.3701e-02, PNorm = 697.2742, GNorm = 0.6323, lr_0 = 9.9519e-04
Validation binary_cross_entropy = 0.148429
Epoch 6972
Loss = 3.6082e-02, PNorm = 697.3029, GNorm = 0.9350, lr_0 = 9.9519e-04
Validation binary_cross_entropy = 0.158014
Epoch 6973
Loss = 4.4363e-02, PNorm = 697.3219, GNorm = 2.2694, lr_0 = 9.9519e-04
Validation binary_cross_entropy = 0.145379
Epoch 6974
Loss = 6.2034e-02, PNorm = 697.3425, GNorm = 0.4735, lr_0 = 9.9519e-04
Validation binary_cross_entropy = 0.121968
Epoch 6975
Loss = 3.5802e-02, PNorm = 697.3642, GNorm = 1.0487, lr_0 = 9.9519e-04
Validation binary_cross_entropy = 0.129815
Epoch 6976
Loss = 4.6875e-02, PNorm = 697.3913, GNorm = 3.9897, lr_0 = 9.9519e-04
Validation binary_cross_entropy = 0.140233
Epoch 6977
Loss = 7.1342e-03, PNorm = 697.4158, GNorm = 0.4565, lr_0 = 9.9519e-04
Validation binary_cross_entropy = 0.174636
Epoch 6978
Loss = 1.4683e-01, PNorm = 697.4437, GNorm = 6.9989, lr_0 = 9.9519e-04
Validation binary_cross_entropy = 0.127224
Epoch 6979
Loss = 3.9401e-02, PNorm = 697.4656, GNorm = 1.6776, lr_0 = 9.9519e-04
Loss = 2.0740e-02, PNorm = 697.4994, GNorm = 2.9209, lr_0 = 9.9519e-04
Validation binary_cross_entropy = 0.133314
Epoch 6980
Loss = 3.0080e-02, PNorm = 697.5300, GNorm = 0.1290, lr_0 = 9.9519e-04
Validation binary_cross_entropy = 0.159138
Epoch 6981
Loss = 6.8147e-02, PNorm = 697.5595, GNorm = 0.8644, lr_0 = 9.9518e-04
Validation binary_cross_entropy = 0.126164
Epoch 6982
Loss = 5.4618e-02, PNorm = 697.6035, GNorm = 0.9110, lr_0 = 9.9518e-04
Validation binary_cross_entropy = 0.104845
Epoch 6983
Loss = 3.4414e-02, PNorm = 697.6423, GNorm = 1.8783, lr_0 = 9.9518e-04
Validation binary_cross_entropy = 0.099335
Epoch 6984
Loss = 6.0595e-02, PNorm = 697.6795, GNorm = 0.5813, lr_0 = 9.9518e-04
Validation binary_cross_entropy = 0.096958
Epoch 6985
Loss = 2.0660e-02, PNorm = 697.7124, GNorm = 1.1773, lr_0 = 9.9518e-04
Validation binary_cross_entropy = 0.091760
Epoch 6986
Loss = 3.7818e-02, PNorm = 697.7370, GNorm = 2.2801, lr_0 = 9.9518e-04
Validation binary_cross_entropy = 0.106593
Epoch 6987
Loss = 4.1459e-02, PNorm = 697.7612, GNorm = 0.6977, lr_0 = 9.9518e-04
Validation binary_cross_entropy = 0.105063
Epoch 6988
Loss = 6.1064e-03, PNorm = 697.7962, GNorm = 0.5320, lr_0 = 9.9518e-04
Validation binary_cross_entropy = 0.115458
Epoch 6989
Loss = 1.5120e-02, PNorm = 697.8468, GNorm = 0.8176, lr_0 = 9.9518e-04
Loss = 4.7520e-02, PNorm = 697.8887, GNorm = 4.4920, lr_0 = 9.9518e-04
Validation binary_cross_entropy = 0.087731
Epoch 6990
Loss = 2.4799e-02, PNorm = 697.9240, GNorm = 1.4362, lr_0 = 9.9518e-04
Validation binary_cross_entropy = 0.082060
Epoch 6991
Loss = 3.7401e-02, PNorm = 697.9584, GNorm = 4.1884, lr_0 = 9.9518e-04
Validation binary_cross_entropy = 0.095025
Epoch 6992
Loss = 7.3619e-02, PNorm = 697.9924, GNorm = 12.5499, lr_0 = 9.9518e-04
Validation binary_cross_entropy = 0.134326
Epoch 6993
Loss = 9.9902e-02, PNorm = 698.0341, GNorm = 2.2263, lr_0 = 9.9518e-04
Validation binary_cross_entropy = 0.112524
Epoch 6994
Loss = 2.3583e-02, PNorm = 698.0828, GNorm = 1.2106, lr_0 = 9.9517e-04
Validation binary_cross_entropy = 0.125476
Epoch 6995
Loss = 8.2317e-02, PNorm = 698.1185, GNorm = 1.1920, lr_0 = 9.9517e-04
Validation binary_cross_entropy = 0.107608
Epoch 6996
Loss = 3.9972e-02, PNorm = 698.1638, GNorm = 4.4277, lr_0 = 9.9517e-04
Validation binary_cross_entropy = 0.124872
Epoch 6997
Loss = 3.0169e-03, PNorm = 698.2024, GNorm = 0.0829, lr_0 = 9.9517e-04
Validation binary_cross_entropy = 0.092184
Epoch 6998
Loss = 9.8757e-03, PNorm = 698.2299, GNorm = 0.6889, lr_0 = 9.9517e-04
Validation binary_cross_entropy = 0.095252
Epoch 6999
Loss = 9.1237e-03, PNorm = 698.2691, GNorm = 0.8453, lr_0 = 9.9517e-04
Loss = 5.4901e-02, PNorm = 698.3160, GNorm = 3.9689, lr_0 = 9.9517e-04
Validation binary_cross_entropy = 0.113950
Epoch 7000
Loss = 2.6477e-02, PNorm = 698.3490, GNorm = 0.3386, lr_0 = 9.9517e-04
Validation binary_cross_entropy = 0.163210
Epoch 7001
Loss = 8.0190e-02, PNorm = 698.3703, GNorm = 1.6437, lr_0 = 9.9517e-04
Validation binary_cross_entropy = 0.107176
Epoch 7002
Loss = 6.7526e-03, PNorm = 698.3885, GNorm = 0.1152, lr_0 = 9.9517e-04
Validation binary_cross_entropy = 0.096133
Epoch 7003
Loss = 3.9729e-02, PNorm = 698.4191, GNorm = 1.2511, lr_0 = 9.9517e-04
Validation binary_cross_entropy = 0.124312
Epoch 7004
Loss = 4.1966e-02, PNorm = 698.4512, GNorm = 0.2000, lr_0 = 9.9517e-04
Validation binary_cross_entropy = 0.133870
Epoch 7005
Loss = 1.0758e-01, PNorm = 698.4824, GNorm = 1.3853, lr_0 = 9.9517e-04
Validation binary_cross_entropy = 0.095393
Epoch 7006
Loss = 2.9471e-02, PNorm = 698.5111, GNorm = 0.0419, lr_0 = 9.9517e-04
Validation binary_cross_entropy = 0.107894
Epoch 7007
Loss = 6.0287e-02, PNorm = 698.5409, GNorm = 4.0239, lr_0 = 9.9517e-04
Validation binary_cross_entropy = 0.116838
Epoch 7008
Loss = 1.6678e-02, PNorm = 698.5641, GNorm = 0.7381, lr_0 = 9.9516e-04
Validation binary_cross_entropy = 0.099194
Epoch 7009
Loss = 1.1261e-02, PNorm = 698.5815, GNorm = 0.4270, lr_0 = 9.9516e-04
Loss = 5.8561e-02, PNorm = 698.6115, GNorm = 0.7053, lr_0 = 9.9516e-04
Validation binary_cross_entropy = 0.096882
Epoch 7010
Loss = 2.7748e-02, PNorm = 698.6404, GNorm = 1.6788, lr_0 = 9.9516e-04
Validation binary_cross_entropy = 0.093271
Epoch 7011
Loss = 7.0153e-02, PNorm = 698.6673, GNorm = 2.7080, lr_0 = 9.9516e-04
Validation binary_cross_entropy = 0.092312
Epoch 7012
Loss = 3.6705e-02, PNorm = 698.6901, GNorm = 1.8453, lr_0 = 9.9516e-04
Validation binary_cross_entropy = 0.086884
Epoch 7013
Loss = 2.4714e-02, PNorm = 698.7225, GNorm = 2.6566, lr_0 = 9.9516e-04
Validation binary_cross_entropy = 0.093686
Epoch 7014
Loss = 3.6558e-02, PNorm = 698.7521, GNorm = 0.9236, lr_0 = 9.9516e-04
Validation binary_cross_entropy = 0.091430
Epoch 7015
Loss = 1.8161e-02, PNorm = 698.7744, GNorm = 1.4479, lr_0 = 9.9516e-04
Validation binary_cross_entropy = 0.098936
Epoch 7016
Loss = 3.1459e-02, PNorm = 698.8050, GNorm = 0.9968, lr_0 = 9.9516e-04
Validation binary_cross_entropy = 0.102508
Epoch 7017
Loss = 6.6178e-02, PNorm = 698.8422, GNorm = 4.4140, lr_0 = 9.9516e-04
Validation binary_cross_entropy = 0.101984
Epoch 7018
Loss = 3.8558e-02, PNorm = 698.8778, GNorm = 2.4050, lr_0 = 9.9516e-04
Validation binary_cross_entropy = 0.102763
Epoch 7019
Loss = 9.9791e-04, PNorm = 698.9108, GNorm = 0.0438, lr_0 = 9.9516e-04
Loss = 2.4238e-02, PNorm = 698.9439, GNorm = 4.3922, lr_0 = 9.9516e-04
Validation binary_cross_entropy = 0.114805
Epoch 7020
Loss = 7.0049e-02, PNorm = 698.9751, GNorm = 6.9844, lr_0 = 9.9515e-04
Validation binary_cross_entropy = 0.105071
Epoch 7021
Loss = 4.8877e-02, PNorm = 699.0033, GNorm = 3.0130, lr_0 = 9.9515e-04
Validation binary_cross_entropy = 0.082400
Epoch 7022
Loss = 2.5884e-02, PNorm = 699.0434, GNorm = 0.6358, lr_0 = 9.9515e-04
Validation binary_cross_entropy = 0.090286
Epoch 7023
Loss = 1.8961e-02, PNorm = 699.0733, GNorm = 1.1043, lr_0 = 9.9515e-04
Validation binary_cross_entropy = 0.097252
Epoch 7024
Loss = 3.5385e-02, PNorm = 699.0948, GNorm = 0.1051, lr_0 = 9.9515e-04
Validation binary_cross_entropy = 0.118811
Epoch 7025
Loss = 4.0261e-02, PNorm = 699.1164, GNorm = 0.1357, lr_0 = 9.9515e-04
Validation binary_cross_entropy = 0.092053
Epoch 7026
Loss = 1.4322e-02, PNorm = 699.1463, GNorm = 0.0825, lr_0 = 9.9515e-04
Validation binary_cross_entropy = 0.091025
Epoch 7027
Loss = 5.6093e-02, PNorm = 699.1790, GNorm = 0.9692, lr_0 = 9.9515e-04
Validation binary_cross_entropy = 0.084074
Epoch 7028
Loss = 1.8576e-01, PNorm = 699.2120, GNorm = 10.1711, lr_0 = 9.9515e-04
Validation binary_cross_entropy = 0.109452
Epoch 7029
Loss = 1.0124e-03, PNorm = 699.2484, GNorm = 0.1064, lr_0 = 9.9515e-04
Loss = 3.7686e-02, PNorm = 699.2714, GNorm = 0.1418, lr_0 = 9.9515e-04
Validation binary_cross_entropy = 0.086073
Epoch 7030
Loss = 6.9503e-02, PNorm = 699.3029, GNorm = 2.1134, lr_0 = 9.9515e-04
Validation binary_cross_entropy = 0.071925
Epoch 7031
Loss = 9.2635e-02, PNorm = 699.3462, GNorm = 0.3382, lr_0 = 9.9515e-04
Validation binary_cross_entropy = 0.071073
Epoch 7032
Loss = 4.5945e-02, PNorm = 699.4131, GNorm = 2.4175, lr_0 = 9.9515e-04
Validation binary_cross_entropy = 0.101637
Epoch 7033
Loss = 1.4227e-01, PNorm = 699.4665, GNorm = 0.2494, lr_0 = 9.9515e-04
Validation binary_cross_entropy = 0.101751
Epoch 7034
Loss = 3.4750e-02, PNorm = 699.5073, GNorm = 1.0328, lr_0 = 9.9514e-04
Validation binary_cross_entropy = 0.122624
Epoch 7035
Loss = 6.0201e-02, PNorm = 699.5484, GNorm = 0.2242, lr_0 = 9.9514e-04
Validation binary_cross_entropy = 0.119833
Epoch 7036
Loss = 2.7201e-02, PNorm = 699.5913, GNorm = 0.3156, lr_0 = 9.9514e-04
Validation binary_cross_entropy = 0.148886
Epoch 7037
Loss = 5.9299e-03, PNorm = 699.6244, GNorm = 0.4098, lr_0 = 9.9514e-04
Validation binary_cross_entropy = 0.138814
Epoch 7038
Loss = 3.3910e-02, PNorm = 699.6498, GNorm = 3.1007, lr_0 = 9.9514e-04
Validation binary_cross_entropy = 0.165165
Epoch 7039
Loss = 7.0355e-03, PNorm = 699.6840, GNorm = 0.4982, lr_0 = 9.9514e-04
Loss = 6.3941e-02, PNorm = 699.7065, GNorm = 0.6606, lr_0 = 9.9514e-04
Validation binary_cross_entropy = 0.117573
Epoch 7040
Loss = 1.7476e-02, PNorm = 699.7303, GNorm = 0.1766, lr_0 = 9.9514e-04
Validation binary_cross_entropy = 0.116104
Epoch 7041
Loss = 4.9462e-02, PNorm = 699.7612, GNorm = 0.1183, lr_0 = 9.9514e-04
Validation binary_cross_entropy = 0.132258
Epoch 7042
Loss = 4.2739e-02, PNorm = 699.7992, GNorm = 0.0392, lr_0 = 9.9514e-04
Validation binary_cross_entropy = 0.160651
Epoch 7043
Loss = 7.2956e-02, PNorm = 699.8319, GNorm = 4.3926, lr_0 = 9.9514e-04
Validation binary_cross_entropy = 0.173457
Epoch 7044
Loss = 6.5630e-03, PNorm = 699.8638, GNorm = 0.0897, lr_0 = 9.9514e-04
Validation binary_cross_entropy = 0.202556
Epoch 7045
Loss = 2.9748e-02, PNorm = 699.8835, GNorm = 1.4159, lr_0 = 9.9514e-04
Validation binary_cross_entropy = 0.137309
Epoch 7046
Loss = 2.9038e-02, PNorm = 699.9223, GNorm = 3.7025, lr_0 = 9.9514e-04
Validation binary_cross_entropy = 0.146141
Epoch 7047
Loss = 9.6888e-03, PNorm = 699.9877, GNorm = 0.1648, lr_0 = 9.9514e-04
Validation binary_cross_entropy = 0.135252
Epoch 7048
Loss = 1.8213e-01, PNorm = 700.0323, GNorm = 4.7379, lr_0 = 9.9513e-04
Validation binary_cross_entropy = 0.096628
Epoch 7049
Loss = 3.0587e-02, PNorm = 700.1023, GNorm = 1.1848, lr_0 = 9.9513e-04
Loss = 4.0418e-02, PNorm = 700.1588, GNorm = 3.0078, lr_0 = 9.9513e-04
Validation binary_cross_entropy = 0.103521
Epoch 7050
Loss = 6.3729e-02, PNorm = 700.2081, GNorm = 0.3659, lr_0 = 9.9513e-04
Validation binary_cross_entropy = 0.107196
Epoch 7051
Loss = 4.4178e-02, PNorm = 700.2542, GNorm = 0.4094, lr_0 = 9.9513e-04
Validation binary_cross_entropy = 0.122255
Epoch 7052
Loss = 5.2171e-02, PNorm = 700.2905, GNorm = 2.6228, lr_0 = 9.9513e-04
Validation binary_cross_entropy = 0.108059
Epoch 7053
Loss = 5.0047e-02, PNorm = 700.3204, GNorm = 3.2175, lr_0 = 9.9513e-04
Validation binary_cross_entropy = 0.097227
Epoch 7054
Loss = 6.1887e-02, PNorm = 700.3592, GNorm = 3.7079, lr_0 = 9.9513e-04
Validation binary_cross_entropy = 0.093666
Epoch 7055
Loss = 9.8586e-02, PNorm = 700.3981, GNorm = 0.0950, lr_0 = 9.9513e-04
Validation binary_cross_entropy = 0.110037
Epoch 7056
Loss = 8.0470e-02, PNorm = 700.4445, GNorm = 0.8696, lr_0 = 9.9513e-04
Validation binary_cross_entropy = 0.072214
Epoch 7057
Loss = 4.4815e-02, PNorm = 700.4884, GNorm = 4.0444, lr_0 = 9.9513e-04
Validation binary_cross_entropy = 0.088950
Epoch 7058
Loss = 8.0408e-02, PNorm = 700.5398, GNorm = 2.2808, lr_0 = 9.9513e-04
Validation binary_cross_entropy = 0.082572
Epoch 7059
Loss = 7.7500e-02, PNorm = 700.5828, GNorm = 3.5977, lr_0 = 9.9513e-04
Loss = 2.5425e-02, PNorm = 700.6195, GNorm = 0.3117, lr_0 = 9.9513e-04
Validation binary_cross_entropy = 0.074197
Epoch 7060
Loss = 2.9442e-02, PNorm = 700.6558, GNorm = 0.5950, lr_0 = 9.9512e-04
Validation binary_cross_entropy = 0.090916
Epoch 7061
Loss = 1.5212e-01, PNorm = 700.6962, GNorm = 0.4700, lr_0 = 9.9512e-04
Validation binary_cross_entropy = 0.079971
Epoch 7062
Loss = 6.3701e-02, PNorm = 700.7393, GNorm = 1.7173, lr_0 = 9.9512e-04
Validation binary_cross_entropy = 0.073388
Epoch 7063
Loss = 4.5681e-02, PNorm = 700.7857, GNorm = 0.3822, lr_0 = 9.9512e-04
Validation binary_cross_entropy = 0.089505
Epoch 7064
Loss = 8.1377e-02, PNorm = 700.8331, GNorm = 0.3825, lr_0 = 9.9512e-04
Validation binary_cross_entropy = 0.087013
Epoch 7065
Loss = 1.3128e-01, PNorm = 700.8730, GNorm = 0.9398, lr_0 = 9.9512e-04
Validation binary_cross_entropy = 0.082985
Epoch 7066
Loss = 4.2135e-02, PNorm = 700.9167, GNorm = 3.8129, lr_0 = 9.9512e-04
Validation binary_cross_entropy = 0.086342
Epoch 7067
Loss = 6.6563e-02, PNorm = 700.9613, GNorm = 0.8779, lr_0 = 9.9512e-04
Validation binary_cross_entropy = 0.106306
Epoch 7068
Loss = 4.9209e-02, PNorm = 701.0003, GNorm = 3.1049, lr_0 = 9.9512e-04
Validation binary_cross_entropy = 0.063807
Epoch 7069
Loss = 1.0871e-02, PNorm = 701.0325, GNorm = 0.4265, lr_0 = 9.9512e-04
Loss = 1.0925e-01, PNorm = 701.0790, GNorm = 11.6582, lr_0 = 9.9512e-04
Validation binary_cross_entropy = 0.072870
Epoch 7070
Loss = 1.2026e-01, PNorm = 701.1403, GNorm = 1.3568, lr_0 = 9.9512e-04
Validation binary_cross_entropy = 0.063310
Epoch 7071
Loss = 2.7485e-02, PNorm = 701.2037, GNorm = 0.3854, lr_0 = 9.9512e-04
Validation binary_cross_entropy = 0.064929
Epoch 7072
Loss = 5.6896e-02, PNorm = 701.2471, GNorm = 8.7525, lr_0 = 9.9512e-04
Validation binary_cross_entropy = 0.061730
Epoch 7073
Loss = 3.2547e-02, PNorm = 701.2933, GNorm = 2.0133, lr_0 = 9.9512e-04
Validation binary_cross_entropy = 0.058979
Epoch 7074
Loss = 5.0659e-02, PNorm = 701.3385, GNorm = 1.2065, lr_0 = 9.9511e-04
Validation binary_cross_entropy = 0.057534
Epoch 7075
Loss = 4.8396e-02, PNorm = 701.3922, GNorm = 2.3889, lr_0 = 9.9511e-04
Validation binary_cross_entropy = 0.075623
Epoch 7076
Loss = 8.8894e-02, PNorm = 701.4361, GNorm = 1.2644, lr_0 = 9.9511e-04
Validation binary_cross_entropy = 0.075720
Epoch 7077
Loss = 2.8077e-02, PNorm = 701.4638, GNorm = 1.5362, lr_0 = 9.9511e-04
Validation binary_cross_entropy = 0.069755
Epoch 7078
Loss = 1.1060e-02, PNorm = 701.4841, GNorm = 0.0633, lr_0 = 9.9511e-04
Validation binary_cross_entropy = 0.094727
Epoch 7079
Loss = 1.9662e-03, PNorm = 701.5030, GNorm = 0.2346, lr_0 = 9.9511e-04
Loss = 1.8839e-02, PNorm = 701.5221, GNorm = 0.0921, lr_0 = 9.9511e-04
Validation binary_cross_entropy = 0.091820
Epoch 7080
Loss = 4.3897e-02, PNorm = 701.5436, GNorm = 5.9907, lr_0 = 9.9511e-04
Validation binary_cross_entropy = 0.076462
Epoch 7081
Loss = 5.5040e-02, PNorm = 701.5869, GNorm = 0.4710, lr_0 = 9.9511e-04
Validation binary_cross_entropy = 0.083054
Epoch 7082
Loss = 2.9424e-02, PNorm = 701.6274, GNorm = 0.0640, lr_0 = 9.9511e-04
Validation binary_cross_entropy = 0.088303
Epoch 7083
Loss = 2.1789e-02, PNorm = 701.6559, GNorm = 2.0734, lr_0 = 9.9511e-04
Validation binary_cross_entropy = 0.075912
Epoch 7084
Loss = 2.7950e-02, PNorm = 701.6739, GNorm = 1.1508, lr_0 = 9.9511e-04
Validation binary_cross_entropy = 0.072477
Epoch 7085
Loss = 7.4491e-03, PNorm = 701.6982, GNorm = 0.3317, lr_0 = 9.9511e-04
Validation binary_cross_entropy = 0.073124
Epoch 7086
Loss = 1.4238e-02, PNorm = 701.7245, GNorm = 0.5938, lr_0 = 9.9511e-04
Validation binary_cross_entropy = 0.074367
Epoch 7087
Loss = 1.0402e-02, PNorm = 701.7636, GNorm = 0.5547, lr_0 = 9.9510e-04
Validation binary_cross_entropy = 0.078347
Epoch 7088
Loss = 2.3254e-02, PNorm = 701.8039, GNorm = 0.1841, lr_0 = 9.9510e-04
Validation binary_cross_entropy = 0.082545
Epoch 7089
Loss = 8.4727e-03, PNorm = 701.8332, GNorm = 0.6938, lr_0 = 9.9510e-04
Loss = 2.2719e-02, PNorm = 701.8578, GNorm = 0.0984, lr_0 = 9.9510e-04
Validation binary_cross_entropy = 0.081728
Epoch 7090
Loss = 9.8996e-03, PNorm = 701.8774, GNorm = 1.5358, lr_0 = 9.9510e-04
Validation binary_cross_entropy = 0.086729
Epoch 7091
Loss = 4.1563e-02, PNorm = 701.8973, GNorm = 0.2416, lr_0 = 9.9510e-04
Validation binary_cross_entropy = 0.076124
Epoch 7092
Loss = 1.2198e-02, PNorm = 701.9237, GNorm = 1.3430, lr_0 = 9.9510e-04
Validation binary_cross_entropy = 0.083321
Epoch 7093
Loss = 7.0588e-03, PNorm = 701.9525, GNorm = 0.0328, lr_0 = 9.9510e-04
Validation binary_cross_entropy = 0.098290
Epoch 7094
Loss = 2.9806e-02, PNorm = 701.9729, GNorm = 7.2266, lr_0 = 9.9510e-04
Validation binary_cross_entropy = 0.099536
Epoch 7095
Loss = 2.4379e-02, PNorm = 701.9973, GNorm = 2.5846, lr_0 = 9.9510e-04
Validation binary_cross_entropy = 0.100396
Epoch 7096
Loss = 7.1393e-02, PNorm = 702.0317, GNorm = 0.0880, lr_0 = 9.9510e-04
Validation binary_cross_entropy = 0.084209
Epoch 7097
Loss = 1.6719e-03, PNorm = 702.0541, GNorm = 0.0657, lr_0 = 9.9510e-04
Validation binary_cross_entropy = 0.085850
Epoch 7098
Loss = 2.9217e-02, PNorm = 702.0802, GNorm = 0.5988, lr_0 = 9.9510e-04
Validation binary_cross_entropy = 0.077760
Epoch 7099
Loss = 1.7276e-02, PNorm = 702.1199, GNorm = 0.7659, lr_0 = 9.9510e-04
Loss = 1.0462e-01, PNorm = 702.1731, GNorm = 2.1017, lr_0 = 9.9510e-04
Validation binary_cross_entropy = 0.086408
Epoch 7100
Loss = 1.4802e-02, PNorm = 702.2157, GNorm = 0.1687, lr_0 = 9.9509e-04
Validation binary_cross_entropy = 0.081130
Epoch 7101
Loss = 6.1188e-02, PNorm = 702.2594, GNorm = 0.2915, lr_0 = 9.9509e-04
Validation binary_cross_entropy = 0.091808
Epoch 7102
Loss = 1.8546e-02, PNorm = 702.2931, GNorm = 0.8318, lr_0 = 9.9509e-04
Validation binary_cross_entropy = 0.087788
Epoch 7103
Loss = 2.0863e-02, PNorm = 702.3198, GNorm = 0.2742, lr_0 = 9.9509e-04
Validation binary_cross_entropy = 0.056606
Epoch 7104
Loss = 1.3577e-01, PNorm = 702.3950, GNorm = 13.3913, lr_0 = 9.9509e-04
Validation binary_cross_entropy = 0.127265
Epoch 7105
Loss = 1.6951e-01, PNorm = 702.4924, GNorm = 10.1917, lr_0 = 9.9509e-04
Validation binary_cross_entropy = 0.072558
Epoch 7106
Loss = 1.4318e-01, PNorm = 702.5634, GNorm = 0.3587, lr_0 = 9.9509e-04
Validation binary_cross_entropy = 0.148890
Epoch 7107
Loss = 1.3561e-01, PNorm = 702.6235, GNorm = 2.5550, lr_0 = 9.9509e-04
Validation binary_cross_entropy = 0.127886
Epoch 7108
Loss = 9.9462e-02, PNorm = 702.6684, GNorm = 2.0954, lr_0 = 9.9509e-04
Validation binary_cross_entropy = 0.160044
Epoch 7109
Loss = 1.0238e-02, PNorm = 702.7011, GNorm = 0.9171, lr_0 = 9.9509e-04
Loss = 7.8306e-02, PNorm = 702.7519, GNorm = 1.4659, lr_0 = 9.9509e-04
Validation binary_cross_entropy = 0.122792
Epoch 7110
Loss = 6.5205e-02, PNorm = 702.8008, GNorm = 0.1718, lr_0 = 9.9509e-04
Validation binary_cross_entropy = 0.092105
Epoch 7111
Loss = 2.7918e-02, PNorm = 702.8429, GNorm = 0.3755, lr_0 = 9.9509e-04
Validation binary_cross_entropy = 0.092578
Epoch 7112
Loss = 6.6455e-02, PNorm = 702.8950, GNorm = 2.3482, lr_0 = 9.9509e-04
Validation binary_cross_entropy = 0.122579
Epoch 7113
Loss = 7.6207e-02, PNorm = 702.9306, GNorm = 0.6758, lr_0 = 9.9509e-04
Validation binary_cross_entropy = 0.090175
Epoch 7114
Loss = 4.8907e-02, PNorm = 702.9521, GNorm = 2.5266, lr_0 = 9.9508e-04
Validation binary_cross_entropy = 0.078741
Epoch 7115
Loss = 1.4019e-02, PNorm = 702.9775, GNorm = 0.8824, lr_0 = 9.9508e-04
Validation binary_cross_entropy = 0.088995
Epoch 7116
Loss = 2.3904e-02, PNorm = 703.0028, GNorm = 0.2384, lr_0 = 9.9508e-04
Validation binary_cross_entropy = 0.085517
Epoch 7117
Loss = 2.7068e-02, PNorm = 703.0290, GNorm = 0.1279, lr_0 = 9.9508e-04
Validation binary_cross_entropy = 0.096846
Epoch 7118
Loss = 5.4996e-02, PNorm = 703.0556, GNorm = 3.5481, lr_0 = 9.9508e-04
Validation binary_cross_entropy = 0.104938
Epoch 7119
Loss = 1.9360e-02, PNorm = 703.0851, GNorm = 1.1327, lr_0 = 9.9508e-04
Loss = 3.8363e-02, PNorm = 703.1159, GNorm = 1.0485, lr_0 = 9.9508e-04
Validation binary_cross_entropy = 0.096767
Epoch 7120
Loss = 6.7062e-02, PNorm = 703.1498, GNorm = 1.3973, lr_0 = 9.9508e-04
Validation binary_cross_entropy = 0.108247
Epoch 7121
Loss = 9.4859e-02, PNorm = 703.1836, GNorm = 12.0987, lr_0 = 9.9508e-04
Validation binary_cross_entropy = 0.094122
Epoch 7122
Loss = 2.3152e-02, PNorm = 703.2170, GNorm = 1.8216, lr_0 = 9.9508e-04
Validation binary_cross_entropy = 0.077369
Epoch 7123
Loss = 6.0661e-02, PNorm = 703.2720, GNorm = 4.9412, lr_0 = 9.9508e-04
Validation binary_cross_entropy = 0.076472
Epoch 7124
Loss = 6.4926e-02, PNorm = 703.3544, GNorm = 4.2151, lr_0 = 9.9508e-04
Validation binary_cross_entropy = 0.091707
Epoch 7125
Loss = 2.2313e-02, PNorm = 703.4134, GNorm = 1.2407, lr_0 = 9.9508e-04
Validation binary_cross_entropy = 0.105592
Epoch 7126
Loss = 3.7027e-02, PNorm = 703.4484, GNorm = 1.1371, lr_0 = 9.9508e-04
Validation binary_cross_entropy = 0.065511
Epoch 7127
Loss = 3.2635e-02, PNorm = 703.4905, GNorm = 1.0458, lr_0 = 9.9507e-04
Validation binary_cross_entropy = 0.079337
Epoch 7128
Loss = 4.7425e-02, PNorm = 703.5472, GNorm = 0.9341, lr_0 = 9.9507e-04
Validation binary_cross_entropy = 0.086527
Epoch 7129
Loss = 9.4095e-03, PNorm = 703.5905, GNorm = 0.2974, lr_0 = 9.9507e-04
Loss = 3.5381e-02, PNorm = 703.6213, GNorm = 0.5615, lr_0 = 9.9507e-04
Validation binary_cross_entropy = 0.090863
Epoch 7130
Loss = 5.6978e-02, PNorm = 703.6432, GNorm = 1.1498, lr_0 = 9.9507e-04
Validation binary_cross_entropy = 0.086315
Epoch 7131
Loss = 4.2227e-02, PNorm = 703.6671, GNorm = 0.7630, lr_0 = 9.9507e-04
Validation binary_cross_entropy = 0.087984
Epoch 7132
Loss = 3.3981e-02, PNorm = 703.7113, GNorm = 0.0861, lr_0 = 9.9507e-04
Validation binary_cross_entropy = 0.084751
Epoch 7133
Loss = 3.5915e-02, PNorm = 703.7505, GNorm = 0.7974, lr_0 = 9.9507e-04
Validation binary_cross_entropy = 0.088845
Epoch 7134
Loss = 5.9596e-02, PNorm = 703.7818, GNorm = 1.8439, lr_0 = 9.9507e-04
Validation binary_cross_entropy = 0.093633
Epoch 7135
Loss = 2.8277e-02, PNorm = 703.8131, GNorm = 3.2705, lr_0 = 9.9507e-04
Validation binary_cross_entropy = 0.105294
Epoch 7136
Loss = 1.4387e-01, PNorm = 703.8425, GNorm = 0.3192, lr_0 = 9.9507e-04
Validation binary_cross_entropy = 0.123764
Epoch 7137
Loss = 5.1394e-02, PNorm = 703.8696, GNorm = 1.3667, lr_0 = 9.9507e-04
Validation binary_cross_entropy = 0.121532
Epoch 7138
Loss = 3.5033e-03, PNorm = 703.8944, GNorm = 0.1558, lr_0 = 9.9507e-04
Validation binary_cross_entropy = 0.113979
Epoch 7139
Loss = 1.0586e-02, PNorm = 703.9181, GNorm = 0.7684, lr_0 = 9.9507e-04
Loss = 9.6281e-02, PNorm = 703.9455, GNorm = 4.7610, lr_0 = 9.9507e-04
Validation binary_cross_entropy = 0.073435
Epoch 7140
Loss = 6.1673e-02, PNorm = 704.0001, GNorm = 6.4752, lr_0 = 9.9506e-04
Validation binary_cross_entropy = 0.080547
Epoch 7141
Loss = 6.1623e-02, PNorm = 704.0512, GNorm = 1.5538, lr_0 = 9.9506e-04
Validation binary_cross_entropy = 0.090156
Epoch 7142
Loss = 1.8591e-02, PNorm = 704.0861, GNorm = 1.2299, lr_0 = 9.9506e-04
Validation binary_cross_entropy = 0.092094
Epoch 7143
Loss = 7.1831e-02, PNorm = 704.1167, GNorm = 0.2706, lr_0 = 9.9506e-04
Validation binary_cross_entropy = 0.087708
Epoch 7144
Loss = 5.4844e-02, PNorm = 704.1422, GNorm = 7.0821, lr_0 = 9.9506e-04
Validation binary_cross_entropy = 0.094927
Epoch 7145
Loss = 3.8987e-02, PNorm = 704.1706, GNorm = 1.5996, lr_0 = 9.9506e-04
Validation binary_cross_entropy = 0.094848
Epoch 7146
Loss = 5.6314e-02, PNorm = 704.1936, GNorm = 5.8972, lr_0 = 9.9506e-04
Validation binary_cross_entropy = 0.089635
Epoch 7147
Loss = 7.0539e-02, PNorm = 704.2204, GNorm = 1.6559, lr_0 = 9.9506e-04
Validation binary_cross_entropy = 0.079732
Epoch 7148
Loss = 1.8363e-02, PNorm = 704.2478, GNorm = 1.2951, lr_0 = 9.9506e-04
Validation binary_cross_entropy = 0.115039
Epoch 7149
Loss = 2.5276e-03, PNorm = 704.2832, GNorm = 0.3366, lr_0 = 9.9506e-04
Loss = 4.3685e-02, PNorm = 704.3058, GNorm = 0.1163, lr_0 = 9.9506e-04
Validation binary_cross_entropy = 0.134575
Epoch 7150
Loss = 8.5652e-03, PNorm = 704.3257, GNorm = 0.1326, lr_0 = 9.9506e-04
Validation binary_cross_entropy = 0.161577
Epoch 7151
Loss = 1.1645e-01, PNorm = 704.3404, GNorm = 0.0974, lr_0 = 9.9506e-04
Validation binary_cross_entropy = 0.081870
Epoch 7152
Loss = 9.6646e-03, PNorm = 704.3673, GNorm = 1.6623, lr_0 = 9.9506e-04
Validation binary_cross_entropy = 0.076828
Epoch 7153
Loss = 2.6950e-02, PNorm = 704.3986, GNorm = 0.1612, lr_0 = 9.9505e-04
Validation binary_cross_entropy = 0.070020
Epoch 7154
Loss = 3.0230e-02, PNorm = 704.4348, GNorm = 2.0919, lr_0 = 9.9505e-04
Validation binary_cross_entropy = 0.078594
Epoch 7155
Loss = 8.9941e-03, PNorm = 704.4709, GNorm = 0.9083, lr_0 = 9.9505e-04
Validation binary_cross_entropy = 0.084927
Epoch 7156
Loss = 8.3456e-03, PNorm = 704.4988, GNorm = 0.7409, lr_0 = 9.9505e-04
Validation binary_cross_entropy = 0.096914
Epoch 7157
Loss = 3.0306e-03, PNorm = 704.5201, GNorm = 0.4425, lr_0 = 9.9505e-04
Validation binary_cross_entropy = 0.097068
Epoch 7158
Loss = 4.8293e-02, PNorm = 704.5341, GNorm = 1.0654, lr_0 = 9.9505e-04
Validation binary_cross_entropy = 0.090027
Epoch 7159
Loss = 1.7579e-02, PNorm = 704.5533, GNorm = 1.2022, lr_0 = 9.9505e-04
Loss = 2.3982e-02, PNorm = 704.5851, GNorm = 1.2214, lr_0 = 9.9505e-04
Validation binary_cross_entropy = 0.094227
Epoch 7160
Loss = 5.5000e-02, PNorm = 704.6060, GNorm = 4.0742, lr_0 = 9.9505e-04
Validation binary_cross_entropy = 0.095942
Epoch 7161
Loss = 2.9695e-02, PNorm = 704.6481, GNorm = 0.6058, lr_0 = 9.9505e-04
Validation binary_cross_entropy = 0.130863
Epoch 7162
Loss = 3.7592e-02, PNorm = 704.6832, GNorm = 0.2125, lr_0 = 9.9505e-04
Validation binary_cross_entropy = 0.175522
Epoch 7163
Loss = 2.2943e-02, PNorm = 704.7200, GNorm = 1.5435, lr_0 = 9.9505e-04
Validation binary_cross_entropy = 0.220181
Epoch 7164
Loss = 8.0021e-02, PNorm = 704.7552, GNorm = 0.6500, lr_0 = 9.9505e-04
Validation binary_cross_entropy = 0.202922
Epoch 7165
Loss = 1.6259e-02, PNorm = 704.7871, GNorm = 0.8136, lr_0 = 9.9505e-04
Validation binary_cross_entropy = 0.143276
Epoch 7166
Loss = 7.1960e-02, PNorm = 704.8076, GNorm = 0.4553, lr_0 = 9.9505e-04
Validation binary_cross_entropy = 0.086959
Epoch 7167
Loss = 2.6811e-02, PNorm = 704.8378, GNorm = 1.5934, lr_0 = 9.9504e-04
Validation binary_cross_entropy = 0.078414
Epoch 7168
Loss = 7.1010e-03, PNorm = 704.8824, GNorm = 0.1918, lr_0 = 9.9504e-04
Validation binary_cross_entropy = 0.078608
Epoch 7169
Loss = 9.2452e-02, PNorm = 704.9398, GNorm = 5.5020, lr_0 = 9.9504e-04
Loss = 3.8276e-02, PNorm = 704.9860, GNorm = 1.2598, lr_0 = 9.9504e-04
Validation binary_cross_entropy = 0.073020
Epoch 7170
Loss = 2.0130e-02, PNorm = 705.0226, GNorm = 0.7327, lr_0 = 9.9504e-04
Validation binary_cross_entropy = 0.078701
Epoch 7171
Loss = 3.5443e-02, PNorm = 705.0516, GNorm = 14.6030, lr_0 = 9.9504e-04
Validation binary_cross_entropy = 0.080064
Epoch 7172
Loss = 2.8102e-02, PNorm = 705.0882, GNorm = 0.9745, lr_0 = 9.9504e-04
Validation binary_cross_entropy = 0.076584
Epoch 7173
Loss = 5.7918e-02, PNorm = 705.1240, GNorm = 0.3942, lr_0 = 9.9504e-04
Validation binary_cross_entropy = 0.077667
Epoch 7174
Loss = 2.9786e-02, PNorm = 705.1545, GNorm = 1.4085, lr_0 = 9.9504e-04
Validation binary_cross_entropy = 0.074274
Epoch 7175
Loss = 1.3736e-02, PNorm = 705.1767, GNorm = 0.9651, lr_0 = 9.9504e-04
Validation binary_cross_entropy = 0.078700
Epoch 7176
Loss = 2.2662e-02, PNorm = 705.1992, GNorm = 0.2918, lr_0 = 9.9504e-04
Validation binary_cross_entropy = 0.082056
Epoch 7177
Loss = 1.6562e-02, PNorm = 705.2224, GNorm = 1.3447, lr_0 = 9.9504e-04
Validation binary_cross_entropy = 0.088793
Epoch 7178
Loss = 5.2968e-02, PNorm = 705.2513, GNorm = 2.6149, lr_0 = 9.9504e-04
Validation binary_cross_entropy = 0.088462
Epoch 7179
Loss = 2.0345e-03, PNorm = 705.2767, GNorm = 0.0729, lr_0 = 9.9504e-04
Loss = 2.6435e-02, PNorm = 705.2976, GNorm = 2.7440, lr_0 = 9.9504e-04
Validation binary_cross_entropy = 0.091111
Epoch 7180
Loss = 9.4341e-02, PNorm = 705.3400, GNorm = 7.2449, lr_0 = 9.9503e-04
Validation binary_cross_entropy = 0.078225
Epoch 7181
Loss = 7.0540e-02, PNorm = 705.4017, GNorm = 0.7309, lr_0 = 9.9503e-04
Validation binary_cross_entropy = 0.094043
Epoch 7182
Loss = 1.1281e-01, PNorm = 705.4520, GNorm = 2.0502, lr_0 = 9.9503e-04
Validation binary_cross_entropy = 0.074033
Epoch 7183
Loss = 7.5956e-02, PNorm = 705.5086, GNorm = 2.1070, lr_0 = 9.9503e-04
Validation binary_cross_entropy = 0.108288
Epoch 7184
Loss = 5.2937e-02, PNorm = 705.5608, GNorm = 2.1616, lr_0 = 9.9503e-04
Validation binary_cross_entropy = 0.096290
Epoch 7185
Loss = 8.0742e-02, PNorm = 705.6071, GNorm = 1.0168, lr_0 = 9.9503e-04
Validation binary_cross_entropy = 0.094498
Epoch 7186
Loss = 7.5679e-02, PNorm = 705.6510, GNorm = 8.3740, lr_0 = 9.9503e-04
Validation binary_cross_entropy = 0.094212
Epoch 7187
Loss = 7.4498e-02, PNorm = 705.6994, GNorm = 2.3307, lr_0 = 9.9503e-04
Validation binary_cross_entropy = 0.083931
Epoch 7188
Loss = 6.8321e-01, PNorm = 705.7510, GNorm = 116.3847, lr_0 = 9.9503e-04
Validation binary_cross_entropy = 0.072753
Epoch 7189
Loss = 2.5614e-01, PNorm = 705.8387, GNorm = 9.5916, lr_0 = 9.9503e-04
Loss = 1.7998e-01, PNorm = 705.9261, GNorm = 2.3011, lr_0 = 9.9503e-04
Validation binary_cross_entropy = 0.089440
Epoch 7190
Loss = 6.1162e-02, PNorm = 705.9971, GNorm = 0.4163, lr_0 = 9.9503e-04
Validation binary_cross_entropy = 0.073830
Epoch 7191
Loss = 3.9484e-02, PNorm = 706.0539, GNorm = 0.1898, lr_0 = 9.9503e-04
Validation binary_cross_entropy = 0.094857
Epoch 7192
Loss = 2.0597e-02, PNorm = 706.0921, GNorm = 0.1982, lr_0 = 9.9503e-04
Validation binary_cross_entropy = 0.088174
Epoch 7193
Loss = 6.1152e-02, PNorm = 706.1209, GNorm = 1.6898, lr_0 = 9.9502e-04
Validation binary_cross_entropy = 0.086030
Epoch 7194
Loss = 1.7964e-02, PNorm = 706.1549, GNorm = 0.1732, lr_0 = 9.9502e-04
Validation binary_cross_entropy = 0.090771
Epoch 7195
Loss = 2.6623e-02, PNorm = 706.1773, GNorm = 1.3178, lr_0 = 9.9502e-04
Validation binary_cross_entropy = 0.080682
Epoch 7196
Loss = 5.7677e-02, PNorm = 706.2133, GNorm = 0.3823, lr_0 = 9.9502e-04
Validation binary_cross_entropy = 0.069663
Epoch 7197
Loss = 1.6224e-02, PNorm = 706.2553, GNorm = 0.6651, lr_0 = 9.9502e-04
Validation binary_cross_entropy = 0.091829
Epoch 7198
Loss = 3.1496e-02, PNorm = 706.3178, GNorm = 1.6892, lr_0 = 9.9502e-04
Validation binary_cross_entropy = 0.079754
Epoch 7199
Loss = 1.3714e-02, PNorm = 706.3649, GNorm = 0.7249, lr_0 = 9.9502e-04
Loss = 5.1276e-02, PNorm = 706.4112, GNorm = 0.7198, lr_0 = 9.9502e-04
Validation binary_cross_entropy = 0.090521
Epoch 7200
Loss = 4.5481e-02, PNorm = 706.4550, GNorm = 0.5631, lr_0 = 9.9502e-04
Validation binary_cross_entropy = 0.090458
Epoch 7201
Loss = 1.9977e-02, PNorm = 706.4969, GNorm = 0.2105, lr_0 = 9.9502e-04
Validation binary_cross_entropy = 0.101838
Epoch 7202
Loss = 4.5126e-02, PNorm = 706.5272, GNorm = 4.1138, lr_0 = 9.9502e-04
Validation binary_cross_entropy = 0.092209
Epoch 7203
Loss = 5.3672e-02, PNorm = 706.5539, GNorm = 6.3392, lr_0 = 9.9502e-04
Validation binary_cross_entropy = 0.085521
Epoch 7204
Loss = 2.6093e-02, PNorm = 706.5829, GNorm = 0.2857, lr_0 = 9.9502e-04
Validation binary_cross_entropy = 0.092107
Epoch 7205
Loss = 9.9560e-03, PNorm = 706.6387, GNorm = 0.1167, lr_0 = 9.9502e-04
Validation binary_cross_entropy = 0.124323
Epoch 7206
Loss = 1.8765e-02, PNorm = 706.6698, GNorm = 1.3198, lr_0 = 9.9502e-04
Validation binary_cross_entropy = 0.115069
Epoch 7207
Loss = 1.6192e-02, PNorm = 706.6986, GNorm = 0.6709, lr_0 = 9.9501e-04
Validation binary_cross_entropy = 0.101539
Epoch 7208
Loss = 1.4856e-02, PNorm = 706.7104, GNorm = 0.1357, lr_0 = 9.9501e-04
Validation binary_cross_entropy = 0.089343
Epoch 7209
Loss = 3.6852e-02, PNorm = 706.7308, GNorm = 2.5527, lr_0 = 9.9501e-04
Loss = 6.2721e-02, PNorm = 706.7548, GNorm = 1.0450, lr_0 = 9.9501e-04
Validation binary_cross_entropy = 0.080469
Epoch 7210
Loss = 1.6849e-02, PNorm = 706.7773, GNorm = 0.4889, lr_0 = 9.9501e-04
Validation binary_cross_entropy = 0.079180
Epoch 7211
Loss = 3.4729e-02, PNorm = 706.7994, GNorm = 8.0440, lr_0 = 9.9501e-04
Validation binary_cross_entropy = 0.083482
Epoch 7212
Loss = 3.7000e-02, PNorm = 706.8244, GNorm = 1.5759, lr_0 = 9.9501e-04
Validation binary_cross_entropy = 0.079127
Epoch 7213
Loss = 1.3595e-02, PNorm = 706.8525, GNorm = 0.2669, lr_0 = 9.9501e-04
Validation binary_cross_entropy = 0.078753
Epoch 7214
Loss = 8.2273e-02, PNorm = 706.8772, GNorm = 8.0684, lr_0 = 9.9501e-04
Validation binary_cross_entropy = 0.082079
Epoch 7215
Loss = 1.0784e-01, PNorm = 706.9064, GNorm = 5.4856, lr_0 = 9.9501e-04
Validation binary_cross_entropy = 0.081470
Epoch 7216
Loss = 1.8735e-02, PNorm = 706.9329, GNorm = 0.0607, lr_0 = 9.9501e-04
Validation binary_cross_entropy = 0.076157
Epoch 7217
Loss = 2.3213e-02, PNorm = 706.9568, GNorm = 0.2238, lr_0 = 9.9501e-04
Validation binary_cross_entropy = 0.078046
Epoch 7218
Loss = 3.4204e-02, PNorm = 706.9842, GNorm = 3.0930, lr_0 = 9.9501e-04
Validation binary_cross_entropy = 0.069798
Epoch 7219
Loss = 8.2310e-02, PNorm = 707.0145, GNorm = 3.4012, lr_0 = 9.9501e-04
Loss = 2.1033e-02, PNorm = 707.0600, GNorm = 0.8692, lr_0 = 9.9500e-04
Validation binary_cross_entropy = 0.061872
Epoch 7220
Loss = 4.1517e-02, PNorm = 707.1114, GNorm = 1.1302, lr_0 = 9.9500e-04
Validation binary_cross_entropy = 0.072825
Epoch 7221
Loss = 3.2643e-02, PNorm = 707.1470, GNorm = 0.2993, lr_0 = 9.9500e-04
Validation binary_cross_entropy = 0.079824
Epoch 7222
Loss = 1.8800e-02, PNorm = 707.1703, GNorm = 2.4875, lr_0 = 9.9500e-04
Validation binary_cross_entropy = 0.078200
Epoch 7223
Loss = 6.9235e-02, PNorm = 707.1897, GNorm = 5.1100, lr_0 = 9.9500e-04
Validation binary_cross_entropy = 0.069872
Epoch 7224
Loss = 3.0712e-02, PNorm = 707.2181, GNorm = 0.9187, lr_0 = 9.9500e-04
Validation binary_cross_entropy = 0.064930
Epoch 7225
Loss = 4.9868e-02, PNorm = 707.3244, GNorm = 0.3468, lr_0 = 9.9500e-04
Validation binary_cross_entropy = 0.092899
Epoch 7226
Loss = 3.9910e-02, PNorm = 707.4146, GNorm = 5.2077, lr_0 = 9.9500e-04
Validation binary_cross_entropy = 0.092843
Epoch 7227
Loss = 2.6532e-02, PNorm = 707.4665, GNorm = 1.6877, lr_0 = 9.9500e-04
Validation binary_cross_entropy = 0.087985
Epoch 7228
Loss = 9.1537e-02, PNorm = 707.4996, GNorm = 2.0241, lr_0 = 9.9500e-04
Validation binary_cross_entropy = 0.086806
Epoch 7229
Loss = 2.4470e-03, PNorm = 707.5346, GNorm = 0.2239, lr_0 = 9.9500e-04
Loss = 3.1901e-02, PNorm = 707.5662, GNorm = 1.7933, lr_0 = 9.9500e-04
Validation binary_cross_entropy = 0.091187
Epoch 7230
Loss = 4.7457e-02, PNorm = 707.5934, GNorm = 0.8909, lr_0 = 9.9500e-04
Validation binary_cross_entropy = 0.076088
Epoch 7231
Loss = 2.5548e-02, PNorm = 707.6285, GNorm = 1.4770, lr_0 = 9.9500e-04
Validation binary_cross_entropy = 0.077588
Epoch 7232
Loss = 7.7531e-03, PNorm = 707.6712, GNorm = 0.4237, lr_0 = 9.9500e-04
Validation binary_cross_entropy = 0.092913
Epoch 7233
Loss = 3.9058e-02, PNorm = 707.6938, GNorm = 0.4522, lr_0 = 9.9499e-04
Validation binary_cross_entropy = 0.091162
Epoch 7234
Loss = 5.9741e-02, PNorm = 707.7122, GNorm = 4.6597, lr_0 = 9.9499e-04
Validation binary_cross_entropy = 0.090783
Epoch 7235
Loss = 1.5659e-02, PNorm = 707.7488, GNorm = 0.8873, lr_0 = 9.9499e-04
Validation binary_cross_entropy = 0.091432
Epoch 7236
Loss = 3.0883e-02, PNorm = 707.7778, GNorm = 1.5643, lr_0 = 9.9499e-04
Validation binary_cross_entropy = 0.093365
Epoch 7237
Loss = 4.3021e-01, PNorm = 707.8011, GNorm = 16.2438, lr_0 = 9.9499e-04
Validation binary_cross_entropy = 0.084582
Epoch 7238
Loss = 3.5187e-02, PNorm = 707.8425, GNorm = 2.8625, lr_0 = 9.9499e-04
Validation binary_cross_entropy = 0.087034
Epoch 7239
Loss = 8.6675e-03, PNorm = 707.8826, GNorm = 0.3703, lr_0 = 9.9499e-04
Loss = 4.0777e-02, PNorm = 707.9174, GNorm = 0.3240, lr_0 = 9.9499e-04
Validation binary_cross_entropy = 0.090542
Epoch 7240
Loss = 1.9042e-02, PNorm = 707.9415, GNorm = 0.6228, lr_0 = 9.9499e-04
Validation binary_cross_entropy = 0.091492
Epoch 7241
Loss = 4.2236e-02, PNorm = 707.9626, GNorm = 0.5603, lr_0 = 9.9499e-04
Validation binary_cross_entropy = 0.091551
Epoch 7242
Loss = 4.4519e-02, PNorm = 707.9876, GNorm = 0.1301, lr_0 = 9.9499e-04
Validation binary_cross_entropy = 0.085873
Epoch 7243
Loss = 6.4582e-02, PNorm = 708.0189, GNorm = 0.5867, lr_0 = 9.9499e-04
Validation binary_cross_entropy = 0.120393
Epoch 7244
Loss = 1.5720e-01, PNorm = 708.0506, GNorm = 2.8575, lr_0 = 9.9499e-04
Validation binary_cross_entropy = 0.082797
Epoch 7245
Loss = 2.9525e-02, PNorm = 708.0835, GNorm = 0.4352, lr_0 = 9.9499e-04
Validation binary_cross_entropy = 0.090808
Epoch 7246
Loss = 3.2026e-02, PNorm = 708.1251, GNorm = 3.2951, lr_0 = 9.9499e-04
Validation binary_cross_entropy = 0.094463
Epoch 7247
Loss = 6.4247e-02, PNorm = 708.1675, GNorm = 1.5800, lr_0 = 9.9498e-04
Validation binary_cross_entropy = 0.088747
Epoch 7248
Loss = 5.8237e-02, PNorm = 708.2009, GNorm = 3.2817, lr_0 = 9.9498e-04
Validation binary_cross_entropy = 0.086715
Epoch 7249
Loss = 7.0297e-03, PNorm = 708.2401, GNorm = 0.2495, lr_0 = 9.9498e-04
Loss = 3.9866e-02, PNorm = 708.2755, GNorm = 10.0871, lr_0 = 9.9498e-04
Validation binary_cross_entropy = 0.086984
Epoch 7250
Loss = 6.2100e-02, PNorm = 708.3130, GNorm = 0.0812, lr_0 = 9.9498e-04
Validation binary_cross_entropy = 0.085109
Epoch 7251
Loss = 4.3783e-02, PNorm = 708.3498, GNorm = 0.5277, lr_0 = 9.9498e-04
Validation binary_cross_entropy = 0.080457
Epoch 7252
Loss = 4.2638e-02, PNorm = 708.3823, GNorm = 0.1136, lr_0 = 9.9498e-04
Validation binary_cross_entropy = 0.089383
Epoch 7253
Loss = 4.4651e-02, PNorm = 708.4114, GNorm = 2.0344, lr_0 = 9.9498e-04
Validation binary_cross_entropy = 0.106826
Epoch 7254
Loss = 9.2304e-03, PNorm = 708.4291, GNorm = 0.3222, lr_0 = 9.9498e-04
Validation binary_cross_entropy = 0.082525
Epoch 7255
Loss = 1.7188e-02, PNorm = 708.4562, GNorm = 1.4538, lr_0 = 9.9498e-04
Validation binary_cross_entropy = 0.083641
Epoch 7256
Loss = 9.2083e-02, PNorm = 708.4964, GNorm = 9.1788, lr_0 = 9.9498e-04
Validation binary_cross_entropy = 0.120207
Epoch 7257
Loss = 1.8788e-02, PNorm = 708.5389, GNorm = 0.1829, lr_0 = 9.9498e-04
Validation binary_cross_entropy = 0.098110
Epoch 7258
Loss = 9.6360e-02, PNorm = 708.5760, GNorm = 1.7946, lr_0 = 9.9498e-04
Validation binary_cross_entropy = 0.090120
Epoch 7259
Loss = 2.4288e-02, PNorm = 708.6067, GNorm = 1.4663, lr_0 = 9.9498e-04
Loss = 3.9571e-02, PNorm = 708.6434, GNorm = 0.1455, lr_0 = 9.9497e-04
Validation binary_cross_entropy = 0.096352
Epoch 7260
Loss = 1.0718e-02, PNorm = 708.6724, GNorm = 0.0582, lr_0 = 9.9497e-04
Validation binary_cross_entropy = 0.100155
Epoch 7261
Loss = 9.2308e-02, PNorm = 708.7021, GNorm = 1.3871, lr_0 = 9.9497e-04
Validation binary_cross_entropy = 0.087165
Epoch 7262
Loss = 5.5849e-02, PNorm = 708.7492, GNorm = 3.7559, lr_0 = 9.9497e-04
Validation binary_cross_entropy = 0.096423
Epoch 7263
Loss = 5.4373e-02, PNorm = 708.7907, GNorm = 5.2581, lr_0 = 9.9497e-04
Validation binary_cross_entropy = 0.096607
Epoch 7264
Loss = 3.5616e-02, PNorm = 708.8271, GNorm = 0.2368, lr_0 = 9.9497e-04
Validation binary_cross_entropy = 0.078456
Epoch 7265
Loss = 1.8408e-02, PNorm = 708.8589, GNorm = 0.9131, lr_0 = 9.9497e-04
Validation binary_cross_entropy = 0.092900
Epoch 7266
Loss = 5.4990e-02, PNorm = 708.8944, GNorm = 2.5778, lr_0 = 9.9497e-04
Validation binary_cross_entropy = 0.073704
Epoch 7267
Loss = 1.4316e-02, PNorm = 708.9255, GNorm = 0.0428, lr_0 = 9.9497e-04
Validation binary_cross_entropy = 0.097709
Epoch 7268
Loss = 4.1110e-03, PNorm = 708.9651, GNorm = 0.7591, lr_0 = 9.9497e-04
Validation binary_cross_entropy = 0.114090
Epoch 7269
Loss = 3.2708e-03, PNorm = 708.9989, GNorm = 0.2457, lr_0 = 9.9497e-04
Loss = 5.4817e-02, PNorm = 709.0390, GNorm = 0.5830, lr_0 = 9.9497e-04
Validation binary_cross_entropy = 0.090257
Epoch 7270
Loss = 2.8198e-03, PNorm = 709.0762, GNorm = 0.0424, lr_0 = 9.9497e-04
Validation binary_cross_entropy = 0.096233
Epoch 7271
Loss = 9.3851e-03, PNorm = 709.1060, GNorm = 0.0093, lr_0 = 9.9497e-04
Validation binary_cross_entropy = 0.121722
Epoch 7272
Loss = 1.0988e-01, PNorm = 709.1245, GNorm = 0.0116, lr_0 = 9.9497e-04
Validation binary_cross_entropy = 0.086395
Epoch 7273
Loss = 4.6183e-03, PNorm = 709.1509, GNorm = 0.4256, lr_0 = 9.9496e-04
Validation binary_cross_entropy = 0.090061
Epoch 7274
Loss = 2.1193e-02, PNorm = 709.1884, GNorm = 0.5878, lr_0 = 9.9496e-04
Validation binary_cross_entropy = 0.121691
Epoch 7275
Loss = 7.8766e-02, PNorm = 709.2231, GNorm = 0.5275, lr_0 = 9.9496e-04
Validation binary_cross_entropy = 0.120999
Epoch 7276
Loss = 2.7058e-02, PNorm = 709.2618, GNorm = 3.2579, lr_0 = 9.9496e-04
Validation binary_cross_entropy = 0.167931
Epoch 7277
Loss = 2.9910e-02, PNorm = 709.2946, GNorm = 0.5227, lr_0 = 9.9496e-04
Validation binary_cross_entropy = 0.174574
Epoch 7278
Loss = 1.3736e-01, PNorm = 709.3175, GNorm = 4.7057, lr_0 = 9.9496e-04
Validation binary_cross_entropy = 0.113837
Epoch 7279
Loss = 2.3366e-03, PNorm = 709.3550, GNorm = 0.2415, lr_0 = 9.9496e-04
Loss = 1.1770e-01, PNorm = 709.4184, GNorm = 0.1130, lr_0 = 9.9496e-04
Validation binary_cross_entropy = 0.088797
Epoch 7280
Loss = 2.8608e-02, PNorm = 709.4730, GNorm = 1.1956, lr_0 = 9.9496e-04
Validation binary_cross_entropy = 0.105618
Epoch 7281
Loss = 5.4198e-02, PNorm = 709.4997, GNorm = 0.1082, lr_0 = 9.9496e-04
Validation binary_cross_entropy = 0.078399
Epoch 7282
Loss = 1.5773e-02, PNorm = 709.5267, GNorm = 0.1127, lr_0 = 9.9496e-04
Validation binary_cross_entropy = 0.082177
Epoch 7283
Loss = 2.1871e-02, PNorm = 709.5577, GNorm = 3.3225, lr_0 = 9.9496e-04
Validation binary_cross_entropy = 0.100460
Epoch 7284
Loss = 4.4446e-02, PNorm = 709.5971, GNorm = 1.6142, lr_0 = 9.9496e-04
Validation binary_cross_entropy = 0.129069
Epoch 7285
Loss = 1.3588e-02, PNorm = 709.6185, GNorm = 0.4762, lr_0 = 9.9496e-04
Validation binary_cross_entropy = 0.110319
Epoch 7286
Loss = 1.2128e-02, PNorm = 709.6389, GNorm = 0.1223, lr_0 = 9.9495e-04
Validation binary_cross_entropy = 0.089635
Epoch 7287
Loss = 4.4637e-02, PNorm = 709.6661, GNorm = 6.7877, lr_0 = 9.9495e-04
Validation binary_cross_entropy = 0.085988
Epoch 7288
Loss = 6.9880e-03, PNorm = 709.7228, GNorm = 0.7581, lr_0 = 9.9495e-04
Validation binary_cross_entropy = 0.095568
Epoch 7289
Loss = 1.2714e-02, PNorm = 709.7667, GNorm = 0.7853, lr_0 = 9.9495e-04
Loss = 6.8454e-02, PNorm = 709.7955, GNorm = 0.1995, lr_0 = 9.9495e-04
Validation binary_cross_entropy = 0.093732
Epoch 7290
Loss = 1.2593e-01, PNorm = 709.8236, GNorm = 0.1319, lr_0 = 9.9495e-04
Validation binary_cross_entropy = 0.079634
Epoch 7291
Loss = 5.7041e-02, PNorm = 709.8641, GNorm = 2.1933, lr_0 = 9.9495e-04
Validation binary_cross_entropy = 0.067645
Epoch 7292
Loss = 4.2522e-02, PNorm = 709.9152, GNorm = 2.0341, lr_0 = 9.9495e-04
Validation binary_cross_entropy = 0.077183
Epoch 7293
Loss = 2.4689e-02, PNorm = 709.9505, GNorm = 0.5435, lr_0 = 9.9495e-04
Validation binary_cross_entropy = 0.076092
Epoch 7294
Loss = 3.0770e-02, PNorm = 709.9715, GNorm = 0.3621, lr_0 = 9.9495e-04
Validation binary_cross_entropy = 0.077380
Epoch 7295
Loss = 6.9246e-02, PNorm = 709.9951, GNorm = 3.1833, lr_0 = 9.9495e-04
Validation binary_cross_entropy = 0.073353
Epoch 7296
Loss = 4.1534e-02, PNorm = 710.0196, GNorm = 2.7673, lr_0 = 9.9495e-04
Validation binary_cross_entropy = 0.079464
Epoch 7297
Loss = 4.0105e-02, PNorm = 710.0698, GNorm = 2.8743, lr_0 = 9.9495e-04
Validation binary_cross_entropy = 0.101681
Epoch 7298
Loss = 1.4844e-01, PNorm = 710.1252, GNorm = 6.9204, lr_0 = 9.9495e-04
Validation binary_cross_entropy = 0.079156
Epoch 7299
Loss = 1.2200e-02, PNorm = 710.1712, GNorm = 0.3217, lr_0 = 9.9495e-04
Loss = 2.9222e-02, PNorm = 710.2158, GNorm = 0.4200, lr_0 = 9.9494e-04
Validation binary_cross_entropy = 0.081657
Epoch 7300
Loss = 5.7130e-02, PNorm = 710.2665, GNorm = 1.7053, lr_0 = 9.9494e-04
Validation binary_cross_entropy = 0.083221
Epoch 7301
Loss = 1.1760e-01, PNorm = 710.3005, GNorm = 1.9213, lr_0 = 9.9494e-04
Validation binary_cross_entropy = 0.072426
Epoch 7302
Loss = 5.3696e-02, PNorm = 710.3383, GNorm = 1.4862, lr_0 = 9.9494e-04
Validation binary_cross_entropy = 0.077685
Epoch 7303
Loss = 1.7307e-02, PNorm = 710.3810, GNorm = 1.2342, lr_0 = 9.9494e-04
Validation binary_cross_entropy = 0.093668
Epoch 7304
Loss = 4.1658e-02, PNorm = 710.4108, GNorm = 0.1003, lr_0 = 9.9494e-04
Validation binary_cross_entropy = 0.081215
Epoch 7305
Loss = 6.1872e-02, PNorm = 710.4383, GNorm = 2.0013, lr_0 = 9.9494e-04
Validation binary_cross_entropy = 0.083367
Epoch 7306
Loss = 2.8482e-02, PNorm = 710.4645, GNorm = 1.5743, lr_0 = 9.9494e-04
Validation binary_cross_entropy = 0.077971
Epoch 7307
Loss = 8.6078e-03, PNorm = 710.4899, GNorm = 0.8167, lr_0 = 9.9494e-04
Validation binary_cross_entropy = 0.086223
Epoch 7308
Loss = 2.0345e-03, PNorm = 710.5247, GNorm = 0.1179, lr_0 = 9.9494e-04
Validation binary_cross_entropy = 0.084989
Epoch 7309
Loss = 2.7041e-03, PNorm = 710.5504, GNorm = 0.1131, lr_0 = 9.9494e-04
Loss = 6.7243e-02, PNorm = 710.5662, GNorm = 0.5757, lr_0 = 9.9494e-04
Validation binary_cross_entropy = 0.072030
Epoch 7310
Loss = 4.7395e-02, PNorm = 710.6067, GNorm = 0.3125, lr_0 = 9.9494e-04
Validation binary_cross_entropy = 0.076206
Epoch 7311
Loss = 2.8480e-02, PNorm = 710.6409, GNorm = 0.2734, lr_0 = 9.9494e-04
Validation binary_cross_entropy = 0.081497
Epoch 7312
Loss = 2.7152e-02, PNorm = 710.6707, GNorm = 1.8712, lr_0 = 9.9494e-04
Validation binary_cross_entropy = 0.077044
Epoch 7313
Loss = 8.7115e-02, PNorm = 710.6970, GNorm = 23.2742, lr_0 = 9.9493e-04
Validation binary_cross_entropy = 0.072290
Epoch 7314
Loss = 8.1448e-02, PNorm = 710.7529, GNorm = 4.4088, lr_0 = 9.9493e-04
Validation binary_cross_entropy = 0.082562
Epoch 7315
Loss = 4.8572e-02, PNorm = 710.8255, GNorm = 3.6936, lr_0 = 9.9493e-04
Validation binary_cross_entropy = 0.124557
Epoch 7316
Loss = 1.6439e-01, PNorm = 710.8670, GNorm = 3.2723, lr_0 = 9.9493e-04
Validation binary_cross_entropy = 0.074287
Epoch 7317
Loss = 1.5827e-01, PNorm = 710.9108, GNorm = 10.2073, lr_0 = 9.9493e-04
Validation binary_cross_entropy = 0.081325
Epoch 7318
Loss = 4.7325e-02, PNorm = 710.9730, GNorm = 4.4793, lr_0 = 9.9493e-04
Validation binary_cross_entropy = 0.100246
Epoch 7319
Loss = 2.2974e-01, PNorm = 711.0268, GNorm = 6.9122, lr_0 = 9.9493e-04
Loss = 5.3027e-02, PNorm = 711.0612, GNorm = 1.0269, lr_0 = 9.9493e-04
Validation binary_cross_entropy = 0.068023
Epoch 7320
Loss = 1.9382e-02, PNorm = 711.0928, GNorm = 4.5980, lr_0 = 9.9493e-04
Validation binary_cross_entropy = 0.097744
Epoch 7321
Loss = 5.9155e-02, PNorm = 711.1298, GNorm = 4.9296, lr_0 = 9.9493e-04
Validation binary_cross_entropy = 0.088651
Epoch 7322
Loss = 4.2323e-02, PNorm = 711.1861, GNorm = 2.9658, lr_0 = 9.9493e-04
Validation binary_cross_entropy = 0.099303
Epoch 7323
Loss = 4.1650e-02, PNorm = 711.2314, GNorm = 1.5892, lr_0 = 9.9493e-04
Validation binary_cross_entropy = 0.084067
Epoch 7324
Loss = 7.5780e-02, PNorm = 711.2739, GNorm = 1.9664, lr_0 = 9.9493e-04
Validation binary_cross_entropy = 0.098593
Epoch 7325
Loss = 9.9141e-03, PNorm = 711.3147, GNorm = 0.0963, lr_0 = 9.9493e-04
Validation binary_cross_entropy = 0.097018
Epoch 7326
Loss = 6.8448e-02, PNorm = 711.3461, GNorm = 1.5451, lr_0 = 9.9492e-04
Validation binary_cross_entropy = 0.093131
Epoch 7327
Loss = 1.0824e-01, PNorm = 711.3717, GNorm = 0.3536, lr_0 = 9.9492e-04
Validation binary_cross_entropy = 0.084373
Epoch 7328
Loss = 2.5464e-02, PNorm = 711.3994, GNorm = 1.1945, lr_0 = 9.9492e-04
Validation binary_cross_entropy = 0.088718
Epoch 7329
Loss = 1.6528e-01, PNorm = 711.4249, GNorm = 7.7020, lr_0 = 9.9492e-04
Loss = 8.7887e-03, PNorm = 711.4452, GNorm = 3.1899, lr_0 = 9.9492e-04
Validation binary_cross_entropy = 0.159133
Epoch 7330
Loss = 9.1739e-02, PNorm = 711.4603, GNorm = 1.0037, lr_0 = 9.9492e-04
Validation binary_cross_entropy = 0.096259
Epoch 7331
Loss = 9.6192e-02, PNorm = 711.4940, GNorm = 1.0346, lr_0 = 9.9492e-04
Validation binary_cross_entropy = 0.088378
Epoch 7332
Loss = 5.0556e-02, PNorm = 711.5365, GNorm = 3.0281, lr_0 = 9.9492e-04
Validation binary_cross_entropy = 0.062063
Epoch 7333
Loss = 7.2765e-02, PNorm = 711.5864, GNorm = 2.2376, lr_0 = 9.9492e-04
Validation binary_cross_entropy = 0.073646
Epoch 7334
Loss = 1.8760e-02, PNorm = 711.6334, GNorm = 0.3816, lr_0 = 9.9492e-04
Validation binary_cross_entropy = 0.101088
Epoch 7335
Loss = 1.4763e-02, PNorm = 711.6665, GNorm = 0.7140, lr_0 = 9.9492e-04
Validation binary_cross_entropy = 0.170606
Epoch 7336
Loss = 1.3498e-01, PNorm = 711.7062, GNorm = 4.9856, lr_0 = 9.9492e-04
Validation binary_cross_entropy = 0.089895
Epoch 7337
Loss = 1.9718e-01, PNorm = 711.7391, GNorm = 0.8195, lr_0 = 9.9492e-04
Validation binary_cross_entropy = 0.064008
Epoch 7338
Loss = 3.6999e-02, PNorm = 711.7771, GNorm = 0.8749, lr_0 = 9.9492e-04
Validation binary_cross_entropy = 0.075495
Epoch 7339
Loss = 1.1797e-01, PNorm = 711.8126, GNorm = 2.2275, lr_0 = 9.9492e-04
Loss = 3.1069e-02, PNorm = 711.8397, GNorm = 0.1418, lr_0 = 9.9491e-04
Validation binary_cross_entropy = 0.079008
Epoch 7340
Loss = 2.4683e-02, PNorm = 711.8619, GNorm = 0.6866, lr_0 = 9.9491e-04
Validation binary_cross_entropy = 0.099496
Epoch 7341
Loss = 2.8112e-02, PNorm = 711.8812, GNorm = 1.2843, lr_0 = 9.9491e-04
Validation binary_cross_entropy = 0.121345
Epoch 7342
Loss = 6.0822e-02, PNorm = 711.8966, GNorm = 4.1649, lr_0 = 9.9491e-04
Validation binary_cross_entropy = 0.084084
Epoch 7343
Loss = 2.9277e-02, PNorm = 711.9375, GNorm = 1.1804, lr_0 = 9.9491e-04
Validation binary_cross_entropy = 0.096408
Epoch 7344
Loss = 3.7284e-02, PNorm = 711.9799, GNorm = 0.0551, lr_0 = 9.9491e-04
Validation binary_cross_entropy = 0.089662
Epoch 7345
Loss = 8.2770e-03, PNorm = 712.0096, GNorm = 0.7702, lr_0 = 9.9491e-04
Validation binary_cross_entropy = 0.085822
Epoch 7346
Loss = 8.9527e-03, PNorm = 712.0274, GNorm = 0.0406, lr_0 = 9.9491e-04
Validation binary_cross_entropy = 0.085977
Epoch 7347
Loss = 6.2788e-03, PNorm = 712.0407, GNorm = 0.8085, lr_0 = 9.9491e-04
Validation binary_cross_entropy = 0.092456
Epoch 7348
Loss = 7.0934e-03, PNorm = 712.0578, GNorm = 0.6854, lr_0 = 9.9491e-04
Validation binary_cross_entropy = 0.092876
Epoch 7349
Loss = 1.8896e-03, PNorm = 712.0798, GNorm = 0.0468, lr_0 = 9.9491e-04
Loss = 3.6243e-02, PNorm = 712.1056, GNorm = 3.2081, lr_0 = 9.9491e-04
Validation binary_cross_entropy = 0.093254
Epoch 7350
Loss = 8.0999e-02, PNorm = 712.1203, GNorm = 1.3105, lr_0 = 9.9491e-04
Validation binary_cross_entropy = 0.070578
Epoch 7351
Loss = 2.5209e-02, PNorm = 712.1461, GNorm = 2.4938, lr_0 = 9.9491e-04
Validation binary_cross_entropy = 0.068449
Epoch 7352
Loss = 2.0913e-02, PNorm = 712.1820, GNorm = 0.3848, lr_0 = 9.9490e-04
Validation binary_cross_entropy = 0.087502
Epoch 7353
Loss = 6.4962e-02, PNorm = 712.2110, GNorm = 8.2757, lr_0 = 9.9490e-04
Validation binary_cross_entropy = 0.085137
Epoch 7354
Loss = 2.4665e-02, PNorm = 712.2410, GNorm = 0.2034, lr_0 = 9.9490e-04
Validation binary_cross_entropy = 0.085310
Epoch 7355
Loss = 1.0474e-02, PNorm = 712.2732, GNorm = 0.1229, lr_0 = 9.9490e-04
Validation binary_cross_entropy = 0.093132
Epoch 7356
Loss = 5.9206e-02, PNorm = 712.3055, GNorm = 4.3657, lr_0 = 9.9490e-04
Validation binary_cross_entropy = 0.091218
Epoch 7357
Loss = 1.7248e-02, PNorm = 712.3455, GNorm = 0.0824, lr_0 = 9.9490e-04
Validation binary_cross_entropy = 0.099330
Epoch 7358
Loss = 8.9545e-04, PNorm = 712.3709, GNorm = 0.0419, lr_0 = 9.9490e-04
Validation binary_cross_entropy = 0.120182
Epoch 7359
Loss = 1.6583e-01, PNorm = 712.3888, GNorm = 2.5443, lr_0 = 9.9490e-04
Loss = 2.6536e-02, PNorm = 712.4020, GNorm = 0.0150, lr_0 = 9.9490e-04
Validation binary_cross_entropy = 0.096774
Epoch 7360
Loss = 1.9582e-02, PNorm = 712.4294, GNorm = 0.4763, lr_0 = 9.9490e-04
Validation binary_cross_entropy = 0.097715
Epoch 7361
Loss = 5.7519e-02, PNorm = 712.4610, GNorm = 1.1780, lr_0 = 9.9490e-04
Validation binary_cross_entropy = 0.089720
Epoch 7362
Loss = 1.3652e-02, PNorm = 712.4843, GNorm = 0.2397, lr_0 = 9.9490e-04
Validation binary_cross_entropy = 0.092826
Epoch 7363
Loss = 1.5940e-02, PNorm = 712.5135, GNorm = 0.1214, lr_0 = 9.9490e-04
Validation binary_cross_entropy = 0.099361
Epoch 7364
Loss = 3.4634e-02, PNorm = 712.5360, GNorm = 0.9277, lr_0 = 9.9490e-04
Validation binary_cross_entropy = 0.091193
Epoch 7365
Loss = 2.1332e-02, PNorm = 712.5589, GNorm = 2.2190, lr_0 = 9.9490e-04
Validation binary_cross_entropy = 0.091019
Epoch 7366
Loss = 1.9360e-02, PNorm = 712.5913, GNorm = 3.2862, lr_0 = 9.9489e-04
Validation binary_cross_entropy = 0.095172
Epoch 7367
Loss = 7.4540e-03, PNorm = 712.6190, GNorm = 0.9951, lr_0 = 9.9489e-04
Validation binary_cross_entropy = 0.095962
Epoch 7368
Loss = 7.5569e-03, PNorm = 712.6387, GNorm = 0.0514, lr_0 = 9.9489e-04
Validation binary_cross_entropy = 0.096430
Epoch 7369
Loss = 1.4022e-03, PNorm = 712.6645, GNorm = 0.0601, lr_0 = 9.9489e-04
Loss = 9.7947e-03, PNorm = 712.6902, GNorm = 0.4242, lr_0 = 9.9489e-04
Validation binary_cross_entropy = 0.102999
Epoch 7370
Loss = 3.2212e-02, PNorm = 712.7160, GNorm = 0.0976, lr_0 = 9.9489e-04
Validation binary_cross_entropy = 0.096367
Epoch 7371
Loss = 1.9192e-02, PNorm = 712.7526, GNorm = 5.1043, lr_0 = 9.9489e-04
Validation binary_cross_entropy = 0.107101
Epoch 7372
Loss = 3.4847e-02, PNorm = 712.7913, GNorm = 0.7638, lr_0 = 9.9489e-04
Validation binary_cross_entropy = 0.117530
Epoch 7373
Loss = 4.6102e-02, PNorm = 712.8258, GNorm = 0.2174, lr_0 = 9.9489e-04
Validation binary_cross_entropy = 0.095949
Epoch 7374
Loss = 1.6310e-02, PNorm = 712.8541, GNorm = 1.6291, lr_0 = 9.9489e-04
Validation binary_cross_entropy = 0.085190
Epoch 7375
Loss = 1.3902e-02, PNorm = 712.8897, GNorm = 1.2295, lr_0 = 9.9489e-04
Validation binary_cross_entropy = 0.094152
Epoch 7376
Loss = 2.5722e-02, PNorm = 712.9213, GNorm = 0.0339, lr_0 = 9.9489e-04
Validation binary_cross_entropy = 0.119764
Epoch 7377
Loss = 6.2511e-03, PNorm = 712.9606, GNorm = 0.0197, lr_0 = 9.9489e-04
Validation binary_cross_entropy = 0.094933
Epoch 7378
Loss = 2.0814e-02, PNorm = 713.0025, GNorm = 1.2826, lr_0 = 9.9489e-04
Validation binary_cross_entropy = 0.115271
Epoch 7379
Loss = 4.3419e-02, PNorm = 713.0530, GNorm = 1.3500, lr_0 = 9.9489e-04
Loss = 2.3903e-02, PNorm = 713.0890, GNorm = 0.0907, lr_0 = 9.9488e-04
Validation binary_cross_entropy = 0.122015
Epoch 7380
Loss = 1.0371e-01, PNorm = 713.1203, GNorm = 8.7875, lr_0 = 9.9488e-04
Validation binary_cross_entropy = 0.106249
Epoch 7381
Loss = 4.9835e-02, PNorm = 713.1712, GNorm = 6.0712, lr_0 = 9.9488e-04
Validation binary_cross_entropy = 0.099586
Epoch 7382
Loss = 3.9174e-02, PNorm = 713.2389, GNorm = 0.0934, lr_0 = 9.9488e-04
Validation binary_cross_entropy = 0.094456
Epoch 7383
Loss = 3.6023e-02, PNorm = 713.2783, GNorm = 1.3042, lr_0 = 9.9488e-04
Validation binary_cross_entropy = 0.086348
Epoch 7384
Loss = 5.2219e-02, PNorm = 713.3160, GNorm = 4.7157, lr_0 = 9.9488e-04
Validation binary_cross_entropy = 0.092192
Epoch 7385
Loss = 1.3705e-02, PNorm = 713.3451, GNorm = 0.1099, lr_0 = 9.9488e-04
Validation binary_cross_entropy = 0.075083
Epoch 7386
Loss = 8.3887e-02, PNorm = 713.3912, GNorm = 3.4834, lr_0 = 9.9488e-04
Validation binary_cross_entropy = 0.082852
Epoch 7387
Loss = 6.6707e-02, PNorm = 713.4645, GNorm = 2.8004, lr_0 = 9.9488e-04
Validation binary_cross_entropy = 0.080928
Epoch 7388
Loss = 4.3301e-02, PNorm = 713.5038, GNorm = 2.5196, lr_0 = 9.9488e-04
Validation binary_cross_entropy = 0.075395
Epoch 7389
Loss = 1.3191e-02, PNorm = 713.5314, GNorm = 0.7067, lr_0 = 9.9488e-04
Loss = 2.9008e-02, PNorm = 713.5550, GNorm = 4.0695, lr_0 = 9.9488e-04
Validation binary_cross_entropy = 0.080017
Epoch 7390
Loss = 8.0099e-03, PNorm = 713.5827, GNorm = 0.4189, lr_0 = 9.9488e-04
Validation binary_cross_entropy = 0.091174
Epoch 7391
Loss = 4.5038e-02, PNorm = 713.6023, GNorm = 4.7595, lr_0 = 9.9488e-04
Validation binary_cross_entropy = 0.090154
Epoch 7392
Loss = 3.9878e-02, PNorm = 713.6214, GNorm = 1.5058, lr_0 = 9.9487e-04
Validation binary_cross_entropy = 0.078079
Epoch 7393
Loss = 6.2835e-02, PNorm = 713.6458, GNorm = 0.6299, lr_0 = 9.9487e-04
Validation binary_cross_entropy = 0.078180
Epoch 7394
Loss = 1.6966e-02, PNorm = 713.6793, GNorm = 0.6039, lr_0 = 9.9487e-04
Validation binary_cross_entropy = 0.077855
Epoch 7395
Loss = 1.0375e-03, PNorm = 713.7018, GNorm = 0.0518, lr_0 = 9.9487e-04
Validation binary_cross_entropy = 0.077310
Epoch 7396
Loss = 9.1852e-03, PNorm = 713.7221, GNorm = 1.3377, lr_0 = 9.9487e-04
Validation binary_cross_entropy = 0.080600
Epoch 7397
Loss = 7.7740e-03, PNorm = 713.7442, GNorm = 1.5442, lr_0 = 9.9487e-04
Validation binary_cross_entropy = 0.087115
Epoch 7398
Loss = 6.6798e-03, PNorm = 713.7701, GNorm = 0.5323, lr_0 = 9.9487e-04
Validation binary_cross_entropy = 0.090722
Epoch 7399
Loss = 2.0173e-04, PNorm = 713.7871, GNorm = 0.0175, lr_0 = 9.9487e-04
Loss = 3.2467e-02, PNorm = 713.8041, GNorm = 0.0510, lr_0 = 9.9487e-04
Validation binary_cross_entropy = 0.083304
Epoch 7400
Loss = 1.7831e-02, PNorm = 713.8227, GNorm = 0.9474, lr_0 = 9.9487e-04
Validation binary_cross_entropy = 0.083656
Epoch 7401
Loss = 4.4459e-02, PNorm = 713.8526, GNorm = 0.0586, lr_0 = 9.9487e-04
Validation binary_cross_entropy = 0.086226
Epoch 7402
Loss = 3.8231e-02, PNorm = 713.8837, GNorm = 1.8255, lr_0 = 9.9487e-04
Validation binary_cross_entropy = 0.087756
Epoch 7403
Loss = 2.2427e-02, PNorm = 713.9216, GNorm = 0.0193, lr_0 = 9.9487e-04
Validation binary_cross_entropy = 0.090123
Epoch 7404
Loss = 2.4703e-02, PNorm = 713.9600, GNorm = 1.8820, lr_0 = 9.9487e-04
Validation binary_cross_entropy = 0.095194
Epoch 7405
Loss = 1.1615e-02, PNorm = 713.9936, GNorm = 0.3625, lr_0 = 9.9487e-04
Validation binary_cross_entropy = 0.084425
Epoch 7406
Loss = 1.2081e-02, PNorm = 714.0312, GNorm = 0.2287, lr_0 = 9.9486e-04
Validation binary_cross_entropy = 0.080526
Epoch 7407
Loss = 2.9739e-02, PNorm = 714.0708, GNorm = 2.8231, lr_0 = 9.9486e-04
Validation binary_cross_entropy = 0.094164
Epoch 7408
Loss = 1.5977e-02, PNorm = 714.1084, GNorm = 1.3344, lr_0 = 9.9486e-04
Validation binary_cross_entropy = 0.072793
Epoch 7409
Loss = 5.1905e-02, PNorm = 714.1412, GNorm = 0.7754, lr_0 = 9.9486e-04
Loss = 2.0569e-02, PNorm = 714.1654, GNorm = 2.7450, lr_0 = 9.9486e-04
Validation binary_cross_entropy = 0.071495
Epoch 7410
Loss = 7.3328e-02, PNorm = 714.2073, GNorm = 0.7847, lr_0 = 9.9486e-04
Validation binary_cross_entropy = 0.074607
Epoch 7411
Loss = 2.8795e-02, PNorm = 714.2477, GNorm = 0.0455, lr_0 = 9.9486e-04
Validation binary_cross_entropy = 0.084943
Epoch 7412
Loss = 2.2324e-02, PNorm = 714.2780, GNorm = 1.6518, lr_0 = 9.9486e-04
Validation binary_cross_entropy = 0.097078
Epoch 7413
Loss = 1.2327e-02, PNorm = 714.3042, GNorm = 0.0063, lr_0 = 9.9486e-04
Validation binary_cross_entropy = 0.099812
Epoch 7414
Loss = 1.8666e-03, PNorm = 714.3231, GNorm = 0.0238, lr_0 = 9.9486e-04
Validation binary_cross_entropy = 0.087481
Epoch 7415
Loss = 3.5271e-03, PNorm = 714.3396, GNorm = 0.6916, lr_0 = 9.9486e-04
Validation binary_cross_entropy = 0.091710
Epoch 7416
Loss = 4.7686e-03, PNorm = 714.3552, GNorm = 1.1468, lr_0 = 9.9486e-04
Validation binary_cross_entropy = 0.091679
Epoch 7417
Loss = 9.6213e-02, PNorm = 714.3731, GNorm = 6.3377, lr_0 = 9.9486e-04
Validation binary_cross_entropy = 0.074976
Epoch 7418
Loss = 4.5940e-02, PNorm = 714.4249, GNorm = 1.8310, lr_0 = 9.9486e-04
Validation binary_cross_entropy = 0.095286
Epoch 7419
Loss = 1.2859e-02, PNorm = 714.4891, GNorm = 0.7166, lr_0 = 9.9485e-04
Loss = 1.5358e-02, PNorm = 714.5414, GNorm = 0.1720, lr_0 = 9.9485e-04
Validation binary_cross_entropy = 0.202394
Epoch 7420
Loss = 2.8914e-02, PNorm = 714.5702, GNorm = 0.7632, lr_0 = 9.9485e-04
Validation binary_cross_entropy = 0.132865
Epoch 7421
Loss = 4.0391e-02, PNorm = 714.6053, GNorm = 5.2516, lr_0 = 9.9485e-04
Validation binary_cross_entropy = 0.092223
Epoch 7422
Loss = 2.6526e-02, PNorm = 714.6719, GNorm = 0.0807, lr_0 = 9.9485e-04
Validation binary_cross_entropy = 0.107734
Epoch 7423
Loss = 7.0930e-02, PNorm = 714.7191, GNorm = 0.2583, lr_0 = 9.9485e-04
Validation binary_cross_entropy = 0.104285
Epoch 7424
Loss = 9.1163e-02, PNorm = 714.7911, GNorm = 2.8007, lr_0 = 9.9485e-04
Validation binary_cross_entropy = 0.074290
Epoch 7425
Loss = 7.3896e-02, PNorm = 714.8725, GNorm = 1.6510, lr_0 = 9.9485e-04
Validation binary_cross_entropy = 0.082330
Epoch 7426
Loss = 2.2714e-02, PNorm = 714.9430, GNorm = 0.1360, lr_0 = 9.9485e-04
Validation binary_cross_entropy = 0.095400
Epoch 7427
Loss = 2.8893e-02, PNorm = 714.9826, GNorm = 0.7586, lr_0 = 9.9485e-04
Validation binary_cross_entropy = 0.093764
Epoch 7428
Loss = 8.6237e-03, PNorm = 715.0118, GNorm = 1.1431, lr_0 = 9.9485e-04
Validation binary_cross_entropy = 0.086441
Epoch 7429
Loss = 4.1419e-02, PNorm = 715.0405, GNorm = 2.5457, lr_0 = 9.9485e-04
Loss = 6.5634e-02, PNorm = 715.0849, GNorm = 0.2414, lr_0 = 9.9485e-04
Validation binary_cross_entropy = 0.075105
Epoch 7430
Loss = 7.2953e-02, PNorm = 715.1410, GNorm = 4.8733, lr_0 = 9.9485e-04
Validation binary_cross_entropy = 0.083119
Epoch 7431
Loss = 3.8665e-02, PNorm = 715.1862, GNorm = 4.6581, lr_0 = 9.9485e-04
Validation binary_cross_entropy = 0.100983
Epoch 7432
Loss = 4.8295e-02, PNorm = 715.2169, GNorm = 0.1941, lr_0 = 9.9484e-04
Validation binary_cross_entropy = 0.072434
Epoch 7433
Loss = 1.4862e-02, PNorm = 715.2428, GNorm = 0.3651, lr_0 = 9.9484e-04
Validation binary_cross_entropy = 0.071475
Epoch 7434
Loss = 3.4304e-02, PNorm = 715.2784, GNorm = 0.2738, lr_0 = 9.9484e-04
Validation binary_cross_entropy = 0.081871
Epoch 7435
Loss = 5.4545e-03, PNorm = 715.3161, GNorm = 0.1160, lr_0 = 9.9484e-04
Validation binary_cross_entropy = 0.088603
Epoch 7436
Loss = 1.4804e-02, PNorm = 715.3449, GNorm = 0.8634, lr_0 = 9.9484e-04
Validation binary_cross_entropy = 0.084658
Epoch 7437
Loss = 5.1363e-03, PNorm = 715.3679, GNorm = 0.9145, lr_0 = 9.9484e-04
Validation binary_cross_entropy = 0.204138
Epoch 7438
Loss = 2.0322e-03, PNorm = 715.4046, GNorm = 0.1107, lr_0 = 9.9484e-04
Validation binary_cross_entropy = 0.081383
Epoch 7439
Loss = 4.4125e-02, PNorm = 715.5007, GNorm = 1.9923, lr_0 = 9.9484e-04
Loss = 5.5761e-01, PNorm = 715.9497, GNorm = 7.4760, lr_0 = 9.9484e-04
Validation binary_cross_entropy = 0.121172
Epoch 7440
Loss = 2.1571e-01, PNorm = 716.2517, GNorm = 10.4201, lr_0 = 9.9484e-04
Validation binary_cross_entropy = 0.239026
Epoch 7441
Loss = 2.9008e-01, PNorm = 716.4216, GNorm = 3.6900, lr_0 = 9.9484e-04
Validation binary_cross_entropy = 0.221733
Epoch 7442
Loss = 2.6683e-01, PNorm = 716.5451, GNorm = 4.7166, lr_0 = 9.9484e-04
Validation binary_cross_entropy = 0.142489
Epoch 7443
Loss = 1.6158e-01, PNorm = 716.6429, GNorm = 2.5330, lr_0 = 9.9484e-04
Validation binary_cross_entropy = 0.115857
Epoch 7444
Loss = 1.2801e-01, PNorm = 716.7175, GNorm = 1.6277, lr_0 = 9.9484e-04
Validation binary_cross_entropy = 0.150256
Epoch 7445
Loss = 1.5900e-01, PNorm = 716.7893, GNorm = 2.3025, lr_0 = 9.9484e-04
Validation binary_cross_entropy = 0.162938
Epoch 7446
Loss = 1.1776e-01, PNorm = 716.8502, GNorm = 2.8655, lr_0 = 9.9483e-04
Validation binary_cross_entropy = 0.122772
Epoch 7447
Loss = 7.5003e-02, PNorm = 716.9072, GNorm = 0.8884, lr_0 = 9.9483e-04
Validation binary_cross_entropy = 0.138444
Epoch 7448
Loss = 9.2240e-02, PNorm = 716.9617, GNorm = 2.0239, lr_0 = 9.9483e-04
Validation binary_cross_entropy = 0.104898
Epoch 7449
Loss = 1.1290e-01, PNorm = 716.9992, GNorm = 4.3964, lr_0 = 9.9483e-04
Loss = 1.0750e-01, PNorm = 717.0430, GNorm = 2.2817, lr_0 = 9.9483e-04
Validation binary_cross_entropy = 0.131365
Epoch 7450
Loss = 9.1738e-02, PNorm = 717.0766, GNorm = 0.3405, lr_0 = 9.9483e-04
Validation binary_cross_entropy = 0.125514
Epoch 7451
Loss = 1.2834e-01, PNorm = 717.1146, GNorm = 5.1290, lr_0 = 9.9483e-04
Validation binary_cross_entropy = 0.130023
Epoch 7452
Loss = 2.0119e-01, PNorm = 717.1479, GNorm = 1.6585, lr_0 = 9.9483e-04
Validation binary_cross_entropy = 0.118628
Epoch 7453
Loss = 8.8292e-02, PNorm = 717.2181, GNorm = 4.8667, lr_0 = 9.9483e-04
Validation binary_cross_entropy = 0.126671
Epoch 7454
Loss = 1.3664e-01, PNorm = 717.2875, GNorm = 2.1300, lr_0 = 9.9483e-04
Validation binary_cross_entropy = 0.100550
Epoch 7455
Loss = 9.0110e-02, PNorm = 717.3512, GNorm = 0.8652, lr_0 = 9.9483e-04
Validation binary_cross_entropy = 0.127439
Epoch 7456
Loss = 1.0287e-01, PNorm = 717.4039, GNorm = 2.1486, lr_0 = 9.9483e-04
Validation binary_cross_entropy = 0.108871
Epoch 7457
Loss = 5.5374e-02, PNorm = 717.4357, GNorm = 1.7523, lr_0 = 9.9483e-04
Validation binary_cross_entropy = 0.109786
Epoch 7458
Loss = 7.5874e-02, PNorm = 717.4746, GNorm = 2.2814, lr_0 = 9.9483e-04
Validation binary_cross_entropy = 0.160355
Epoch 7459
Loss = 7.1694e-02, PNorm = 717.5140, GNorm = 4.0520, lr_0 = 9.9482e-04
Loss = 4.0063e-02, PNorm = 717.5367, GNorm = 0.7311, lr_0 = 9.9482e-04
Validation binary_cross_entropy = 0.118057
Epoch 7460
Loss = 6.0705e-02, PNorm = 717.5682, GNorm = 0.1512, lr_0 = 9.9482e-04
Validation binary_cross_entropy = 0.133172
Epoch 7461
Loss = 1.3487e-01, PNorm = 717.6032, GNorm = 16.9567, lr_0 = 9.9482e-04
Validation binary_cross_entropy = 0.124743
Epoch 7462
Loss = 4.3579e-02, PNorm = 717.6404, GNorm = 1.5847, lr_0 = 9.9482e-04
Validation binary_cross_entropy = 0.092986
Epoch 7463
Loss = 3.6969e-02, PNorm = 717.6839, GNorm = 2.0997, lr_0 = 9.9482e-04
Validation binary_cross_entropy = 0.100321
Epoch 7464
Loss = 8.3904e-02, PNorm = 717.7200, GNorm = 0.1449, lr_0 = 9.9482e-04
Validation binary_cross_entropy = 0.098204
Epoch 7465
Loss = 4.3434e-02, PNorm = 717.7551, GNorm = 3.2266, lr_0 = 9.9482e-04
Validation binary_cross_entropy = 0.103498
Epoch 7466
Loss = 5.5762e-02, PNorm = 717.7888, GNorm = 1.6655, lr_0 = 9.9482e-04
Validation binary_cross_entropy = 0.112508
Epoch 7467
Loss = 6.4844e-02, PNorm = 717.8258, GNorm = 0.1380, lr_0 = 9.9482e-04
Validation binary_cross_entropy = 0.104213
Epoch 7468
Loss = 8.3836e-02, PNorm = 717.8574, GNorm = 1.8185, lr_0 = 9.9482e-04
Validation binary_cross_entropy = 0.110302
Epoch 7469
Loss = 7.0056e-03, PNorm = 717.8945, GNorm = 0.5296, lr_0 = 9.9482e-04
Loss = 5.5389e-02, PNorm = 717.9270, GNorm = 1.0546, lr_0 = 9.9482e-04
Validation binary_cross_entropy = 0.124679
Epoch 7470
Loss = 4.4695e-02, PNorm = 717.9536, GNorm = 2.4698, lr_0 = 9.9482e-04
Validation binary_cross_entropy = 0.123999
Epoch 7471
Loss = 1.0398e-01, PNorm = 717.9817, GNorm = 3.2537, lr_0 = 9.9482e-04
Validation binary_cross_entropy = 0.098662
Epoch 7472
Loss = 3.5280e-02, PNorm = 718.0186, GNorm = 2.6875, lr_0 = 9.9481e-04
Validation binary_cross_entropy = 0.097054
Epoch 7473
Loss = 2.5284e-02, PNorm = 718.0683, GNorm = 0.2680, lr_0 = 9.9481e-04
Validation binary_cross_entropy = 0.125462
Epoch 7474
Loss = 4.2642e-02, PNorm = 718.0996, GNorm = 0.8641, lr_0 = 9.9481e-04
Validation binary_cross_entropy = 0.117286
Epoch 7475
Loss = 4.7693e-02, PNorm = 718.1435, GNorm = 3.5943, lr_0 = 9.9481e-04
Validation binary_cross_entropy = 0.122155
Epoch 7476
Loss = 3.1336e-02, PNorm = 718.1750, GNorm = 2.3522, lr_0 = 9.9481e-04
Validation binary_cross_entropy = 0.130930
Epoch 7477
Loss = 5.6291e-03, PNorm = 718.2234, GNorm = 0.8879, lr_0 = 9.9481e-04
Validation binary_cross_entropy = 0.142735
Epoch 7478
Loss = 2.6535e-02, PNorm = 718.2597, GNorm = 2.8966, lr_0 = 9.9481e-04
Validation binary_cross_entropy = 0.124789
Epoch 7479
Loss = 1.4188e-03, PNorm = 718.2896, GNorm = 0.1267, lr_0 = 9.9481e-04
Loss = 5.7993e-02, PNorm = 718.3224, GNorm = 0.7040, lr_0 = 9.9481e-04
Validation binary_cross_entropy = 0.113444
Epoch 7480
Loss = 3.4384e-02, PNorm = 718.3584, GNorm = 6.8482, lr_0 = 9.9481e-04
Validation binary_cross_entropy = 0.126870
Epoch 7481
Loss = 2.6690e-01, PNorm = 718.4092, GNorm = 3.1343, lr_0 = 9.9481e-04
Validation binary_cross_entropy = 0.126839
Epoch 7482
Loss = 1.1855e-01, PNorm = 718.4848, GNorm = 1.5527, lr_0 = 9.9481e-04
Validation binary_cross_entropy = 0.149654
Epoch 7483
Loss = 1.3167e-01, PNorm = 718.5493, GNorm = 2.3871, lr_0 = 9.9481e-04
Validation binary_cross_entropy = 0.144506
Epoch 7484
Loss = 6.7244e-02, PNorm = 718.6045, GNorm = 2.1753, lr_0 = 9.9481e-04
Validation binary_cross_entropy = 0.136333
Epoch 7485
Loss = 1.0589e-01, PNorm = 718.6501, GNorm = 1.0237, lr_0 = 9.9480e-04
Validation binary_cross_entropy = 0.129191
Epoch 7486
Loss = 6.8301e-02, PNorm = 718.6974, GNorm = 0.5931, lr_0 = 9.9480e-04
Validation binary_cross_entropy = 0.107339
Epoch 7487
Loss = 4.1632e-02, PNorm = 718.7425, GNorm = 0.6464, lr_0 = 9.9480e-04
Validation binary_cross_entropy = 0.132391
Epoch 7488
Loss = 2.7484e-02, PNorm = 718.7859, GNorm = 1.5317, lr_0 = 9.9480e-04
Validation binary_cross_entropy = 0.117189
Epoch 7489
Loss = 2.9224e-02, PNorm = 718.8135, GNorm = 0.8396, lr_0 = 9.9480e-04
Loss = 5.2475e-02, PNorm = 718.8442, GNorm = 2.4441, lr_0 = 9.9480e-04
Validation binary_cross_entropy = 0.109071
Epoch 7490
Loss = 6.1625e-02, PNorm = 718.8764, GNorm = 0.5051, lr_0 = 9.9480e-04
Validation binary_cross_entropy = 0.108182
Epoch 7491
Loss = 3.1691e-02, PNorm = 718.9135, GNorm = 1.8115, lr_0 = 9.9480e-04
Validation binary_cross_entropy = 0.107939
Epoch 7492
Loss = 1.4055e-01, PNorm = 718.9731, GNorm = 4.2560, lr_0 = 9.9480e-04
Validation binary_cross_entropy = 0.115235
Epoch 7493
Loss = 9.1484e-02, PNorm = 719.0766, GNorm = 9.0044, lr_0 = 9.9480e-04
Validation binary_cross_entropy = 0.113368
Epoch 7494
Loss = 2.2931e-01, PNorm = 719.1553, GNorm = 8.4594, lr_0 = 9.9480e-04
Validation binary_cross_entropy = 0.127864
Epoch 7495
Loss = 4.6025e-02, PNorm = 719.2262, GNorm = 1.9260, lr_0 = 9.9480e-04
Validation binary_cross_entropy = 0.106452
Epoch 7496
Loss = 6.4609e-02, PNorm = 719.2763, GNorm = 5.8633, lr_0 = 9.9480e-04
Validation binary_cross_entropy = 0.140342
Epoch 7497
Loss = 7.0703e-02, PNorm = 719.3209, GNorm = 6.7837, lr_0 = 9.9480e-04
Validation binary_cross_entropy = 0.154402
Epoch 7498
Loss = 4.1670e-02, PNorm = 719.3495, GNorm = 2.0208, lr_0 = 9.9480e-04
Validation binary_cross_entropy = 0.109063
Epoch 7499
Loss = 1.0433e+00, PNorm = 719.3721, GNorm = 33.2813, lr_0 = 9.9479e-04
Loss = 7.4938e-02, PNorm = 719.4178, GNorm = 2.3029, lr_0 = 9.9479e-04
Validation binary_cross_entropy = 0.084689
Epoch 7500
Loss = 5.5869e-02, PNorm = 719.4747, GNorm = 1.4334, lr_0 = 9.9479e-04
Validation binary_cross_entropy = 0.124218
Epoch 7501
Loss = 5.3859e-02, PNorm = 719.5068, GNorm = 2.6158, lr_0 = 9.9479e-04
Validation binary_cross_entropy = 0.101774
Epoch 7502
Loss = 4.6074e-02, PNorm = 719.5390, GNorm = 15.9464, lr_0 = 9.9479e-04
Validation binary_cross_entropy = 0.123345
Epoch 7503
Loss = 3.5264e-02, PNorm = 719.5727, GNorm = 1.1285, lr_0 = 9.9479e-04
Validation binary_cross_entropy = 0.121492
Epoch 7504
Loss = 7.4157e-02, PNorm = 719.5964, GNorm = 2.7942, lr_0 = 9.9479e-04
Validation binary_cross_entropy = 0.112960
Epoch 7505
Loss = 5.0984e-02, PNorm = 719.6357, GNorm = 1.2441, lr_0 = 9.9479e-04
Validation binary_cross_entropy = 0.109181
Epoch 7506
Loss = 2.6346e-02, PNorm = 719.6722, GNorm = 2.6994, lr_0 = 9.9479e-04
Validation binary_cross_entropy = 0.098186
Epoch 7507
Loss = 3.1904e-02, PNorm = 719.7042, GNorm = 0.3458, lr_0 = 9.9479e-04
Validation binary_cross_entropy = 0.094860
Epoch 7508
Loss = 5.8312e-02, PNorm = 719.7364, GNorm = 3.8311, lr_0 = 9.9479e-04
Validation binary_cross_entropy = 0.101829
Epoch 7509
Loss = 1.1120e-02, PNorm = 719.7843, GNorm = 0.6259, lr_0 = 9.9479e-04
Loss = 4.7758e-02, PNorm = 719.8222, GNorm = 2.3366, lr_0 = 9.9479e-04
Validation binary_cross_entropy = 0.099666
Epoch 7510
Loss = 1.7977e-02, PNorm = 719.8560, GNorm = 1.1598, lr_0 = 9.9479e-04
Validation binary_cross_entropy = 0.094661
Epoch 7511
Loss = 1.4070e-02, PNorm = 719.8874, GNorm = 0.2755, lr_0 = 9.9479e-04
Validation binary_cross_entropy = 0.103347
Epoch 7512
Loss = 1.3945e-02, PNorm = 719.9135, GNorm = 0.3095, lr_0 = 9.9478e-04
Validation binary_cross_entropy = 0.115885
Epoch 7513
Loss = 1.0220e-02, PNorm = 719.9405, GNorm = 3.4251, lr_0 = 9.9478e-04
Validation binary_cross_entropy = 0.131498
Epoch 7514
Loss = 8.0315e-03, PNorm = 719.9612, GNorm = 0.8615, lr_0 = 9.9478e-04
Validation binary_cross_entropy = 0.135885
Epoch 7515
Loss = 8.8532e-02, PNorm = 719.9738, GNorm = 0.0322, lr_0 = 9.9478e-04
Validation binary_cross_entropy = 0.108339
Epoch 7516
Loss = 1.1513e-02, PNorm = 719.9926, GNorm = 1.8108, lr_0 = 9.9478e-04
Validation binary_cross_entropy = 0.096430
Epoch 7517
Loss = 2.9236e-02, PNorm = 720.0161, GNorm = 0.0550, lr_0 = 9.9478e-04
Validation binary_cross_entropy = 0.096910
Epoch 7518
Loss = 7.5431e-02, PNorm = 720.0510, GNorm = 1.6771, lr_0 = 9.9478e-04
Validation binary_cross_entropy = 0.100166
Epoch 7519
Loss = 1.5378e-01, PNorm = 720.0839, GNorm = 5.2228, lr_0 = 9.9478e-04
Loss = 3.3052e-02, PNorm = 720.1175, GNorm = 0.1815, lr_0 = 9.9478e-04
Validation binary_cross_entropy = 0.099008
Epoch 7520
Loss = 1.3523e-02, PNorm = 720.1468, GNorm = 1.4218, lr_0 = 9.9478e-04
Validation binary_cross_entropy = 0.105962
Epoch 7521
Loss = 1.7590e-02, PNorm = 720.1681, GNorm = 0.2299, lr_0 = 9.9478e-04
Validation binary_cross_entropy = 0.103986
Epoch 7522
Loss = 9.1129e-02, PNorm = 720.1887, GNorm = 4.9885, lr_0 = 9.9478e-04
Validation binary_cross_entropy = 0.101586
Epoch 7523
Loss = 2.3829e-02, PNorm = 720.2173, GNorm = 1.0765, lr_0 = 9.9478e-04
Validation binary_cross_entropy = 0.100386
Epoch 7524
Loss = 7.0192e-03, PNorm = 720.2384, GNorm = 0.7923, lr_0 = 9.9478e-04
Validation binary_cross_entropy = 0.094978
Epoch 7525
Loss = 7.7111e-02, PNorm = 720.2577, GNorm = 0.5259, lr_0 = 9.9477e-04
Validation binary_cross_entropy = 0.102272
Epoch 7526
Loss = 2.1688e-02, PNorm = 720.2932, GNorm = 0.8135, lr_0 = 9.9477e-04
Validation binary_cross_entropy = 0.111542
Epoch 7527
Loss = 7.2695e-03, PNorm = 720.3238, GNorm = 0.1215, lr_0 = 9.9477e-04
Validation binary_cross_entropy = 0.114634
Epoch 7528
Loss = 3.8318e-03, PNorm = 720.3479, GNorm = 0.4568, lr_0 = 9.9477e-04
Validation binary_cross_entropy = 0.101008
Epoch 7529
Loss = 1.4224e-02, PNorm = 720.3639, GNorm = 0.9634, lr_0 = 9.9477e-04
Loss = 2.9904e-02, PNorm = 720.3911, GNorm = 0.2137, lr_0 = 9.9477e-04
Validation binary_cross_entropy = 0.098861
Epoch 7530
Loss = 1.0693e-02, PNorm = 720.4150, GNorm = 0.3699, lr_0 = 9.9477e-04
Validation binary_cross_entropy = 0.102013
Epoch 7531
Loss = 1.0746e-02, PNorm = 720.4393, GNorm = 0.5409, lr_0 = 9.9477e-04
Validation binary_cross_entropy = 0.110661
Epoch 7532
Loss = 1.5654e-02, PNorm = 720.4641, GNorm = 0.2257, lr_0 = 9.9477e-04
Validation binary_cross_entropy = 0.117266
Epoch 7533
Loss = 8.4244e-03, PNorm = 720.4794, GNorm = 0.1182, lr_0 = 9.9477e-04
Validation binary_cross_entropy = 0.111704
Epoch 7534
Loss = 3.8740e-02, PNorm = 720.4984, GNorm = 2.3851, lr_0 = 9.9477e-04
Validation binary_cross_entropy = 0.112256
Epoch 7535
Loss = 2.2354e-02, PNorm = 720.5227, GNorm = 1.8255, lr_0 = 9.9477e-04
Validation binary_cross_entropy = 0.112836
Epoch 7536
Loss = 9.2347e-02, PNorm = 720.5506, GNorm = 0.5911, lr_0 = 9.9477e-04
Validation binary_cross_entropy = 0.101131
Epoch 7537
Loss = 4.2330e-02, PNorm = 720.5767, GNorm = 1.1875, lr_0 = 9.9477e-04
Validation binary_cross_entropy = 0.095200
Epoch 7538
Loss = 3.5805e-02, PNorm = 720.6095, GNorm = 2.9764, lr_0 = 9.9477e-04
Validation binary_cross_entropy = 0.102156
Epoch 7539
Loss = 1.0260e-01, PNorm = 720.6640, GNorm = 5.0549, lr_0 = 9.9476e-04
Loss = 2.1971e-02, PNorm = 720.7057, GNorm = 0.3521, lr_0 = 9.9476e-04
Validation binary_cross_entropy = 0.106578
Epoch 7540
Loss = 3.7209e-02, PNorm = 720.7367, GNorm = 0.3405, lr_0 = 9.9476e-04
Validation binary_cross_entropy = 0.106908
Epoch 7541
Loss = 1.5523e-02, PNorm = 720.7682, GNorm = 0.1958, lr_0 = 9.9476e-04
Validation binary_cross_entropy = 0.120594
Epoch 7542
Loss = 1.9801e-02, PNorm = 720.7982, GNorm = 2.2035, lr_0 = 9.9476e-04
Validation binary_cross_entropy = 0.113580
Epoch 7543
Loss = 5.6689e-02, PNorm = 720.8218, GNorm = 1.5745, lr_0 = 9.9476e-04
Validation binary_cross_entropy = 0.099932
Epoch 7544
Loss = 5.3722e-02, PNorm = 720.8564, GNorm = 1.5325, lr_0 = 9.9476e-04
Validation binary_cross_entropy = 0.102148
Epoch 7545
Loss = 2.5658e-02, PNorm = 720.9010, GNorm = 0.1998, lr_0 = 9.9476e-04
Validation binary_cross_entropy = 0.105838
Epoch 7546
Loss = 9.3029e-03, PNorm = 720.9292, GNorm = 0.2775, lr_0 = 9.9476e-04
Validation binary_cross_entropy = 0.103300
Epoch 7547
Loss = 2.1507e-02, PNorm = 720.9492, GNorm = 0.5595, lr_0 = 9.9476e-04
Validation binary_cross_entropy = 0.103516
Epoch 7548
Loss = 3.9718e-03, PNorm = 720.9684, GNorm = 0.1187, lr_0 = 9.9476e-04
Validation binary_cross_entropy = 0.106050
Epoch 7549
Loss = 3.8814e-02, PNorm = 720.9928, GNorm = 3.0780, lr_0 = 9.9476e-04
Loss = 2.0350e-02, PNorm = 721.0187, GNorm = 0.6793, lr_0 = 9.9476e-04
Validation binary_cross_entropy = 0.115984
Epoch 7550
Loss = 1.2026e-02, PNorm = 721.0368, GNorm = 1.3475, lr_0 = 9.9476e-04
Validation binary_cross_entropy = 0.113561
Epoch 7551
Loss = 1.0771e-01, PNorm = 721.0571, GNorm = 1.1872, lr_0 = 9.9475e-04
Validation binary_cross_entropy = 0.092610
Epoch 7552
Loss = 3.7485e-02, PNorm = 721.0969, GNorm = 0.2318, lr_0 = 9.9475e-04
Validation binary_cross_entropy = 0.082878
Epoch 7553
Loss = 3.5314e-02, PNorm = 721.1485, GNorm = 0.1005, lr_0 = 9.9475e-04
Validation binary_cross_entropy = 0.100220
Epoch 7554
Loss = 9.5598e-02, PNorm = 721.1920, GNorm = 2.2198, lr_0 = 9.9475e-04
Validation binary_cross_entropy = 0.093765
Epoch 7555
Loss = 3.1735e-02, PNorm = 721.2212, GNorm = 2.8710, lr_0 = 9.9475e-04
Validation binary_cross_entropy = 0.112393
Epoch 7556
Loss = 4.0679e-02, PNorm = 721.2479, GNorm = 2.8666, lr_0 = 9.9475e-04
Validation binary_cross_entropy = 0.097553
Epoch 7557
Loss = 3.3286e-02, PNorm = 721.2684, GNorm = 1.9515, lr_0 = 9.9475e-04
Validation binary_cross_entropy = 0.088290
Epoch 7558
Loss = 1.0852e-02, PNorm = 721.2935, GNorm = 0.3408, lr_0 = 9.9475e-04
Validation binary_cross_entropy = 0.106353
Epoch 7559
Loss = 8.7037e-03, PNorm = 721.3332, GNorm = 0.6640, lr_0 = 9.9475e-04
Loss = 4.2186e-02, PNorm = 721.3542, GNorm = 0.2045, lr_0 = 9.9475e-04
Validation binary_cross_entropy = 0.101882
Epoch 7560
Loss = 4.2758e-02, PNorm = 721.3782, GNorm = 0.3392, lr_0 = 9.9475e-04
Validation binary_cross_entropy = 0.096895
Epoch 7561
Loss = 4.4091e-02, PNorm = 721.4097, GNorm = 0.5996, lr_0 = 9.9475e-04
Validation binary_cross_entropy = 0.093833
Epoch 7562
Loss = 1.6531e-02, PNorm = 721.4377, GNorm = 0.0795, lr_0 = 9.9475e-04
Validation binary_cross_entropy = 0.127462
Epoch 7563
Loss = 1.6353e-02, PNorm = 721.4686, GNorm = 0.1658, lr_0 = 9.9475e-04
Validation binary_cross_entropy = 0.166378
Epoch 7564
Loss = 4.6368e-02, PNorm = 721.5043, GNorm = 0.1505, lr_0 = 9.9475e-04
Validation binary_cross_entropy = 0.180197
Epoch 7565
Loss = 1.4915e-01, PNorm = 721.5352, GNorm = 9.9373, lr_0 = 9.9474e-04
Validation binary_cross_entropy = 0.112168
Epoch 7566
Loss = 4.8588e-02, PNorm = 721.5679, GNorm = 2.6260, lr_0 = 9.9474e-04
Validation binary_cross_entropy = 0.100641
Epoch 7567
Loss = 1.0583e-02, PNorm = 721.6077, GNorm = 1.2646, lr_0 = 9.9474e-04
Validation binary_cross_entropy = 0.102910
Epoch 7568
Loss = 2.1835e-02, PNorm = 721.6380, GNorm = 0.2882, lr_0 = 9.9474e-04
Validation binary_cross_entropy = 0.102648
Epoch 7569
Loss = 1.8613e-02, PNorm = 721.6590, GNorm = 0.9959, lr_0 = 9.9474e-04
Loss = 2.8884e-02, PNorm = 721.6747, GNorm = 0.1943, lr_0 = 9.9474e-04
Validation binary_cross_entropy = 0.094420
Epoch 7570
Loss = 1.7881e-02, PNorm = 721.6979, GNorm = 1.0690, lr_0 = 9.9474e-04
Validation binary_cross_entropy = 0.101045
Epoch 7571
Loss = 2.8702e-02, PNorm = 721.7190, GNorm = 1.0455, lr_0 = 9.9474e-04
Validation binary_cross_entropy = 0.100730
Epoch 7572
Loss = 1.8332e-02, PNorm = 721.7432, GNorm = 1.4769, lr_0 = 9.9474e-04
Validation binary_cross_entropy = 0.109415
Epoch 7573
Loss = 3.1114e-02, PNorm = 721.7735, GNorm = 0.6562, lr_0 = 9.9474e-04
Validation binary_cross_entropy = 0.103136
Epoch 7574
Loss = 5.9490e-03, PNorm = 721.8085, GNorm = 0.0269, lr_0 = 9.9474e-04
Validation binary_cross_entropy = 0.108235
Epoch 7575
Loss = 6.2104e-03, PNorm = 721.8316, GNorm = 0.2119, lr_0 = 9.9474e-04
Validation binary_cross_entropy = 0.115614
Epoch 7576
Loss = 1.5386e-01, PNorm = 721.8463, GNorm = 9.5082, lr_0 = 9.9474e-04
Validation binary_cross_entropy = 0.089956
Epoch 7577
Loss = 5.1267e-02, PNorm = 721.8699, GNorm = 0.2013, lr_0 = 9.9474e-04
Validation binary_cross_entropy = 0.085666
Epoch 7578
Loss = 2.9552e-02, PNorm = 721.8983, GNorm = 2.4365, lr_0 = 9.9474e-04
Validation binary_cross_entropy = 0.086582
Epoch 7579
Loss = 7.9262e-03, PNorm = 721.9364, GNorm = 0.3738, lr_0 = 9.9473e-04
Loss = 1.7717e-02, PNorm = 721.9681, GNorm = 0.1422, lr_0 = 9.9473e-04
Validation binary_cross_entropy = 0.092415
Epoch 7580
Loss = 1.1458e-01, PNorm = 722.0098, GNorm = 0.0982, lr_0 = 9.9473e-04
Validation binary_cross_entropy = 0.082629
Epoch 7581
Loss = 9.2614e-02, PNorm = 722.0707, GNorm = 3.5771, lr_0 = 9.9473e-04
Validation binary_cross_entropy = 0.128126
Epoch 7582
Loss = 8.8341e-02, PNorm = 722.1200, GNorm = 5.5985, lr_0 = 9.9473e-04
Validation binary_cross_entropy = 0.092145
Epoch 7583
Loss = 4.7180e-02, PNorm = 722.1620, GNorm = 2.2516, lr_0 = 9.9473e-04
Validation binary_cross_entropy = 0.085402
Epoch 7584
Loss = 3.2347e-02, PNorm = 722.2093, GNorm = 0.7400, lr_0 = 9.9473e-04
Validation binary_cross_entropy = 0.095273
Epoch 7585
Loss = 1.7801e-02, PNorm = 722.2507, GNorm = 0.1345, lr_0 = 9.9473e-04
Validation binary_cross_entropy = 0.087472
Epoch 7586
Loss = 1.8545e-02, PNorm = 722.2922, GNorm = 0.1691, lr_0 = 9.9473e-04
Validation binary_cross_entropy = 0.090949
Epoch 7587
Loss = 6.6403e-02, PNorm = 722.3280, GNorm = 0.0549, lr_0 = 9.9473e-04
Validation binary_cross_entropy = 0.093385
Epoch 7588
Loss = 3.6943e-02, PNorm = 722.3627, GNorm = 0.2177, lr_0 = 9.9473e-04
Validation binary_cross_entropy = 0.094323
Epoch 7589
Loss = 6.7721e-03, PNorm = 722.3909, GNorm = 0.3444, lr_0 = 9.9473e-04
Loss = 3.0142e-02, PNorm = 722.4141, GNorm = 0.0959, lr_0 = 9.9473e-04
Validation binary_cross_entropy = 0.097798
Epoch 7590
Loss = 4.7693e-02, PNorm = 722.4386, GNorm = 0.4817, lr_0 = 9.9473e-04
Validation binary_cross_entropy = 0.133661
Epoch 7591
Loss = 1.5058e-02, PNorm = 722.4600, GNorm = 1.3602, lr_0 = 9.9472e-04
Validation binary_cross_entropy = 0.107567
Epoch 7592
Loss = 4.3549e-02, PNorm = 722.4776, GNorm = 2.0919, lr_0 = 9.9472e-04
Validation binary_cross_entropy = 0.100576
Epoch 7593
Loss = 2.1078e-02, PNorm = 722.4971, GNorm = 1.4758, lr_0 = 9.9472e-04
Validation binary_cross_entropy = 0.093623
Epoch 7594
Loss = 1.1672e-01, PNorm = 722.5269, GNorm = 1.3955, lr_0 = 9.9472e-04
Validation binary_cross_entropy = 0.093003
Epoch 7595
Loss = 2.9248e-02, PNorm = 722.5702, GNorm = 2.1592, lr_0 = 9.9472e-04
Validation binary_cross_entropy = 0.094361
Epoch 7596
Loss = 4.1454e-02, PNorm = 722.6035, GNorm = 3.3307, lr_0 = 9.9472e-04
Validation binary_cross_entropy = 0.096145
Epoch 7597
Loss = 4.0150e-02, PNorm = 722.6278, GNorm = 3.1908, lr_0 = 9.9472e-04
Validation binary_cross_entropy = 0.100614
Epoch 7598
Loss = 2.9912e-02, PNorm = 722.6641, GNorm = 1.0481, lr_0 = 9.9472e-04
Validation binary_cross_entropy = 0.118548
Epoch 7599
Loss = 1.5570e-03, PNorm = 722.6961, GNorm = 0.1100, lr_0 = 9.9472e-04
Loss = 4.1187e-02, PNorm = 722.7203, GNorm = 0.6750, lr_0 = 9.9472e-04
Validation binary_cross_entropy = 0.115767
Epoch 7600
Loss = 1.1557e-02, PNorm = 722.7437, GNorm = 0.1259, lr_0 = 9.9472e-04
Validation binary_cross_entropy = 0.118508
Epoch 7601
Loss = 9.0304e-02, PNorm = 722.7709, GNorm = 1.7703, lr_0 = 9.9472e-04
Validation binary_cross_entropy = 0.092867
Epoch 7602
Loss = 5.7232e-02, PNorm = 722.8028, GNorm = 1.7456, lr_0 = 9.9472e-04
Validation binary_cross_entropy = 0.083146
Epoch 7603
Loss = 1.0238e-02, PNorm = 722.8271, GNorm = 0.6488, lr_0 = 9.9472e-04
Validation binary_cross_entropy = 0.082629
Epoch 7604
Loss = 1.4790e-02, PNorm = 722.8575, GNorm = 0.0967, lr_0 = 9.9472e-04
Validation binary_cross_entropy = 0.105361
Epoch 7605
Loss = 1.6362e-02, PNorm = 722.8844, GNorm = 1.6022, lr_0 = 9.9471e-04
Validation binary_cross_entropy = 0.107795
Epoch 7606
Loss = 3.7980e-02, PNorm = 722.8967, GNorm = 9.4005, lr_0 = 9.9471e-04
Validation binary_cross_entropy = 0.101272
Epoch 7607
Loss = 1.2427e-02, PNorm = 722.9309, GNorm = 1.0990, lr_0 = 9.9471e-04
Validation binary_cross_entropy = 0.107420
Epoch 7608
Loss = 2.8754e-02, PNorm = 722.9523, GNorm = 1.2568, lr_0 = 9.9471e-04
Validation binary_cross_entropy = 0.089514
Epoch 7609
Loss = 1.3180e-01, PNorm = 722.9682, GNorm = 5.9654, lr_0 = 9.9471e-04
Loss = 4.8225e-02, PNorm = 723.0364, GNorm = 0.0618, lr_0 = 9.9471e-04
Validation binary_cross_entropy = 0.101681
Epoch 7610
Loss = 3.4216e-02, PNorm = 723.0819, GNorm = 0.8939, lr_0 = 9.9471e-04
Validation binary_cross_entropy = 0.103045
Epoch 7611
Loss = 4.5967e-02, PNorm = 723.1163, GNorm = 0.2333, lr_0 = 9.9471e-04
Validation binary_cross_entropy = 0.090555
Epoch 7612
Loss = 1.1988e-02, PNorm = 723.1562, GNorm = 0.0973, lr_0 = 9.9471e-04
Validation binary_cross_entropy = 0.090414
Epoch 7613
Loss = 7.0010e-02, PNorm = 723.1967, GNorm = 2.6728, lr_0 = 9.9471e-04
Validation binary_cross_entropy = 0.092038
Epoch 7614
Loss = 5.6267e-02, PNorm = 723.2262, GNorm = 3.0940, lr_0 = 9.9471e-04
Validation binary_cross_entropy = 0.083719
Epoch 7615
Loss = 2.0672e-02, PNorm = 723.2526, GNorm = 0.1181, lr_0 = 9.9471e-04
Validation binary_cross_entropy = 0.084071
Epoch 7616
Loss = 1.0222e-01, PNorm = 723.2869, GNorm = 1.9295, lr_0 = 9.9471e-04
Validation binary_cross_entropy = 0.097996
Epoch 7617
Loss = 3.4460e-03, PNorm = 723.3345, GNorm = 0.0467, lr_0 = 9.9471e-04
Validation binary_cross_entropy = 0.103063
Epoch 7618
Loss = 3.1346e-02, PNorm = 723.3645, GNorm = 1.9707, lr_0 = 9.9470e-04
Validation binary_cross_entropy = 0.092438
Epoch 7619
Loss = 3.5561e-03, PNorm = 723.3861, GNorm = 0.2933, lr_0 = 9.9470e-04
Loss = 3.5858e-02, PNorm = 723.4135, GNorm = 0.9476, lr_0 = 9.9470e-04
Validation binary_cross_entropy = 0.094252
Epoch 7620
Loss = 2.4522e-02, PNorm = 723.4454, GNorm = 0.9169, lr_0 = 9.9470e-04
Validation binary_cross_entropy = 0.093821
Epoch 7621
Loss = 2.6543e-02, PNorm = 723.4733, GNorm = 1.4130, lr_0 = 9.9470e-04
Validation binary_cross_entropy = 0.093307
Epoch 7622
Loss = 4.6496e-02, PNorm = 723.5199, GNorm = 1.0691, lr_0 = 9.9470e-04
Validation binary_cross_entropy = 0.113917
Epoch 7623
Loss = 2.5147e-02, PNorm = 723.5498, GNorm = 2.1268, lr_0 = 9.9470e-04
Validation binary_cross_entropy = 0.111637
Epoch 7624
Loss = 1.8902e-02, PNorm = 723.5687, GNorm = 0.0728, lr_0 = 9.9470e-04
Validation binary_cross_entropy = 0.093541
Epoch 7625
Loss = 2.0442e-02, PNorm = 723.5895, GNorm = 0.0220, lr_0 = 9.9470e-04
Validation binary_cross_entropy = 0.085461
Epoch 7626
Loss = 3.4184e-02, PNorm = 723.6104, GNorm = 0.3936, lr_0 = 9.9470e-04
Validation binary_cross_entropy = 0.081886
Epoch 7627
Loss = 1.1004e-02, PNorm = 723.6353, GNorm = 0.1458, lr_0 = 9.9470e-04
Validation binary_cross_entropy = 0.093602
Epoch 7628
Loss = 2.0557e-02, PNorm = 723.6683, GNorm = 0.5149, lr_0 = 9.9470e-04
Validation binary_cross_entropy = 0.108772
Epoch 7629
Loss = 1.0766e-03, PNorm = 723.7177, GNorm = 0.0588, lr_0 = 9.9470e-04
Loss = 2.1327e-02, PNorm = 723.7499, GNorm = 0.1896, lr_0 = 9.9470e-04
Validation binary_cross_entropy = 0.124134
Epoch 7630
Loss = 6.7203e-02, PNorm = 723.7671, GNorm = 0.1968, lr_0 = 9.9470e-04
Validation binary_cross_entropy = 0.097698
Epoch 7631
Loss = 4.9497e-02, PNorm = 723.7994, GNorm = 5.4457, lr_0 = 9.9469e-04
Validation binary_cross_entropy = 0.097779
Epoch 7632
Loss = 6.1235e-02, PNorm = 723.8551, GNorm = 1.4164, lr_0 = 9.9469e-04
Validation binary_cross_entropy = 0.101537
Epoch 7633
Loss = 3.8881e-02, PNorm = 723.8920, GNorm = 8.1944, lr_0 = 9.9469e-04
Validation binary_cross_entropy = 0.108586
Epoch 7634
Loss = 3.6148e-02, PNorm = 723.9429, GNorm = 0.4246, lr_0 = 9.9469e-04
Validation binary_cross_entropy = 0.195095
Epoch 7635
Loss = 1.3697e-01, PNorm = 724.0074, GNorm = 3.6589, lr_0 = 9.9469e-04
Validation binary_cross_entropy = 0.158359
Epoch 7636
Loss = 4.2117e-02, PNorm = 724.0480, GNorm = 0.3415, lr_0 = 9.9469e-04
Validation binary_cross_entropy = 0.128140
Epoch 7637
Loss = 9.7501e-03, PNorm = 724.0902, GNorm = 0.6486, lr_0 = 9.9469e-04
Validation binary_cross_entropy = 0.178244
Epoch 7638
Loss = 1.1396e-02, PNorm = 724.1341, GNorm = 0.2335, lr_0 = 9.9469e-04
Validation binary_cross_entropy = 0.177824
Epoch 7639
Loss = 1.7688e-01, PNorm = 724.1768, GNorm = 6.3128, lr_0 = 9.9469e-04
Loss = 6.0507e-02, PNorm = 724.2194, GNorm = 2.5915, lr_0 = 9.9469e-04
Validation binary_cross_entropy = 0.154005
Epoch 7640
Loss = 2.7151e-02, PNorm = 724.2601, GNorm = 0.2197, lr_0 = 9.9469e-04
Validation binary_cross_entropy = 0.122791
Epoch 7641
Loss = 3.4269e-02, PNorm = 724.2914, GNorm = 0.5549, lr_0 = 9.9469e-04
Validation binary_cross_entropy = 0.394590
Epoch 7642
Loss = 1.5600e+00, PNorm = 724.3406, GNorm = 2.1346, lr_0 = 9.9469e-04
Validation binary_cross_entropy = 0.097042
Epoch 7643
Loss = 2.1119e-01, PNorm = 724.5454, GNorm = 9.7402, lr_0 = 9.9469e-04
Validation binary_cross_entropy = 0.271964
Epoch 7644
Loss = 1.5509e-01, PNorm = 724.6900, GNorm = 2.0053, lr_0 = 9.9469e-04
Validation binary_cross_entropy = 0.108463
Epoch 7645
Loss = 1.9303e-01, PNorm = 724.7986, GNorm = 3.6499, lr_0 = 9.9468e-04
Validation binary_cross_entropy = 0.125017
Epoch 7646
Loss = 9.4377e-02, PNorm = 724.9160, GNorm = 2.4027, lr_0 = 9.9468e-04
Validation binary_cross_entropy = 0.108367
Epoch 7647
Loss = 6.0282e-02, PNorm = 725.0056, GNorm = 3.0676, lr_0 = 9.9468e-04
Validation binary_cross_entropy = 0.143207
Epoch 7648
Loss = 1.4661e-01, PNorm = 725.0712, GNorm = 6.0890, lr_0 = 9.9468e-04
Validation binary_cross_entropy = 0.135453
Epoch 7649
Loss = 2.6400e-03, PNorm = 725.1157, GNorm = 0.1560, lr_0 = 9.9468e-04
Loss = 6.8907e-02, PNorm = 725.1551, GNorm = 0.7896, lr_0 = 9.9468e-04
Validation binary_cross_entropy = 0.111101
Epoch 7650
Loss = 3.4660e-02, PNorm = 725.2009, GNorm = 1.6809, lr_0 = 9.9468e-04
Validation binary_cross_entropy = 0.099030
Epoch 7651
Loss = 8.3435e-02, PNorm = 725.2377, GNorm = 2.3802, lr_0 = 9.9468e-04
Validation binary_cross_entropy = 0.096853
Epoch 7652
Loss = 6.2142e-02, PNorm = 725.2801, GNorm = 1.8951, lr_0 = 9.9468e-04
Validation binary_cross_entropy = 0.102610
Epoch 7653
Loss = 1.0005e-01, PNorm = 725.3281, GNorm = 1.3783, lr_0 = 9.9468e-04
Validation binary_cross_entropy = 0.092276
Epoch 7654
Loss = 4.2296e-02, PNorm = 725.3939, GNorm = 0.8717, lr_0 = 9.9468e-04
Validation binary_cross_entropy = 0.127181
Epoch 7655
Loss = 4.6165e-02, PNorm = 725.4369, GNorm = 1.9565, lr_0 = 9.9468e-04
Validation binary_cross_entropy = 0.154899
Epoch 7656
Loss = 4.1357e-02, PNorm = 725.4754, GNorm = 3.4512, lr_0 = 9.9468e-04
Validation binary_cross_entropy = 0.120224
Epoch 7657
Loss = 1.8126e-02, PNorm = 725.5023, GNorm = 1.0501, lr_0 = 9.9468e-04
Validation binary_cross_entropy = 0.104102
Epoch 7658
Loss = 3.2560e-02, PNorm = 725.5369, GNorm = 0.8528, lr_0 = 9.9467e-04
Validation binary_cross_entropy = 0.159181
Epoch 7659
Loss = 7.1565e-02, PNorm = 725.5786, GNorm = 2.7916, lr_0 = 9.9467e-04
Loss = 4.8275e-02, PNorm = 725.6152, GNorm = 4.8733, lr_0 = 9.9467e-04
Validation binary_cross_entropy = 0.117341
Epoch 7660
Loss = 3.2691e-02, PNorm = 725.6667, GNorm = 1.8336, lr_0 = 9.9467e-04
Validation binary_cross_entropy = 0.140838
Epoch 7661
Loss = 8.5499e-02, PNorm = 725.7113, GNorm = 0.2449, lr_0 = 9.9467e-04
Validation binary_cross_entropy = 0.104149
Epoch 7662
Loss = 8.7127e-02, PNorm = 725.7794, GNorm = 0.3705, lr_0 = 9.9467e-04
Validation binary_cross_entropy = 0.143470
Epoch 7663
Loss = 3.1274e-02, PNorm = 725.8307, GNorm = 1.0251, lr_0 = 9.9467e-04
Validation binary_cross_entropy = 0.117944
Epoch 7664
Loss = 5.2851e-02, PNorm = 725.8632, GNorm = 1.9478, lr_0 = 9.9467e-04
Validation binary_cross_entropy = 0.098121
Epoch 7665
Loss = 9.8623e-02, PNorm = 725.9041, GNorm = 5.0868, lr_0 = 9.9467e-04
Validation binary_cross_entropy = 0.097494
Epoch 7666
Loss = 2.4988e-02, PNorm = 725.9449, GNorm = 1.9686, lr_0 = 9.9467e-04
Validation binary_cross_entropy = 0.114197
Epoch 7667
Loss = 4.5443e-02, PNorm = 725.9839, GNorm = 1.7328, lr_0 = 9.9467e-04
Validation binary_cross_entropy = 0.118712
Epoch 7668
Loss = 1.2877e-02, PNorm = 726.0109, GNorm = 0.2271, lr_0 = 9.9467e-04
Validation binary_cross_entropy = 0.151849
Epoch 7669
Loss = 1.1014e-01, PNorm = 726.0489, GNorm = 1.4554, lr_0 = 9.9467e-04
Loss = 4.8772e-02, PNorm = 726.0766, GNorm = 0.8405, lr_0 = 9.9467e-04
Validation binary_cross_entropy = 0.132257
Epoch 7670
Loss = 1.3453e-01, PNorm = 726.1115, GNorm = 0.1960, lr_0 = 9.9467e-04
Validation binary_cross_entropy = 0.089536
Epoch 7671
Loss = 2.7233e-02, PNorm = 726.1627, GNorm = 0.3122, lr_0 = 9.9466e-04
Validation binary_cross_entropy = 0.108466
Epoch 7672
Loss = 2.5743e-02, PNorm = 726.2077, GNorm = 2.1277, lr_0 = 9.9466e-04
Validation binary_cross_entropy = 0.094156
Epoch 7673
Loss = 6.6911e-02, PNorm = 726.2403, GNorm = 3.6891, lr_0 = 9.9466e-04
Validation binary_cross_entropy = 0.096367
Epoch 7674
Loss = 2.5295e-02, PNorm = 726.2789, GNorm = 2.1401, lr_0 = 9.9466e-04
Validation binary_cross_entropy = 0.108071
Epoch 7675
Loss = 5.1515e-02, PNorm = 726.3148, GNorm = 1.5071, lr_0 = 9.9466e-04
Validation binary_cross_entropy = 0.106439
Epoch 7676
Loss = 9.8765e-03, PNorm = 726.3468, GNorm = 0.8836, lr_0 = 9.9466e-04
Validation binary_cross_entropy = 0.100899
Epoch 7677
Loss = 1.6172e-02, PNorm = 726.3734, GNorm = 2.4078, lr_0 = 9.9466e-04
Validation binary_cross_entropy = 0.102498
Epoch 7678
Loss = 6.6073e-02, PNorm = 726.3984, GNorm = 0.2592, lr_0 = 9.9466e-04
Validation binary_cross_entropy = 0.097596
Epoch 7679
Loss = 2.5323e-03, PNorm = 726.4421, GNorm = 0.1806, lr_0 = 9.9466e-04
Loss = 2.7316e-02, PNorm = 726.4768, GNorm = 1.4055, lr_0 = 9.9466e-04
Validation binary_cross_entropy = 0.101176
Epoch 7680
Loss = 3.2018e-02, PNorm = 726.4965, GNorm = 1.1123, lr_0 = 9.9466e-04
Validation binary_cross_entropy = 0.093924
Epoch 7681
Loss = 5.3721e-02, PNorm = 726.5280, GNorm = 4.9324, lr_0 = 9.9466e-04
Validation binary_cross_entropy = 0.093354
Epoch 7682
Loss = 3.9169e-02, PNorm = 726.5744, GNorm = 9.6006, lr_0 = 9.9466e-04
Validation binary_cross_entropy = 0.104141
Epoch 7683
Loss = 3.5020e-02, PNorm = 726.6136, GNorm = 0.2240, lr_0 = 9.9466e-04
Validation binary_cross_entropy = 0.108096
Epoch 7684
Loss = 5.7869e-02, PNorm = 726.6587, GNorm = 1.7960, lr_0 = 9.9465e-04
Validation binary_cross_entropy = 0.132485
Epoch 7685
Loss = 3.8952e-02, PNorm = 726.6947, GNorm = 0.4499, lr_0 = 9.9465e-04
Validation binary_cross_entropy = 0.116594
Epoch 7686
Loss = 3.2658e-02, PNorm = 726.7219, GNorm = 0.1103, lr_0 = 9.9465e-04
Validation binary_cross_entropy = 0.094994
Epoch 7687
Loss = 2.1663e-02, PNorm = 726.7447, GNorm = 0.2089, lr_0 = 9.9465e-04
Validation binary_cross_entropy = 0.086343
Epoch 7688
Loss = 4.2401e-02, PNorm = 726.7914, GNorm = 1.5636, lr_0 = 9.9465e-04
Validation binary_cross_entropy = 0.096488
Epoch 7689
Loss = 2.4498e-02, PNorm = 726.8344, GNorm = 2.7637, lr_0 = 9.9465e-04
Loss = 4.0059e-02, PNorm = 726.8646, GNorm = 0.2176, lr_0 = 9.9465e-04
Validation binary_cross_entropy = 0.094155
Epoch 7690
Loss = 9.9557e-03, PNorm = 726.8966, GNorm = 0.0242, lr_0 = 9.9465e-04
Validation binary_cross_entropy = 0.172764
Epoch 7691
Loss = 3.0561e-02, PNorm = 726.9184, GNorm = 1.3751, lr_0 = 9.9465e-04
Validation binary_cross_entropy = 0.111468
Epoch 7692
Loss = 4.8127e-02, PNorm = 726.9313, GNorm = 6.9034, lr_0 = 9.9465e-04
Validation binary_cross_entropy = 0.102302
Epoch 7693
Loss = 2.2795e-02, PNorm = 726.9602, GNorm = 0.3532, lr_0 = 9.9465e-04
Validation binary_cross_entropy = 0.118105
Epoch 7694
Loss = 2.9585e-02, PNorm = 726.9903, GNorm = 1.3884, lr_0 = 9.9465e-04
Validation binary_cross_entropy = 0.129457
Epoch 7695
Loss = 1.0356e-01, PNorm = 727.0397, GNorm = 9.4989, lr_0 = 9.9465e-04
Validation binary_cross_entropy = 0.164804
Epoch 7696
Loss = 1.5380e-02, PNorm = 727.1064, GNorm = 0.2622, lr_0 = 9.9465e-04
Validation binary_cross_entropy = 0.135059
Epoch 7697
Loss = 1.6031e-02, PNorm = 727.1519, GNorm = 1.5675, lr_0 = 9.9465e-04
Validation binary_cross_entropy = 0.140030
Epoch 7698
Loss = 3.0319e-03, PNorm = 727.1891, GNorm = 0.1338, lr_0 = 9.9464e-04
Validation binary_cross_entropy = 0.121200
Epoch 7699
Loss = 1.6427e-02, PNorm = 727.2250, GNorm = 0.4892, lr_0 = 9.9464e-04
Loss = 7.2553e-02, PNorm = 727.2948, GNorm = 2.6940, lr_0 = 9.9464e-04
Validation binary_cross_entropy = 0.203137
Epoch 7700
Loss = 1.0789e-01, PNorm = 727.3533, GNorm = 2.6138, lr_0 = 9.9464e-04
Validation binary_cross_entropy = 0.127324
Epoch 7701
Loss = 7.5656e-02, PNorm = 727.4359, GNorm = 0.5947, lr_0 = 9.9464e-04
Validation binary_cross_entropy = 0.167475
Epoch 7702
Loss = 5.8342e-02, PNorm = 727.5111, GNorm = 1.0671, lr_0 = 9.9464e-04
Validation binary_cross_entropy = 0.137744
Epoch 7703
Loss = 2.2509e-02, PNorm = 727.5619, GNorm = 2.3917, lr_0 = 9.9464e-04
Validation binary_cross_entropy = 0.098448
Epoch 7704
Loss = 8.7829e-02, PNorm = 727.6389, GNorm = 0.6927, lr_0 = 9.9464e-04
Validation binary_cross_entropy = 0.118252
Epoch 7705
Loss = 1.5852e-02, PNorm = 727.7166, GNorm = 0.8829, lr_0 = 9.9464e-04
Validation binary_cross_entropy = 0.165900
Epoch 7706
Loss = 8.7699e-02, PNorm = 727.7611, GNorm = 1.4359, lr_0 = 9.9464e-04
Validation binary_cross_entropy = 0.089119
Epoch 7707
Loss = 1.6351e-02, PNorm = 727.8118, GNorm = 0.6404, lr_0 = 9.9464e-04
Validation binary_cross_entropy = 0.097109
Epoch 7708
Loss = 5.7134e-02, PNorm = 727.8722, GNorm = 2.1903, lr_0 = 9.9464e-04
Validation binary_cross_entropy = 0.093975
Epoch 7709
Loss = 4.6529e-02, PNorm = 727.9195, GNorm = 0.6095, lr_0 = 9.9464e-04
Loss = 4.7566e-02, PNorm = 727.9521, GNorm = 0.5132, lr_0 = 9.9464e-04
Validation binary_cross_entropy = 0.085699
Epoch 7710
Loss = 3.4140e-02, PNorm = 727.9872, GNorm = 0.5021, lr_0 = 9.9464e-04
Validation binary_cross_entropy = 0.082348
Epoch 7711
Loss = 3.8494e-02, PNorm = 728.0320, GNorm = 1.9647, lr_0 = 9.9463e-04
Validation binary_cross_entropy = 0.081413
Epoch 7712
Loss = 2.1455e-02, PNorm = 728.0811, GNorm = 2.0164, lr_0 = 9.9463e-04
Validation binary_cross_entropy = 0.099878
Epoch 7713
Loss = 1.4215e-02, PNorm = 728.1300, GNorm = 0.2833, lr_0 = 9.9463e-04
Validation binary_cross_entropy = 0.125514
Epoch 7714
Loss = 3.3066e-02, PNorm = 728.1675, GNorm = 0.6601, lr_0 = 9.9463e-04
Validation binary_cross_entropy = 0.123063
Epoch 7715
Loss = 1.1027e-02, PNorm = 728.1978, GNorm = 0.5306, lr_0 = 9.9463e-04
Validation binary_cross_entropy = 0.109873
Epoch 7716
Loss = 6.7813e-03, PNorm = 728.2339, GNorm = 0.2639, lr_0 = 9.9463e-04
Validation binary_cross_entropy = 0.110659
Epoch 7717
Loss = 1.0215e-02, PNorm = 728.2768, GNorm = 1.3829, lr_0 = 9.9463e-04
Validation binary_cross_entropy = 0.126444
Epoch 7718
Loss = 3.0111e-02, PNorm = 728.3230, GNorm = 1.4717, lr_0 = 9.9463e-04
Validation binary_cross_entropy = 0.117740
Epoch 7719
Loss = 2.2867e-03, PNorm = 728.3587, GNorm = 0.1414, lr_0 = 9.9463e-04
Loss = 7.3580e-02, PNorm = 728.3910, GNorm = 0.1385, lr_0 = 9.9463e-04
Validation binary_cross_entropy = 0.103424
Epoch 7720
Loss = 3.0180e-02, PNorm = 728.4256, GNorm = 0.3273, lr_0 = 9.9463e-04
Validation binary_cross_entropy = 0.097731
Epoch 7721
Loss = 2.5850e-02, PNorm = 728.4726, GNorm = 0.2433, lr_0 = 9.9463e-04
Validation binary_cross_entropy = 0.125673
Epoch 7722
Loss = 1.2189e-02, PNorm = 728.5177, GNorm = 0.2782, lr_0 = 9.9463e-04
Validation binary_cross_entropy = 0.146027
Epoch 7723
Loss = 4.9978e-02, PNorm = 728.5610, GNorm = 0.4377, lr_0 = 9.9463e-04
Validation binary_cross_entropy = 0.139607
Epoch 7724
Loss = 5.4267e-02, PNorm = 728.6578, GNorm = 3.0132, lr_0 = 9.9462e-04
Validation binary_cross_entropy = 0.122959
Epoch 7725
Loss = 4.9477e-02, PNorm = 728.7241, GNorm = 1.6414, lr_0 = 9.9462e-04
Validation binary_cross_entropy = 0.084841
Epoch 7726
Loss = 4.2521e-02, PNorm = 728.8039, GNorm = 0.2135, lr_0 = 9.9462e-04
Validation binary_cross_entropy = 0.116781
Epoch 7727
Loss = 4.1042e-02, PNorm = 728.8684, GNorm = 2.8864, lr_0 = 9.9462e-04
Validation binary_cross_entropy = 0.080601
Epoch 7728
Loss = 2.8858e-02, PNorm = 728.9049, GNorm = 2.2313, lr_0 = 9.9462e-04
Validation binary_cross_entropy = 0.079598
Epoch 7729
Loss = 3.5258e-02, PNorm = 728.9458, GNorm = 1.8323, lr_0 = 9.9462e-04
Loss = 6.0779e-02, PNorm = 728.9767, GNorm = 5.4004, lr_0 = 9.9462e-04
Validation binary_cross_entropy = 0.075937
Epoch 7730
Loss = 4.1893e-02, PNorm = 729.0209, GNorm = 0.9726, lr_0 = 9.9462e-04
Validation binary_cross_entropy = 0.078731
Epoch 7731
Loss = 3.6697e-02, PNorm = 729.0611, GNorm = 0.1928, lr_0 = 9.9462e-04
Validation binary_cross_entropy = 0.081471
Epoch 7732
Loss = 2.0695e-02, PNorm = 729.0872, GNorm = 0.3175, lr_0 = 9.9462e-04
Validation binary_cross_entropy = 0.080367
Epoch 7733
Loss = 2.7416e-02, PNorm = 729.1169, GNorm = 0.1560, lr_0 = 9.9462e-04
Validation binary_cross_entropy = 0.081101
Epoch 7734
Loss = 5.4725e-03, PNorm = 729.1578, GNorm = 0.0949, lr_0 = 9.9462e-04
Validation binary_cross_entropy = 0.099598
Epoch 7735
Loss = 5.5459e-03, PNorm = 729.1846, GNorm = 0.2109, lr_0 = 9.9462e-04
Validation binary_cross_entropy = 0.099938
Epoch 7736
Loss = 4.9863e-02, PNorm = 729.2082, GNorm = 0.3022, lr_0 = 9.9462e-04
Validation binary_cross_entropy = 0.081876
Epoch 7737
Loss = 1.0004e-01, PNorm = 729.2394, GNorm = 19.2807, lr_0 = 9.9462e-04
Validation binary_cross_entropy = 0.073861
Epoch 7738
Loss = 2.5110e-02, PNorm = 729.2862, GNorm = 0.7943, lr_0 = 9.9461e-04
Validation binary_cross_entropy = 0.079334
Epoch 7739
Loss = 1.6947e-02, PNorm = 729.3411, GNorm = 0.5073, lr_0 = 9.9461e-04
Loss = 5.9524e-02, PNorm = 729.3828, GNorm = 2.2751, lr_0 = 9.9461e-04
Validation binary_cross_entropy = 0.086112
Epoch 7740
Loss = 2.0749e-02, PNorm = 729.4157, GNorm = 1.0385, lr_0 = 9.9461e-04
Validation binary_cross_entropy = 0.084548
Epoch 7741
Loss = 2.4925e-02, PNorm = 729.4520, GNorm = 3.5942, lr_0 = 9.9461e-04
Validation binary_cross_entropy = 0.075082
Epoch 7742
Loss = 1.8951e-02, PNorm = 729.4885, GNorm = 2.3411, lr_0 = 9.9461e-04
Validation binary_cross_entropy = 0.075850
Epoch 7743
Loss = 5.6624e-02, PNorm = 729.5165, GNorm = 6.3017, lr_0 = 9.9461e-04
Validation binary_cross_entropy = 0.075829
Epoch 7744
Loss = 1.4680e-02, PNorm = 729.5660, GNorm = 1.2535, lr_0 = 9.9461e-04
Validation binary_cross_entropy = 0.087163
Epoch 7745
Loss = 5.3069e-03, PNorm = 729.5998, GNorm = 0.3352, lr_0 = 9.9461e-04
Validation binary_cross_entropy = 0.079634
Epoch 7746
Loss = 1.8397e-02, PNorm = 729.6220, GNorm = 0.2087, lr_0 = 9.9461e-04
Validation binary_cross_entropy = 0.074713
Epoch 7747
Loss = 7.0481e-03, PNorm = 729.6558, GNorm = 0.2204, lr_0 = 9.9461e-04
Validation binary_cross_entropy = 0.082382
Epoch 7748
Loss = 3.4270e-03, PNorm = 729.6949, GNorm = 0.3491, lr_0 = 9.9461e-04
Validation binary_cross_entropy = 0.098912
Epoch 7749
Loss = 1.1024e-01, PNorm = 729.7217, GNorm = 3.0930, lr_0 = 9.9461e-04
Loss = 4.5250e-02, PNorm = 729.7424, GNorm = 2.8156, lr_0 = 9.9461e-04
Validation binary_cross_entropy = 0.074560
Epoch 7750
Loss = 6.4423e-02, PNorm = 729.7810, GNorm = 7.1058, lr_0 = 9.9460e-04
Validation binary_cross_entropy = 0.082739
Epoch 7751
Loss = 2.7109e-02, PNorm = 729.8200, GNorm = 2.0981, lr_0 = 9.9460e-04
Validation binary_cross_entropy = 0.105481
Epoch 7752
Loss = 5.8162e-02, PNorm = 729.8441, GNorm = 0.8415, lr_0 = 9.9460e-04
Validation binary_cross_entropy = 0.096532
Epoch 7753
Loss = 1.6893e-02, PNorm = 729.8635, GNorm = 2.6640, lr_0 = 9.9460e-04
Validation binary_cross_entropy = 0.084757
Epoch 7754
Loss = 2.8704e-02, PNorm = 729.9037, GNorm = 0.0870, lr_0 = 9.9460e-04
Validation binary_cross_entropy = 0.086231
Epoch 7755
Loss = 3.0153e-02, PNorm = 729.9324, GNorm = 0.9922, lr_0 = 9.9460e-04
Validation binary_cross_entropy = 0.086324
Epoch 7756
Loss = 3.9018e-02, PNorm = 729.9582, GNorm = 0.1831, lr_0 = 9.9460e-04
Validation binary_cross_entropy = 0.113838
Epoch 7757
Loss = 2.5002e-02, PNorm = 729.9871, GNorm = 0.1574, lr_0 = 9.9460e-04
Validation binary_cross_entropy = 0.097930
Epoch 7758
Loss = 1.5173e-02, PNorm = 730.0245, GNorm = 1.2001, lr_0 = 9.9460e-04
Validation binary_cross_entropy = 0.107081
Epoch 7759
Loss = 1.4539e-02, PNorm = 730.0605, GNorm = 1.1968, lr_0 = 9.9460e-04
Loss = 1.7117e-02, PNorm = 730.0847, GNorm = 0.7406, lr_0 = 9.9460e-04
Validation binary_cross_entropy = 0.147666
Epoch 7760
Loss = 9.8247e-03, PNorm = 730.1060, GNorm = 0.1799, lr_0 = 9.9460e-04
Validation binary_cross_entropy = 0.212217
Epoch 7761
Loss = 4.5997e-02, PNorm = 730.1427, GNorm = 0.2296, lr_0 = 9.9460e-04
Validation binary_cross_entropy = 0.138305
Epoch 7762
Loss = 5.6712e-02, PNorm = 730.1998, GNorm = 0.2780, lr_0 = 9.9460e-04
Validation binary_cross_entropy = 0.117543
Epoch 7763
Loss = 3.0135e-02, PNorm = 730.2540, GNorm = 3.2983, lr_0 = 9.9460e-04
Validation binary_cross_entropy = 0.133297
Epoch 7764
Loss = 7.6891e-02, PNorm = 730.3033, GNorm = 8.1005, lr_0 = 9.9459e-04
Validation binary_cross_entropy = 0.113697
Epoch 7765
Loss = 1.3008e-02, PNorm = 730.3305, GNorm = 0.8463, lr_0 = 9.9459e-04
Validation binary_cross_entropy = 0.098953
Epoch 7766
Loss = 3.8119e-02, PNorm = 730.3522, GNorm = 0.1397, lr_0 = 9.9459e-04
Validation binary_cross_entropy = 0.105196
Epoch 7767
Loss = 2.4322e-02, PNorm = 730.3878, GNorm = 0.9101, lr_0 = 9.9459e-04
Validation binary_cross_entropy = 0.120732
Epoch 7768
Loss = 2.3287e-02, PNorm = 730.4110, GNorm = 0.9830, lr_0 = 9.9459e-04
Validation binary_cross_entropy = 0.108195
Epoch 7769
Loss = 9.5626e-03, PNorm = 730.4268, GNorm = 1.1747, lr_0 = 9.9459e-04
Loss = 1.7980e-02, PNorm = 730.4435, GNorm = 0.1020, lr_0 = 9.9459e-04
Validation binary_cross_entropy = 0.117626
Epoch 7770
Loss = 1.3923e-02, PNorm = 730.4638, GNorm = 4.3750, lr_0 = 9.9459e-04
Validation binary_cross_entropy = 0.123045
Epoch 7771
Loss = 8.2194e-02, PNorm = 730.4922, GNorm = 0.6438, lr_0 = 9.9459e-04
Validation binary_cross_entropy = 0.107194
Epoch 7772
Loss = 3.1121e-02, PNorm = 730.5424, GNorm = 5.5790, lr_0 = 9.9459e-04
Validation binary_cross_entropy = 0.096335
Epoch 7773
Loss = 1.8769e-02, PNorm = 730.5945, GNorm = 0.2195, lr_0 = 9.9459e-04
Validation binary_cross_entropy = 0.092163
Epoch 7774
Loss = 2.9255e-02, PNorm = 730.6341, GNorm = 0.1436, lr_0 = 9.9459e-04
Validation binary_cross_entropy = 0.087629
Epoch 7775
Loss = 6.2717e-02, PNorm = 730.6565, GNorm = 0.1888, lr_0 = 9.9459e-04
Validation binary_cross_entropy = 0.105564
Epoch 7776
Loss = 5.3282e-02, PNorm = 730.6944, GNorm = 0.2028, lr_0 = 9.9459e-04
Validation binary_cross_entropy = 0.103796
Epoch 7777
Loss = 4.4760e-02, PNorm = 730.7380, GNorm = 0.2208, lr_0 = 9.9459e-04
Validation binary_cross_entropy = 0.103562
Epoch 7778
Loss = 1.1949e-02, PNorm = 730.7932, GNorm = 1.0107, lr_0 = 9.9458e-04
Validation binary_cross_entropy = 0.106213
Epoch 7779
Loss = 7.8876e-04, PNorm = 730.8376, GNorm = 0.0568, lr_0 = 9.9458e-04
Loss = 9.1269e-01, PNorm = 730.8961, GNorm = 0.8736, lr_0 = 9.9458e-04
Validation binary_cross_entropy = 0.073366
Epoch 7780
Loss = 1.5889e-01, PNorm = 731.0231, GNorm = 24.3344, lr_0 = 9.9458e-04
Validation binary_cross_entropy = 0.178866
Epoch 7781
Loss = 1.5511e-01, PNorm = 731.1124, GNorm = 2.0621, lr_0 = 9.9458e-04
Validation binary_cross_entropy = 0.083750
Epoch 7782
Loss = 1.5833e-01, PNorm = 731.2179, GNorm = 4.3376, lr_0 = 9.9458e-04
Validation binary_cross_entropy = 0.097766
Epoch 7783
Loss = 6.3415e-02, PNorm = 731.2976, GNorm = 4.6653, lr_0 = 9.9458e-04
Validation binary_cross_entropy = 0.082369
Epoch 7784
Loss = 2.2184e-02, PNorm = 731.3627, GNorm = 2.3687, lr_0 = 9.9458e-04
Validation binary_cross_entropy = 0.093892
Epoch 7785
Loss = 3.2023e-02, PNorm = 731.3969, GNorm = 0.5171, lr_0 = 9.9458e-04
Validation binary_cross_entropy = 0.084405
Epoch 7786
Loss = 3.4820e-02, PNorm = 731.4270, GNorm = 0.0826, lr_0 = 9.9458e-04
Validation binary_cross_entropy = 0.098475
Epoch 7787
Loss = 3.8866e-02, PNorm = 731.4804, GNorm = 0.4932, lr_0 = 9.9458e-04
Validation binary_cross_entropy = 0.092265
Epoch 7788
Loss = 8.3854e-03, PNorm = 731.5181, GNorm = 0.1783, lr_0 = 9.9458e-04
Validation binary_cross_entropy = 0.083487
Epoch 7789
Loss = 2.6936e-02, PNorm = 731.5566, GNorm = 1.1907, lr_0 = 9.9458e-04
Loss = 6.3273e-02, PNorm = 731.5948, GNorm = 3.6533, lr_0 = 9.9458e-04
Validation binary_cross_entropy = 0.090970
Epoch 7790
Loss = 3.3307e-02, PNorm = 731.6259, GNorm = 1.8479, lr_0 = 9.9457e-04
Validation binary_cross_entropy = 0.076272
Epoch 7791
Loss = 3.2634e-02, PNorm = 731.6648, GNorm = 0.3621, lr_0 = 9.9457e-04
Validation binary_cross_entropy = 0.078294
Epoch 7792
Loss = 3.1610e-02, PNorm = 731.7048, GNorm = 2.4582, lr_0 = 9.9457e-04
Validation binary_cross_entropy = 0.088484
Epoch 7793
Loss = 4.9501e-02, PNorm = 731.7408, GNorm = 2.3327, lr_0 = 9.9457e-04
Validation binary_cross_entropy = 0.092891
Epoch 7794
Loss = 4.8121e-02, PNorm = 731.7749, GNorm = 4.2890, lr_0 = 9.9457e-04
Validation binary_cross_entropy = 0.093282
Epoch 7795
Loss = 5.7080e-03, PNorm = 731.8094, GNorm = 0.0660, lr_0 = 9.9457e-04
Validation binary_cross_entropy = 0.101283
Epoch 7796
Loss = 1.4197e-02, PNorm = 731.8413, GNorm = 1.7118, lr_0 = 9.9457e-04
Validation binary_cross_entropy = 0.106965
Epoch 7797
Loss = 1.4374e-02, PNorm = 731.8623, GNorm = 0.2924, lr_0 = 9.9457e-04
Validation binary_cross_entropy = 0.097639
Epoch 7798
Loss = 3.5448e-03, PNorm = 731.8835, GNorm = 0.2264, lr_0 = 9.9457e-04
Validation binary_cross_entropy = 0.091415
Epoch 7799
Loss = 8.1195e-03, PNorm = 731.9079, GNorm = 0.5306, lr_0 = 9.9457e-04
Loss = 4.1844e-02, PNorm = 731.9421, GNorm = 0.1453, lr_0 = 9.9457e-04
Validation binary_cross_entropy = 0.094938
Epoch 7800
Loss = 8.4247e-02, PNorm = 731.9811, GNorm = 0.1570, lr_0 = 9.9457e-04
Validation binary_cross_entropy = 0.120593
Epoch 7801
Loss = 3.1989e-02, PNorm = 732.0076, GNorm = 1.9766, lr_0 = 9.9457e-04
Validation binary_cross_entropy = 0.088372
Epoch 7802
Loss = 6.8813e-02, PNorm = 732.0368, GNorm = 1.3654, lr_0 = 9.9457e-04
Validation binary_cross_entropy = 0.084953
Epoch 7803
Loss = 7.9207e-02, PNorm = 732.0950, GNorm = 0.1509, lr_0 = 9.9457e-04
Validation binary_cross_entropy = 0.091572
Epoch 7804
Loss = 4.8996e-02, PNorm = 732.1601, GNorm = 3.4490, lr_0 = 9.9456e-04
Validation binary_cross_entropy = 0.090833
Epoch 7805
Loss = 5.2284e-02, PNorm = 732.2147, GNorm = 0.2039, lr_0 = 9.9456e-04
Validation binary_cross_entropy = 0.092168
Epoch 7806
Loss = 3.3126e-02, PNorm = 732.2570, GNorm = 0.9519, lr_0 = 9.9456e-04
Validation binary_cross_entropy = 0.094289
Epoch 7807
Loss = 5.1413e-02, PNorm = 732.2917, GNorm = 0.2477, lr_0 = 9.9456e-04
Validation binary_cross_entropy = 0.108469
Epoch 7808
Loss = 3.2074e-02, PNorm = 732.3265, GNorm = 0.3973, lr_0 = 9.9456e-04
Validation binary_cross_entropy = 0.145365
Epoch 7809
Loss = 1.4997e-01, PNorm = 732.3476, GNorm = 2.8940, lr_0 = 9.9456e-04
Loss = 7.0253e-02, PNorm = 732.3623, GNorm = 4.3831, lr_0 = 9.9456e-04
Validation binary_cross_entropy = 0.122208
Epoch 7810
Loss = 4.8909e-02, PNorm = 732.3858, GNorm = 3.1742, lr_0 = 9.9456e-04
Validation binary_cross_entropy = 0.079970
Epoch 7811
Loss = 7.1519e-02, PNorm = 732.4319, GNorm = 1.5265, lr_0 = 9.9456e-04
Validation binary_cross_entropy = 0.086196
Epoch 7812
Loss = 2.4868e-02, PNorm = 732.4800, GNorm = 3.2272, lr_0 = 9.9456e-04
Validation binary_cross_entropy = 0.093677
Epoch 7813
Loss = 1.7278e-02, PNorm = 732.5127, GNorm = 0.9235, lr_0 = 9.9456e-04
Validation binary_cross_entropy = 0.087169
Epoch 7814
Loss = 2.7771e-02, PNorm = 732.5387, GNorm = 3.3508, lr_0 = 9.9456e-04
Validation binary_cross_entropy = 0.091158
Epoch 7815
Loss = 9.1811e-03, PNorm = 732.5691, GNorm = 0.0762, lr_0 = 9.9456e-04
Validation binary_cross_entropy = 0.101727
Epoch 7816
Loss = 1.2900e-02, PNorm = 732.5971, GNorm = 0.5200, lr_0 = 9.9456e-04
Validation binary_cross_entropy = 0.119403
Epoch 7817
Loss = 1.9837e-03, PNorm = 732.6397, GNorm = 0.0948, lr_0 = 9.9455e-04
Validation binary_cross_entropy = 0.133212
Epoch 7818
Loss = 7.5684e-03, PNorm = 732.6807, GNorm = 3.0306, lr_0 = 9.9455e-04
Validation binary_cross_entropy = 0.115950
Epoch 7819
Loss = 3.1586e-02, PNorm = 732.7221, GNorm = 1.4067, lr_0 = 9.9455e-04
Loss = 7.4852e-02, PNorm = 732.7486, GNorm = 1.9433, lr_0 = 9.9455e-04
Validation binary_cross_entropy = 0.090949
Epoch 7820
Loss = 4.3969e-02, PNorm = 732.8113, GNorm = 4.1696, lr_0 = 9.9455e-04
Validation binary_cross_entropy = 0.095049
Epoch 7821
Loss = 2.8257e-02, PNorm = 732.8714, GNorm = 0.7842, lr_0 = 9.9455e-04
Validation binary_cross_entropy = 0.120365
Epoch 7822
Loss = 4.8185e-02, PNorm = 732.9052, GNorm = 1.7968, lr_0 = 9.9455e-04
Validation binary_cross_entropy = 0.104563
Epoch 7823
Loss = 2.6075e-02, PNorm = 732.9299, GNorm = 0.9137, lr_0 = 9.9455e-04
Validation binary_cross_entropy = 0.098328
Epoch 7824
Loss = 3.3065e-02, PNorm = 732.9680, GNorm = 0.1682, lr_0 = 9.9455e-04
Validation binary_cross_entropy = 0.099146
Epoch 7825
Loss = 5.5835e-02, PNorm = 733.0259, GNorm = 1.4307, lr_0 = 9.9455e-04
Validation binary_cross_entropy = 0.117464
Epoch 7826
Loss = 2.1496e-02, PNorm = 733.0618, GNorm = 0.2387, lr_0 = 9.9455e-04
Validation binary_cross_entropy = 0.097289
Epoch 7827
Loss = 6.8890e-02, PNorm = 733.0949, GNorm = 1.0416, lr_0 = 9.9455e-04
Validation binary_cross_entropy = 0.085551
Epoch 7828
Loss = 9.3373e-02, PNorm = 733.1536, GNorm = 5.7685, lr_0 = 9.9455e-04
Validation binary_cross_entropy = 0.140016
Epoch 7829
Loss = 3.7536e-02, PNorm = 733.2314, GNorm = 2.1914, lr_0 = 9.9455e-04
Loss = 7.0821e-02, PNorm = 733.2640, GNorm = 2.0112, lr_0 = 9.9455e-04
Validation binary_cross_entropy = 0.102543
Epoch 7830
Loss = 5.1449e-02, PNorm = 733.2947, GNorm = 3.6326, lr_0 = 9.9454e-04
Validation binary_cross_entropy = 0.099067
Epoch 7831
Loss = 4.7651e-02, PNorm = 733.3405, GNorm = 0.8983, lr_0 = 9.9454e-04
Validation binary_cross_entropy = 0.114042
Epoch 7832
Loss = 6.0117e-02, PNorm = 733.3791, GNorm = 2.4911, lr_0 = 9.9454e-04
Validation binary_cross_entropy = 0.086611
Epoch 7833
Loss = 3.3631e-02, PNorm = 733.4263, GNorm = 0.8924, lr_0 = 9.9454e-04
Validation binary_cross_entropy = 0.096731
Epoch 7834
Loss = 1.9966e-02, PNorm = 733.4719, GNorm = 2.8170, lr_0 = 9.9454e-04
Validation binary_cross_entropy = 0.104958
Epoch 7835
Loss = 1.3940e-02, PNorm = 733.5003, GNorm = 1.2821, lr_0 = 9.9454e-04
Validation binary_cross_entropy = 0.099419
Epoch 7836
Loss = 4.5359e-03, PNorm = 733.5274, GNorm = 0.2580, lr_0 = 9.9454e-04
Validation binary_cross_entropy = 0.123252
Epoch 7837
Loss = 6.2211e-03, PNorm = 733.5604, GNorm = 2.8986, lr_0 = 9.9454e-04
Validation binary_cross_entropy = 0.138917
Epoch 7838
Loss = 8.2491e-02, PNorm = 733.5854, GNorm = 1.1792, lr_0 = 9.9454e-04
Validation binary_cross_entropy = 0.101873
Epoch 7839
Loss = 5.8450e-02, PNorm = 733.6194, GNorm = 3.6801, lr_0 = 9.9454e-04
Loss = 2.5718e-02, PNorm = 733.6660, GNorm = 1.4054, lr_0 = 9.9454e-04
Validation binary_cross_entropy = 0.122318
Epoch 7840
Loss = 2.3907e-02, PNorm = 733.6926, GNorm = 0.2783, lr_0 = 9.9454e-04
Validation binary_cross_entropy = 0.122114
Epoch 7841
Loss = 7.9482e-02, PNorm = 733.7076, GNorm = 0.6315, lr_0 = 9.9454e-04
Validation binary_cross_entropy = 0.079636
Epoch 7842
Loss = 3.7981e-02, PNorm = 733.7452, GNorm = 0.5093, lr_0 = 9.9454e-04
Validation binary_cross_entropy = 0.081141
Epoch 7843
Loss = 3.4177e-02, PNorm = 733.7834, GNorm = 2.5106, lr_0 = 9.9454e-04
Validation binary_cross_entropy = 0.082886
Epoch 7844
Loss = 1.2485e-02, PNorm = 733.8121, GNorm = 0.3326, lr_0 = 9.9453e-04
Validation binary_cross_entropy = 0.092474
Epoch 7845
Loss = 4.9538e-02, PNorm = 733.8531, GNorm = 0.7455, lr_0 = 9.9453e-04
Validation binary_cross_entropy = 0.141799
Epoch 7846
Loss = 4.8560e-02, PNorm = 733.8804, GNorm = 0.2272, lr_0 = 9.9453e-04
Validation binary_cross_entropy = 0.128007
Epoch 7847
Loss = 2.5920e-02, PNorm = 733.8999, GNorm = 1.4567, lr_0 = 9.9453e-04
Validation binary_cross_entropy = 0.120126
Epoch 7848
Loss = 7.3821e-02, PNorm = 733.9214, GNorm = 3.2031, lr_0 = 9.9453e-04
Validation binary_cross_entropy = 0.144183
Epoch 7849
Loss = 3.3742e-02, PNorm = 733.9522, GNorm = 2.1114, lr_0 = 9.9453e-04
Loss = 7.7304e-03, PNorm = 733.9734, GNorm = 0.0650, lr_0 = 9.9453e-04
Validation binary_cross_entropy = 0.123686
Epoch 7850
Loss = 2.3958e-02, PNorm = 733.9981, GNorm = 0.8955, lr_0 = 9.9453e-04
Validation binary_cross_entropy = 0.112638
Epoch 7851
Loss = 3.4900e-02, PNorm = 734.0438, GNorm = 2.6963, lr_0 = 9.9453e-04
Validation binary_cross_entropy = 0.148512
Epoch 7852
Loss = 3.4692e-02, PNorm = 734.0953, GNorm = 1.0399, lr_0 = 9.9453e-04
Validation binary_cross_entropy = 0.137422
Epoch 7853
Loss = 6.4247e-02, PNorm = 734.1285, GNorm = 3.7087, lr_0 = 9.9453e-04
Validation binary_cross_entropy = 0.112204
Epoch 7854
Loss = 9.4725e-03, PNorm = 734.1737, GNorm = 0.0904, lr_0 = 9.9453e-04
Validation binary_cross_entropy = 0.126787
Epoch 7855
Loss = 2.3455e-02, PNorm = 734.2139, GNorm = 1.4039, lr_0 = 9.9453e-04
Validation binary_cross_entropy = 0.143432
Epoch 7856
Loss = 3.3503e-03, PNorm = 734.2607, GNorm = 0.1415, lr_0 = 9.9453e-04
Validation binary_cross_entropy = 0.141962
Epoch 7857
Loss = 1.6226e-02, PNorm = 734.3179, GNorm = 0.3496, lr_0 = 9.9452e-04
Validation binary_cross_entropy = 0.135965
Epoch 7858
Loss = 6.2596e-02, PNorm = 734.3655, GNorm = 7.8712, lr_0 = 9.9452e-04
Validation binary_cross_entropy = 0.155978
Epoch 7859
Loss = 1.0940e-03, PNorm = 734.3986, GNorm = 0.1936, lr_0 = 9.9452e-04
Loss = 1.3437e-02, PNorm = 734.4173, GNorm = 0.3964, lr_0 = 9.9452e-04
Validation binary_cross_entropy = 0.142924
Epoch 7860
Loss = 5.7450e-02, PNorm = 734.4334, GNorm = 8.3744, lr_0 = 9.9452e-04
Validation binary_cross_entropy = 0.113348
Epoch 7861
Loss = 5.6099e-02, PNorm = 734.4650, GNorm = 0.4836, lr_0 = 9.9452e-04
Validation binary_cross_entropy = 0.095051
Epoch 7862
Loss = 2.5601e-02, PNorm = 734.5313, GNorm = 1.2134, lr_0 = 9.9452e-04
Validation binary_cross_entropy = 0.108449
Epoch 7863
Loss = 1.1138e-02, PNorm = 734.5884, GNorm = 0.8719, lr_0 = 9.9452e-04
Validation binary_cross_entropy = 0.119665
Epoch 7864
Loss = 2.6529e-02, PNorm = 734.6231, GNorm = 1.0430, lr_0 = 9.9452e-04
Validation binary_cross_entropy = 0.111879
Epoch 7865
Loss = 5.2625e-02, PNorm = 734.6647, GNorm = 2.1948, lr_0 = 9.9452e-04
Validation binary_cross_entropy = 0.167023
Epoch 7866
Loss = 3.3117e-02, PNorm = 734.7150, GNorm = 0.4158, lr_0 = 9.9452e-04
Validation binary_cross_entropy = 0.153911
Epoch 7867
Loss = 1.9318e-02, PNorm = 734.7455, GNorm = 1.5094, lr_0 = 9.9452e-04
Validation binary_cross_entropy = 0.159314
Epoch 7868
Loss = 4.1135e-02, PNorm = 734.7772, GNorm = 1.8071, lr_0 = 9.9452e-04
Validation binary_cross_entropy = 0.157111
Epoch 7869
Loss = 1.0626e-02, PNorm = 734.8024, GNorm = 0.6074, lr_0 = 9.9452e-04
Loss = 7.6839e-02, PNorm = 734.8327, GNorm = 0.1686, lr_0 = 9.9452e-04
Validation binary_cross_entropy = 0.114756
Epoch 7870
Loss = 7.4545e-02, PNorm = 734.8832, GNorm = 8.2797, lr_0 = 9.9451e-04
Validation binary_cross_entropy = 0.103691
Epoch 7871
Loss = 3.2387e-02, PNorm = 734.9554, GNorm = 1.1042, lr_0 = 9.9451e-04
Validation binary_cross_entropy = 0.130573
Epoch 7872
Loss = 2.6795e-02, PNorm = 734.9983, GNorm = 0.6967, lr_0 = 9.9451e-04
Validation binary_cross_entropy = 0.119079
Epoch 7873
Loss = 2.3576e-02, PNorm = 735.0188, GNorm = 0.2954, lr_0 = 9.9451e-04
Validation binary_cross_entropy = 0.105714
Epoch 7874
Loss = 5.3660e-02, PNorm = 735.0557, GNorm = 0.2790, lr_0 = 9.9451e-04
Validation binary_cross_entropy = 0.123196
Epoch 7875
Loss = 5.6526e-02, PNorm = 735.1056, GNorm = 6.0546, lr_0 = 9.9451e-04
Validation binary_cross_entropy = 0.118139
Epoch 7876
Loss = 1.3103e-02, PNorm = 735.1447, GNorm = 0.9390, lr_0 = 9.9451e-04
Validation binary_cross_entropy = 0.105520
Epoch 7877
Loss = 4.8868e-02, PNorm = 735.1780, GNorm = 1.2022, lr_0 = 9.9451e-04
Validation binary_cross_entropy = 0.119177
Epoch 7878
Loss = 6.3960e-03, PNorm = 735.2243, GNorm = 0.1525, lr_0 = 9.9451e-04
Validation binary_cross_entropy = 0.130479
Epoch 7879
Loss = 3.5837e-02, PNorm = 735.2616, GNorm = 0.5898, lr_0 = 9.9451e-04
Loss = 3.8137e-02, PNorm = 735.2937, GNorm = 0.0318, lr_0 = 9.9451e-04
Validation binary_cross_entropy = 0.127897
Epoch 7880
Loss = 3.1641e-02, PNorm = 735.3251, GNorm = 0.4044, lr_0 = 9.9451e-04
Validation binary_cross_entropy = 0.107951
Epoch 7881
Loss = 2.2028e-02, PNorm = 735.3574, GNorm = 0.2979, lr_0 = 9.9451e-04
Validation binary_cross_entropy = 0.100035
Epoch 7882
Loss = 5.3698e-03, PNorm = 735.3898, GNorm = 0.5050, lr_0 = 9.9451e-04
Validation binary_cross_entropy = 0.107029
Epoch 7883
Loss = 3.2345e-02, PNorm = 735.4215, GNorm = 2.9193, lr_0 = 9.9450e-04
Validation binary_cross_entropy = 0.113724
Epoch 7884
Loss = 3.6576e-02, PNorm = 735.4498, GNorm = 2.0182, lr_0 = 9.9450e-04
Validation binary_cross_entropy = 0.113980
Epoch 7885
Loss = 9.6063e-02, PNorm = 735.4835, GNorm = 0.2278, lr_0 = 9.9450e-04
Validation binary_cross_entropy = 0.126440
Epoch 7886
Loss = 5.5213e-02, PNorm = 735.5143, GNorm = 2.6358, lr_0 = 9.9450e-04
Validation binary_cross_entropy = 0.100660
Epoch 7887
Loss = 4.3375e-02, PNorm = 735.5383, GNorm = 0.4982, lr_0 = 9.9450e-04
Validation binary_cross_entropy = 0.094868
Epoch 7888
Loss = 9.7906e-03, PNorm = 735.5656, GNorm = 0.4824, lr_0 = 9.9450e-04
Validation binary_cross_entropy = 0.117415
Epoch 7889
Loss = 6.8477e-02, PNorm = 735.5971, GNorm = 4.1295, lr_0 = 9.9450e-04
Loss = 3.2517e-02, PNorm = 735.6322, GNorm = 11.8748, lr_0 = 9.9450e-04
Validation binary_cross_entropy = 0.151151
Epoch 7890
Loss = 2.3352e-02, PNorm = 735.6733, GNorm = 0.4007, lr_0 = 9.9450e-04
Validation binary_cross_entropy = 0.111071
Epoch 7891
Loss = 2.0079e-02, PNorm = 735.7143, GNorm = 0.2980, lr_0 = 9.9450e-04
Validation binary_cross_entropy = 0.105631
Epoch 7892
Loss = 2.3044e-02, PNorm = 735.7547, GNorm = 5.9635, lr_0 = 9.9450e-04
Validation binary_cross_entropy = 0.115652
Epoch 7893
Loss = 2.1184e-02, PNorm = 735.7899, GNorm = 0.1845, lr_0 = 9.9450e-04
Validation binary_cross_entropy = 0.121041
Epoch 7894
Loss = 2.0743e-02, PNorm = 735.8290, GNorm = 0.6633, lr_0 = 9.9450e-04
Validation binary_cross_entropy = 0.127664
Epoch 7895
Loss = 2.9983e-02, PNorm = 735.8750, GNorm = 0.0326, lr_0 = 9.9450e-04
Validation binary_cross_entropy = 0.130743
Epoch 7896
Loss = 5.4827e-03, PNorm = 735.9106, GNorm = 0.2429, lr_0 = 9.9450e-04
Validation binary_cross_entropy = 0.141176
Epoch 7897
Loss = 2.7419e-01, PNorm = 735.9375, GNorm = 20.9044, lr_0 = 9.9449e-04
Validation binary_cross_entropy = 0.093544
Epoch 7898
Loss = 5.4065e-02, PNorm = 735.9919, GNorm = 1.2835, lr_0 = 9.9449e-04
Validation binary_cross_entropy = 0.111279
Epoch 7899
Loss = 4.5116e-03, PNorm = 736.0571, GNorm = 0.2547, lr_0 = 9.9449e-04
Loss = 3.1467e-02, PNorm = 736.0865, GNorm = 0.1791, lr_0 = 9.9449e-04
Validation binary_cross_entropy = 0.100776
Epoch 7900
Loss = 3.0934e-02, PNorm = 736.1044, GNorm = 0.9231, lr_0 = 9.9449e-04
Validation binary_cross_entropy = 0.088128
Epoch 7901
Loss = 4.0001e-02, PNorm = 736.1319, GNorm = 0.9233, lr_0 = 9.9449e-04
Validation binary_cross_entropy = 0.101007
Epoch 7902
Loss = 1.3678e-02, PNorm = 736.1581, GNorm = 1.9696, lr_0 = 9.9449e-04
Validation binary_cross_entropy = 0.095505
Epoch 7903
Loss = 1.9984e-02, PNorm = 736.1746, GNorm = 0.5538, lr_0 = 9.9449e-04
Validation binary_cross_entropy = 0.091969
Epoch 7904
Loss = 1.5156e-02, PNorm = 736.1922, GNorm = 0.0928, lr_0 = 9.9449e-04
Validation binary_cross_entropy = 0.097252
Epoch 7905
Loss = 1.6231e-02, PNorm = 736.2203, GNorm = 2.3172, lr_0 = 9.9449e-04
Validation binary_cross_entropy = 0.115860
Epoch 7906
Loss = 7.1028e-03, PNorm = 736.2507, GNorm = 0.8733, lr_0 = 9.9449e-04
Validation binary_cross_entropy = 0.143919
Epoch 7907
Loss = 1.1284e-01, PNorm = 736.2690, GNorm = 0.0466, lr_0 = 9.9449e-04
Validation binary_cross_entropy = 0.109961
Epoch 7908
Loss = 5.2497e-02, PNorm = 736.2784, GNorm = 2.3608, lr_0 = 9.9449e-04
Validation binary_cross_entropy = 0.090689
Epoch 7909
Loss = 1.5174e-02, PNorm = 736.2852, GNorm = 0.9714, lr_0 = 9.9449e-04
Loss = 3.0390e-02, PNorm = 736.3082, GNorm = 0.4383, lr_0 = 9.9449e-04
Validation binary_cross_entropy = 0.082692
Epoch 7910
Loss = 6.0996e-02, PNorm = 736.3309, GNorm = 2.2400, lr_0 = 9.9448e-04
Validation binary_cross_entropy = 0.084796
Epoch 7911
Loss = 7.6825e-03, PNorm = 736.3659, GNorm = 0.1101, lr_0 = 9.9448e-04
Validation binary_cross_entropy = 0.142193
Epoch 7912
Loss = 2.7756e-02, PNorm = 736.3855, GNorm = 0.0743, lr_0 = 9.9448e-04
Validation binary_cross_entropy = 0.096527
Epoch 7913
Loss = 1.2385e-02, PNorm = 736.4213, GNorm = 0.2328, lr_0 = 9.9448e-04
Validation binary_cross_entropy = 0.106945
Epoch 7914
Loss = 9.5698e-03, PNorm = 736.4462, GNorm = 0.9904, lr_0 = 9.9448e-04
Validation binary_cross_entropy = 0.111740
Epoch 7915
Loss = 2.9752e-02, PNorm = 736.4624, GNorm = 0.1455, lr_0 = 9.9448e-04
Validation binary_cross_entropy = 0.109065
Epoch 7916
Loss = 2.4554e-02, PNorm = 736.4849, GNorm = 0.1519, lr_0 = 9.9448e-04
Validation binary_cross_entropy = 0.120273
Epoch 7917
Loss = 1.1262e-02, PNorm = 736.5235, GNorm = 0.5102, lr_0 = 9.9448e-04
Validation binary_cross_entropy = 0.096865
Epoch 7918
Loss = 2.0220e-02, PNorm = 736.5776, GNorm = 0.3907, lr_0 = 9.9448e-04
Validation binary_cross_entropy = 0.102617
Epoch 7919
Loss = 8.2909e-03, PNorm = 736.6364, GNorm = 0.2299, lr_0 = 9.9448e-04
Loss = 9.5435e-03, PNorm = 736.6818, GNorm = 1.3552, lr_0 = 9.9448e-04
Validation binary_cross_entropy = 0.121203
Epoch 7920
Loss = 2.2102e-02, PNorm = 736.7132, GNorm = 0.1031, lr_0 = 9.9448e-04
Validation binary_cross_entropy = 0.134605
Epoch 7921
Loss = 2.1396e-02, PNorm = 736.7338, GNorm = 2.9906, lr_0 = 9.9448e-04
Validation binary_cross_entropy = 0.138585
Epoch 7922
Loss = 4.2682e-02, PNorm = 736.7521, GNorm = 7.9814, lr_0 = 9.9448e-04
Validation binary_cross_entropy = 0.149110
Epoch 7923
Loss = 1.9271e-02, PNorm = 736.7700, GNorm = 0.0578, lr_0 = 9.9447e-04
Validation binary_cross_entropy = 0.158629
Epoch 7924
Loss = 3.9617e-03, PNorm = 736.7863, GNorm = 0.1969, lr_0 = 9.9447e-04
Validation binary_cross_entropy = 0.176323
Epoch 7925
Loss = 3.7229e-02, PNorm = 736.7911, GNorm = 0.3213, lr_0 = 9.9447e-04
Validation binary_cross_entropy = 0.113213
Epoch 7926
Loss = 1.7066e-02, PNorm = 736.8200, GNorm = 0.0962, lr_0 = 9.9447e-04
Validation binary_cross_entropy = 0.146986
Epoch 7927
Loss = 7.0515e-02, PNorm = 736.8663, GNorm = 4.1652, lr_0 = 9.9447e-04
Validation binary_cross_entropy = 0.085250
Epoch 7928
Loss = 3.8237e-02, PNorm = 736.8910, GNorm = 0.5335, lr_0 = 9.9447e-04
Validation binary_cross_entropy = 0.085858
Epoch 7929
Loss = 1.1079e-02, PNorm = 736.9345, GNorm = 0.4240, lr_0 = 9.9447e-04
Loss = 4.7323e-02, PNorm = 736.9754, GNorm = 2.6643, lr_0 = 9.9447e-04
Validation binary_cross_entropy = 0.086350
Epoch 7930
Loss = 1.4920e-02, PNorm = 737.0096, GNorm = 0.0487, lr_0 = 9.9447e-04
Validation binary_cross_entropy = 0.086012
Epoch 7931
Loss = 1.1666e-02, PNorm = 737.0462, GNorm = 0.0960, lr_0 = 9.9447e-04
Validation binary_cross_entropy = 0.097596
Epoch 7932
Loss = 2.2322e-02, PNorm = 737.0711, GNorm = 6.0879, lr_0 = 9.9447e-04
Validation binary_cross_entropy = 0.091452
Epoch 7933
Loss = 1.4870e-02, PNorm = 737.0842, GNorm = 0.3050, lr_0 = 9.9447e-04
Validation binary_cross_entropy = 0.089158
Epoch 7934
Loss = 1.8149e-02, PNorm = 737.1228, GNorm = 1.0633, lr_0 = 9.9447e-04
Validation binary_cross_entropy = 0.096562
Epoch 7935
Loss = 3.8021e-02, PNorm = 737.1605, GNorm = 1.1057, lr_0 = 9.9447e-04
Validation binary_cross_entropy = 0.095737
Epoch 7936
Loss = 1.4751e-02, PNorm = 737.2039, GNorm = 0.0717, lr_0 = 9.9447e-04
Validation binary_cross_entropy = 0.100106
Epoch 7937
Loss = 2.9891e-03, PNorm = 737.2390, GNorm = 0.0160, lr_0 = 9.9446e-04
Validation binary_cross_entropy = 0.092352
Epoch 7938
Loss = 2.0616e-02, PNorm = 737.2586, GNorm = 1.9666, lr_0 = 9.9446e-04
Validation binary_cross_entropy = 0.089710
Epoch 7939
Loss = 4.0635e-02, PNorm = 737.2865, GNorm = 1.1606, lr_0 = 9.9446e-04
Loss = 2.0095e-02, PNorm = 737.3261, GNorm = 1.4661, lr_0 = 9.9446e-04
Validation binary_cross_entropy = 0.107544
Epoch 7940
Loss = 5.1459e-02, PNorm = 737.3497, GNorm = 0.1603, lr_0 = 9.9446e-04
Validation binary_cross_entropy = 0.093395
Epoch 7941
Loss = 2.3694e-02, PNorm = 737.3722, GNorm = 0.8723, lr_0 = 9.9446e-04
Validation binary_cross_entropy = 0.088811
Epoch 7942
Loss = 1.4868e-02, PNorm = 737.4026, GNorm = 0.4700, lr_0 = 9.9446e-04
Validation binary_cross_entropy = 0.090364
Epoch 7943
Loss = 1.7069e-02, PNorm = 737.4314, GNorm = 2.1289, lr_0 = 9.9446e-04
Validation binary_cross_entropy = 0.095172
Epoch 7944
Loss = 8.0053e-03, PNorm = 737.4644, GNorm = 0.5656, lr_0 = 9.9446e-04
Validation binary_cross_entropy = 0.100298
Epoch 7945
Loss = 1.7216e-02, PNorm = 737.4863, GNorm = 0.0137, lr_0 = 9.9446e-04
Validation binary_cross_entropy = 0.098900
Epoch 7946
Loss = 1.6474e-02, PNorm = 737.5070, GNorm = 0.5166, lr_0 = 9.9446e-04
Validation binary_cross_entropy = 0.105355
Epoch 7947
Loss = 4.5017e-03, PNorm = 737.5310, GNorm = 0.1884, lr_0 = 9.9446e-04
Validation binary_cross_entropy = 0.107561
Epoch 7948
Loss = 2.9782e-03, PNorm = 737.5486, GNorm = 0.0582, lr_0 = 9.9446e-04
Validation binary_cross_entropy = 0.103487
Epoch 7949
Loss = 5.4369e-03, PNorm = 737.5910, GNorm = 0.6889, lr_0 = 9.9446e-04
Loss = 3.2832e-02, PNorm = 737.6435, GNorm = 0.0227, lr_0 = 9.9446e-04
Validation binary_cross_entropy = 0.288661
Epoch 7950
Loss = 9.8674e-02, PNorm = 737.6813, GNorm = 1.5257, lr_0 = 9.9445e-04
Validation binary_cross_entropy = 0.092102
Epoch 7951
Loss = 2.1539e-02, PNorm = 737.7273, GNorm = 0.6244, lr_0 = 9.9445e-04
Validation binary_cross_entropy = 0.084245
Epoch 7952
Loss = 2.4191e-02, PNorm = 737.7850, GNorm = 0.2051, lr_0 = 9.9445e-04
Validation binary_cross_entropy = 0.105393
Epoch 7953
Loss = 8.1992e-03, PNorm = 737.8408, GNorm = 0.1876, lr_0 = 9.9445e-04
Validation binary_cross_entropy = 0.101318
Epoch 7954
Loss = 6.1480e-03, PNorm = 737.8894, GNorm = 0.0580, lr_0 = 9.9445e-04
Validation binary_cross_entropy = 0.108193
Epoch 7955
Loss = 1.3563e-02, PNorm = 737.9284, GNorm = 0.1658, lr_0 = 9.9445e-04
Validation binary_cross_entropy = 0.154949
Epoch 7956
Loss = 1.0168e-01, PNorm = 737.9552, GNorm = 1.3890, lr_0 = 9.9445e-04
Validation binary_cross_entropy = 0.108250
Epoch 7957
Loss = 2.7272e-02, PNorm = 737.9891, GNorm = 0.6340, lr_0 = 9.9445e-04
Validation binary_cross_entropy = 0.116083
Epoch 7958
Loss = 5.5259e-02, PNorm = 738.0405, GNorm = 0.0775, lr_0 = 9.9445e-04
Validation binary_cross_entropy = 0.132275
Epoch 7959
Loss = 1.0315e-02, PNorm = 738.0992, GNorm = 1.1924, lr_0 = 9.9445e-04
Loss = 1.1218e-02, PNorm = 738.1323, GNorm = 0.0994, lr_0 = 9.9445e-04
Validation binary_cross_entropy = 0.133107
Epoch 7960
Loss = 1.2449e-02, PNorm = 738.1537, GNorm = 1.4943, lr_0 = 9.9445e-04
Validation binary_cross_entropy = 0.118337
Epoch 7961
Loss = 1.7857e-02, PNorm = 738.1813, GNorm = 2.0482, lr_0 = 9.9445e-04
Validation binary_cross_entropy = 0.114632
Epoch 7962
Loss = 4.8693e-02, PNorm = 738.2142, GNorm = 0.1314, lr_0 = 9.9445e-04
Validation binary_cross_entropy = 0.091812
Epoch 7963
Loss = 1.7017e-02, PNorm = 738.2503, GNorm = 5.8123, lr_0 = 9.9444e-04
Validation binary_cross_entropy = 0.122984
Epoch 7964
Loss = 6.7733e-03, PNorm = 738.2855, GNorm = 2.1275, lr_0 = 9.9444e-04
Validation binary_cross_entropy = 0.271879
Epoch 7965
Loss = 1.8168e-01, PNorm = 738.2987, GNorm = 6.2916, lr_0 = 9.9444e-04
Validation binary_cross_entropy = 0.088838
Epoch 7966
Loss = 7.2409e-02, PNorm = 738.3144, GNorm = 3.7518, lr_0 = 9.9444e-04
Validation binary_cross_entropy = 0.084065
Epoch 7967
Loss = 1.7780e-02, PNorm = 738.3834, GNorm = 0.7059, lr_0 = 9.9444e-04
Validation binary_cross_entropy = 0.099549
Epoch 7968
Loss = 9.6286e-03, PNorm = 738.4386, GNorm = 0.4467, lr_0 = 9.9444e-04
Validation binary_cross_entropy = 0.104582
Epoch 7969
Loss = 1.5543e-02, PNorm = 738.4777, GNorm = 0.7982, lr_0 = 9.9444e-04
Loss = 4.5538e-02, PNorm = 738.5290, GNorm = 0.1494, lr_0 = 9.9444e-04
Validation binary_cross_entropy = 0.134752
Epoch 7970
Loss = 2.9515e-02, PNorm = 738.5875, GNorm = 2.3249, lr_0 = 9.9444e-04
Validation binary_cross_entropy = 0.108972
Epoch 7971
Loss = 2.0044e-02, PNorm = 738.6369, GNorm = 0.2397, lr_0 = 9.9444e-04
Validation binary_cross_entropy = 0.096409
Epoch 7972
Loss = 7.0732e-02, PNorm = 738.7214, GNorm = 5.5442, lr_0 = 9.9444e-04
Validation binary_cross_entropy = 0.130240
Epoch 7973
Loss = 8.3557e-02, PNorm = 738.8259, GNorm = 2.2863, lr_0 = 9.9444e-04
Validation binary_cross_entropy = 0.084913
Epoch 7974
Loss = 1.0609e-01, PNorm = 738.9099, GNorm = 4.8054, lr_0 = 9.9444e-04
Validation binary_cross_entropy = 0.116131
Epoch 7975
Loss = 1.3406e-01, PNorm = 738.9843, GNorm = 2.1123, lr_0 = 9.9444e-04
Validation binary_cross_entropy = 0.111459
Epoch 7976
Loss = 4.6551e-02, PNorm = 739.0392, GNorm = 1.9432, lr_0 = 9.9444e-04
Validation binary_cross_entropy = 0.080408
Epoch 7977
Loss = 4.9309e-02, PNorm = 739.1059, GNorm = 2.8737, lr_0 = 9.9443e-04
Validation binary_cross_entropy = 0.100185
Epoch 7978
Loss = 9.5164e-03, PNorm = 739.1765, GNorm = 0.4538, lr_0 = 9.9443e-04
Validation binary_cross_entropy = 0.089849
Epoch 7979
Loss = 4.5767e-02, PNorm = 739.2193, GNorm = 2.8036, lr_0 = 9.9443e-04
Loss = 4.0470e-02, PNorm = 739.2608, GNorm = 1.6626, lr_0 = 9.9443e-04
Validation binary_cross_entropy = 0.089911
Epoch 7980
Loss = 7.8504e-02, PNorm = 739.3029, GNorm = 2.8836, lr_0 = 9.9443e-04
Validation binary_cross_entropy = 0.090278
Epoch 7981
Loss = 2.5401e-02, PNorm = 739.3462, GNorm = 1.3175, lr_0 = 9.9443e-04
Validation binary_cross_entropy = 0.088746
Epoch 7982
Loss = 2.8668e-02, PNorm = 739.3860, GNorm = 1.3610, lr_0 = 9.9443e-04
Validation binary_cross_entropy = 0.096759
Epoch 7983
Loss = 1.8401e-02, PNorm = 739.4222, GNorm = 0.1168, lr_0 = 9.9443e-04
Validation binary_cross_entropy = 0.101221
Epoch 7984
Loss = 1.1963e-02, PNorm = 739.4473, GNorm = 0.1336, lr_0 = 9.9443e-04
Validation binary_cross_entropy = 0.110160
Epoch 7985
Loss = 5.3986e-02, PNorm = 739.4832, GNorm = 1.1422, lr_0 = 9.9443e-04
Validation binary_cross_entropy = 0.121306
Epoch 7986
Loss = 6.6378e-02, PNorm = 739.5387, GNorm = 4.4523, lr_0 = 9.9443e-04
Validation binary_cross_entropy = 0.120761
Epoch 7987
Loss = 2.4081e-02, PNorm = 739.5870, GNorm = 1.6187, lr_0 = 9.9443e-04
Validation binary_cross_entropy = 0.098866
Epoch 7988
Loss = 1.1629e-02, PNorm = 739.6340, GNorm = 0.7096, lr_0 = 9.9443e-04
Validation binary_cross_entropy = 0.097272
Epoch 7989
Loss = 2.7635e-02, PNorm = 739.6879, GNorm = 1.1556, lr_0 = 9.9443e-04
Loss = 3.7371e-02, PNorm = 739.7325, GNorm = 1.5420, lr_0 = 9.9442e-04
Validation binary_cross_entropy = 0.116101
Epoch 7990
Loss = 3.0075e-02, PNorm = 739.7683, GNorm = 0.7564, lr_0 = 9.9442e-04
Validation binary_cross_entropy = 0.105499
Epoch 7991
Loss = 6.0064e-03, PNorm = 739.8079, GNorm = 0.1005, lr_0 = 9.9442e-04
Validation binary_cross_entropy = 0.109562
Epoch 7992
Loss = 1.4196e-02, PNorm = 739.8393, GNorm = 0.1061, lr_0 = 9.9442e-04
Validation binary_cross_entropy = 0.108006
Epoch 7993
Loss = 7.5866e-02, PNorm = 739.8639, GNorm = 2.3632, lr_0 = 9.9442e-04
Validation binary_cross_entropy = 0.102896
Epoch 7994
Loss = 1.7272e-02, PNorm = 739.8941, GNorm = 0.0839, lr_0 = 9.9442e-04
Validation binary_cross_entropy = 0.104969
Epoch 7995
Loss = 2.7149e-02, PNorm = 739.9243, GNorm = 1.2217, lr_0 = 9.9442e-04
Validation binary_cross_entropy = 0.114827
Epoch 7996
Loss = 2.2319e-02, PNorm = 739.9644, GNorm = 0.9531, lr_0 = 9.9442e-04
Validation binary_cross_entropy = 0.112589
Epoch 7997
Loss = 8.9723e-03, PNorm = 740.0042, GNorm = 0.1214, lr_0 = 9.9442e-04
Validation binary_cross_entropy = 0.110976
Epoch 7998
Loss = 7.5892e-03, PNorm = 740.0523, GNorm = 0.6635, lr_0 = 9.9442e-04
Validation binary_cross_entropy = 0.102582
Epoch 7999
Loss = 6.0036e-03, PNorm = 740.1006, GNorm = 0.3598, lr_0 = 9.9442e-04
Loss = 4.7011e-02, PNorm = 740.1580, GNorm = 0.8897, lr_0 = 9.9442e-04
Validation binary_cross_entropy = 0.095855
Epoch 8000
Loss = 4.6670e-02, PNorm = 740.2232, GNorm = 0.8529, lr_0 = 9.9442e-04
Validation binary_cross_entropy = 0.107334
Epoch 8001
Loss = 2.1768e-02, PNorm = 740.2793, GNorm = 2.2889, lr_0 = 9.9442e-04
Validation binary_cross_entropy = 0.116937
Epoch 8002
Loss = 2.7583e-02, PNorm = 740.3196, GNorm = 1.5527, lr_0 = 9.9442e-04
Validation binary_cross_entropy = 0.098974
Epoch 8003
Loss = 1.7425e-02, PNorm = 740.3576, GNorm = 0.5053, lr_0 = 9.9441e-04
Validation binary_cross_entropy = 0.106359
Epoch 8004
Loss = 3.6403e-02, PNorm = 740.4036, GNorm = 0.6803, lr_0 = 9.9441e-04
Validation binary_cross_entropy = 0.106592
Epoch 8005
Loss = 8.8151e-02, PNorm = 740.4697, GNorm = 1.9777, lr_0 = 9.9441e-04
Validation binary_cross_entropy = 0.121010
Epoch 8006
Loss = 1.8347e-02, PNorm = 740.5541, GNorm = 0.7201, lr_0 = 9.9441e-04
Validation binary_cross_entropy = 0.126188
Epoch 8007
Loss = 1.9304e-02, PNorm = 740.6077, GNorm = 1.8536, lr_0 = 9.9441e-04
Validation binary_cross_entropy = 0.117941
Epoch 8008
Loss = 3.5819e-02, PNorm = 740.6487, GNorm = 2.1337, lr_0 = 9.9441e-04
Validation binary_cross_entropy = 0.119128
Epoch 8009
Loss = 1.2819e-01, PNorm = 740.7060, GNorm = 7.8724, lr_0 = 9.9441e-04
Loss = 7.0214e-02, PNorm = 740.8199, GNorm = 6.1743, lr_0 = 9.9441e-04
Validation binary_cross_entropy = 0.140349
Epoch 8010
Loss = 1.3182e-01, PNorm = 740.9505, GNorm = 0.6401, lr_0 = 9.9441e-04
Validation binary_cross_entropy = 0.132342
Epoch 8011
Loss = 8.7513e-02, PNorm = 741.0579, GNorm = 0.7482, lr_0 = 9.9441e-04
Validation binary_cross_entropy = 0.129825
Epoch 8012
Loss = 9.3128e-02, PNorm = 741.1588, GNorm = 13.3254, lr_0 = 9.9441e-04
Validation binary_cross_entropy = 0.134059
Epoch 8013
Loss = 5.0947e-02, PNorm = 741.2542, GNorm = 1.6421, lr_0 = 9.9441e-04
Validation binary_cross_entropy = 0.135667
Epoch 8014
Loss = 8.9325e-02, PNorm = 741.3460, GNorm = 1.6862, lr_0 = 9.9441e-04
Validation binary_cross_entropy = 0.162860
Epoch 8015
Loss = 8.1219e-02, PNorm = 741.4142, GNorm = 0.8824, lr_0 = 9.9441e-04
Validation binary_cross_entropy = 0.102880
Epoch 8016
Loss = 1.3071e-01, PNorm = 741.4814, GNorm = 5.8192, lr_0 = 9.9441e-04
Validation binary_cross_entropy = 0.114859
Epoch 8017
Loss = 1.9793e-02, PNorm = 741.5651, GNorm = 1.9113, lr_0 = 9.9440e-04
Validation binary_cross_entropy = 0.089503
Epoch 8018
Loss = 8.5401e-02, PNorm = 741.6450, GNorm = 0.4643, lr_0 = 9.9440e-04
Validation binary_cross_entropy = 0.121456
Epoch 8019
Loss = 8.2877e-02, PNorm = 741.7135, GNorm = 2.5745, lr_0 = 9.9440e-04
Loss = 7.7782e-01, PNorm = 741.7673, GNorm = 3.4117, lr_0 = 9.9440e-04
Validation binary_cross_entropy = 0.073676
Epoch 8020
Loss = 1.4325e-01, PNorm = 741.8665, GNorm = 6.2099, lr_0 = 9.9440e-04
Validation binary_cross_entropy = 0.216253
Epoch 8021
Loss = 1.9204e-01, PNorm = 741.9433, GNorm = 6.2805, lr_0 = 9.9440e-04
Validation binary_cross_entropy = 0.072071
Epoch 8022
Loss = 3.7059e-02, PNorm = 742.0154, GNorm = 4.0801, lr_0 = 9.9440e-04
Validation binary_cross_entropy = 0.092551
Epoch 8023
Loss = 2.4820e-02, PNorm = 742.0597, GNorm = 0.8963, lr_0 = 9.9440e-04
Validation binary_cross_entropy = 0.090271
Epoch 8024
Loss = 2.8686e-02, PNorm = 742.0894, GNorm = 0.0708, lr_0 = 9.9440e-04
Validation binary_cross_entropy = 0.122131
Epoch 8025
Loss = 5.9689e-02, PNorm = 742.1175, GNorm = 0.1823, lr_0 = 9.9440e-04
Validation binary_cross_entropy = 0.096773
Epoch 8026
Loss = 6.5394e-03, PNorm = 742.1624, GNorm = 0.2304, lr_0 = 9.9440e-04
Validation binary_cross_entropy = 0.090611
Epoch 8027
Loss = 2.1629e-02, PNorm = 742.2104, GNorm = 0.7260, lr_0 = 9.9440e-04
Validation binary_cross_entropy = 0.079376
Epoch 8028
Loss = 2.4669e-02, PNorm = 742.2415, GNorm = 1.4377, lr_0 = 9.9440e-04
Validation binary_cross_entropy = 0.084610
Epoch 8029
Loss = 2.7561e-02, PNorm = 742.2694, GNorm = 1.4866, lr_0 = 9.9440e-04
Loss = 2.7345e-02, PNorm = 742.2938, GNorm = 3.9814, lr_0 = 9.9439e-04
Validation binary_cross_entropy = 0.100183
Epoch 8030
Loss = 5.5759e-02, PNorm = 742.3221, GNorm = 0.0226, lr_0 = 9.9439e-04
Validation binary_cross_entropy = 0.116941
Epoch 8031
Loss = 1.2745e-02, PNorm = 742.3447, GNorm = 0.4213, lr_0 = 9.9439e-04
Validation binary_cross_entropy = 0.101598
Epoch 8032
Loss = 2.2726e-02, PNorm = 742.3717, GNorm = 0.8306, lr_0 = 9.9439e-04
Validation binary_cross_entropy = 0.118209
Epoch 8033
Loss = 3.1554e-02, PNorm = 742.3991, GNorm = 0.1481, lr_0 = 9.9439e-04
Validation binary_cross_entropy = 0.112389
Epoch 8034
Loss = 3.6582e-02, PNorm = 742.4222, GNorm = 1.2633, lr_0 = 9.9439e-04
Validation binary_cross_entropy = 0.109951
Epoch 8035
Loss = 3.8576e-02, PNorm = 742.4699, GNorm = 0.0438, lr_0 = 9.9439e-04
Validation binary_cross_entropy = 0.098573
Epoch 8036
Loss = 7.4757e-02, PNorm = 742.5279, GNorm = 3.8761, lr_0 = 9.9439e-04
Validation binary_cross_entropy = 0.114417
Epoch 8037
Loss = 1.8111e-01, PNorm = 742.5757, GNorm = 1.0320, lr_0 = 9.9439e-04
Validation binary_cross_entropy = 0.079653
Epoch 8038
Loss = 5.1867e-02, PNorm = 742.6108, GNorm = 1.1737, lr_0 = 9.9439e-04
Validation binary_cross_entropy = 0.080762
Epoch 8039
Loss = 1.7138e-01, PNorm = 742.6391, GNorm = 2.3802, lr_0 = 9.9439e-04
Loss = 3.6291e-02, PNorm = 742.6641, GNorm = 2.7066, lr_0 = 9.9439e-04
Validation binary_cross_entropy = 0.074532
Epoch 8040
Loss = 2.8519e-02, PNorm = 742.6927, GNorm = 0.5699, lr_0 = 9.9439e-04
Validation binary_cross_entropy = 0.074489
Epoch 8041
Loss = 5.0688e-02, PNorm = 742.7226, GNorm = 1.9771, lr_0 = 9.9439e-04
Validation binary_cross_entropy = 0.084570
Epoch 8042
Loss = 4.1605e-02, PNorm = 742.7477, GNorm = 1.3766, lr_0 = 9.9439e-04
Validation binary_cross_entropy = 0.085945
Epoch 8043
Loss = 2.1053e-02, PNorm = 742.7770, GNorm = 1.0377, lr_0 = 9.9438e-04
Validation binary_cross_entropy = 0.090719
Epoch 8044
Loss = 3.3803e-02, PNorm = 742.8024, GNorm = 2.5451, lr_0 = 9.9438e-04
Validation binary_cross_entropy = 0.089533
Epoch 8045
Loss = 2.7433e-02, PNorm = 742.8297, GNorm = 1.1874, lr_0 = 9.9438e-04
Validation binary_cross_entropy = 0.090324
Epoch 8046
Loss = 8.2569e-03, PNorm = 742.8598, GNorm = 0.1069, lr_0 = 9.9438e-04
Validation binary_cross_entropy = 0.091556
Epoch 8047
Loss = 5.0389e-03, PNorm = 742.8851, GNorm = 1.0074, lr_0 = 9.9438e-04
Validation binary_cross_entropy = 0.093111
Epoch 8048
Loss = 1.7190e-02, PNorm = 742.9113, GNorm = 2.7574, lr_0 = 9.9438e-04
Validation binary_cross_entropy = 0.099517
Epoch 8049
Loss = 1.9555e-01, PNorm = 742.9339, GNorm = 2.7641, lr_0 = 9.9438e-04
Loss = 8.0152e-02, PNorm = 742.9592, GNorm = 0.1275, lr_0 = 9.9438e-04
Validation binary_cross_entropy = 0.089297
Epoch 8050
Loss = 2.7124e-02, PNorm = 742.9911, GNorm = 0.6024, lr_0 = 9.9438e-04
Validation binary_cross_entropy = 0.091602
Epoch 8051
Loss = 1.3188e-02, PNorm = 743.0211, GNorm = 0.6008, lr_0 = 9.9438e-04
Validation binary_cross_entropy = 0.086502
Epoch 8052
Loss = 3.6364e-02, PNorm = 743.0399, GNorm = 1.0089, lr_0 = 9.9438e-04
Validation binary_cross_entropy = 0.077369
Epoch 8053
Loss = 3.0926e-02, PNorm = 743.0727, GNorm = 3.9753, lr_0 = 9.9438e-04
Validation binary_cross_entropy = 0.077738
Epoch 8054
Loss = 1.2223e-02, PNorm = 743.1264, GNorm = 0.8596, lr_0 = 9.9438e-04
Validation binary_cross_entropy = 0.088041
Epoch 8055
Loss = 1.2259e-02, PNorm = 743.1539, GNorm = 1.7451, lr_0 = 9.9438e-04
Validation binary_cross_entropy = 0.086877
Epoch 8056
Loss = 1.5320e-02, PNorm = 743.1715, GNorm = 0.0882, lr_0 = 9.9437e-04
Validation binary_cross_entropy = 0.093641
Epoch 8057
Loss = 5.0417e-02, PNorm = 743.1819, GNorm = 0.0662, lr_0 = 9.9437e-04
Validation binary_cross_entropy = 0.086242
Epoch 8058
Loss = 4.3642e-02, PNorm = 743.2118, GNorm = 0.0457, lr_0 = 9.9437e-04
Validation binary_cross_entropy = 0.085818
Epoch 8059
Loss = 6.8223e-02, PNorm = 743.2446, GNorm = 1.5006, lr_0 = 9.9437e-04
Loss = 3.0029e-02, PNorm = 743.2702, GNorm = 0.2314, lr_0 = 9.9437e-04
Validation binary_cross_entropy = 0.079582
Epoch 8060
Loss = 3.8961e-02, PNorm = 743.2904, GNorm = 1.1199, lr_0 = 9.9437e-04
Validation binary_cross_entropy = 0.075309
Epoch 8061
Loss = 2.8022e-02, PNorm = 743.3266, GNorm = 2.4185, lr_0 = 9.9437e-04
Validation binary_cross_entropy = 0.079449
Epoch 8062
Loss = 4.1000e-02, PNorm = 743.3771, GNorm = 0.1854, lr_0 = 9.9437e-04
Validation binary_cross_entropy = 0.085329
Epoch 8063
Loss = 1.6464e-02, PNorm = 743.4115, GNorm = 2.0833, lr_0 = 9.9437e-04
Validation binary_cross_entropy = 0.089783
Epoch 8064
Loss = 2.2322e-02, PNorm = 743.4315, GNorm = 1.6616, lr_0 = 9.9437e-04
Validation binary_cross_entropy = 0.088337
Epoch 8065
Loss = 6.7879e-03, PNorm = 743.4616, GNorm = 2.4240, lr_0 = 9.9437e-04
Validation binary_cross_entropy = 0.089949
Epoch 8066
Loss = 2.5767e-02, PNorm = 743.4833, GNorm = 0.3637, lr_0 = 9.9437e-04
Validation binary_cross_entropy = 0.074542
Epoch 8067
Loss = 1.8357e-02, PNorm = 743.5097, GNorm = 0.1377, lr_0 = 9.9437e-04
Validation binary_cross_entropy = 0.074621
Epoch 8068
Loss = 2.2109e-02, PNorm = 743.5491, GNorm = 1.5763, lr_0 = 9.9437e-04
Validation binary_cross_entropy = 0.070129
Epoch 8069
Loss = 4.4967e-03, PNorm = 743.5874, GNorm = 0.2233, lr_0 = 9.9437e-04
Loss = 2.9698e-02, PNorm = 743.6144, GNorm = 0.2738, lr_0 = 9.9436e-04
Validation binary_cross_entropy = 0.068443
Epoch 8070
Loss = 2.3499e-02, PNorm = 743.6455, GNorm = 0.3063, lr_0 = 9.9436e-04
Validation binary_cross_entropy = 0.071863
Epoch 8071
Loss = 4.6015e-02, PNorm = 743.6785, GNorm = 0.9827, lr_0 = 9.9436e-04
Validation binary_cross_entropy = 0.076927
Epoch 8072
Loss = 7.0088e-02, PNorm = 743.7014, GNorm = 0.2292, lr_0 = 9.9436e-04
Validation binary_cross_entropy = 0.078766
Epoch 8073
Loss = 4.5086e-02, PNorm = 743.7256, GNorm = 0.5154, lr_0 = 9.9436e-04
Validation binary_cross_entropy = 0.071135
Epoch 8074
Loss = 1.9136e-02, PNorm = 743.7656, GNorm = 1.1719, lr_0 = 9.9436e-04
Validation binary_cross_entropy = 0.074629
Epoch 8075
Loss = 1.6707e-02, PNorm = 743.8181, GNorm = 1.5421, lr_0 = 9.9436e-04
Validation binary_cross_entropy = 0.086401
Epoch 8076
Loss = 5.3369e-02, PNorm = 743.8578, GNorm = 0.5048, lr_0 = 9.9436e-04
Validation binary_cross_entropy = 0.087192
Epoch 8077
Loss = 2.3978e-02, PNorm = 743.8900, GNorm = 0.2145, lr_0 = 9.9436e-04
Validation binary_cross_entropy = 0.088118
Epoch 8078
Loss = 2.5637e-02, PNorm = 743.9353, GNorm = 3.5311, lr_0 = 9.9436e-04
Validation binary_cross_entropy = 0.090080
Epoch 8079
Loss = 2.4646e-02, PNorm = 743.9695, GNorm = 2.3409, lr_0 = 9.9436e-04
Loss = 2.3377e-02, PNorm = 744.0080, GNorm = 0.1177, lr_0 = 9.9436e-04
Validation binary_cross_entropy = 0.124387
Epoch 8080
Loss = 3.4673e-02, PNorm = 744.0559, GNorm = 0.4131, lr_0 = 9.9436e-04
Validation binary_cross_entropy = 0.116140
Epoch 8081
Loss = 7.0053e-02, PNorm = 744.0785, GNorm = 0.0741, lr_0 = 9.9436e-04
Validation binary_cross_entropy = 0.104541
Epoch 8082
Loss = 2.6854e-01, PNorm = 744.1261, GNorm = 4.9527, lr_0 = 9.9436e-04
Validation binary_cross_entropy = 0.096227
Epoch 8083
Loss = 1.1875e-01, PNorm = 744.2097, GNorm = 10.4262, lr_0 = 9.9435e-04
Validation binary_cross_entropy = 0.090708
Epoch 8084
Loss = 1.4301e-01, PNorm = 744.2846, GNorm = 2.0684, lr_0 = 9.9435e-04
Validation binary_cross_entropy = 0.142714
Epoch 8085
Loss = 1.4953e-01, PNorm = 744.3461, GNorm = 2.7741, lr_0 = 9.9435e-04
Validation binary_cross_entropy = 0.118063
Epoch 8086
Loss = 8.2901e-02, PNorm = 744.4170, GNorm = 6.7057, lr_0 = 9.9435e-04
Validation binary_cross_entropy = 0.084654
Epoch 8087
Loss = 6.0187e-02, PNorm = 744.4722, GNorm = 4.1133, lr_0 = 9.9435e-04
Validation binary_cross_entropy = 0.140036
Epoch 8088
Loss = 1.4387e-02, PNorm = 744.5512, GNorm = 3.3758, lr_0 = 9.9435e-04
Validation binary_cross_entropy = 0.113553
Epoch 8089
Loss = 4.7246e-03, PNorm = 744.5941, GNorm = 0.2010, lr_0 = 9.9435e-04
Loss = 3.6783e-02, PNorm = 744.6334, GNorm = 0.4415, lr_0 = 9.9435e-04
Validation binary_cross_entropy = 0.120217
Epoch 8090
Loss = 3.1776e-02, PNorm = 744.6690, GNorm = 0.0789, lr_0 = 9.9435e-04
Validation binary_cross_entropy = 0.120714
Epoch 8091
Loss = 5.6736e-02, PNorm = 744.7016, GNorm = 2.1485, lr_0 = 9.9435e-04
Validation binary_cross_entropy = 0.138852
Epoch 8092
Loss = 2.7082e-02, PNorm = 744.7283, GNorm = 1.2638, lr_0 = 9.9435e-04
Validation binary_cross_entropy = 0.104525
Epoch 8093
Loss = 2.7658e-02, PNorm = 744.7563, GNorm = 1.9461, lr_0 = 9.9435e-04
Validation binary_cross_entropy = 0.108428
Epoch 8094
Loss = 4.1760e-02, PNorm = 744.7948, GNorm = 0.3777, lr_0 = 9.9435e-04
Validation binary_cross_entropy = 0.124334
Epoch 8095
Loss = 7.8036e-03, PNorm = 744.8283, GNorm = 0.1574, lr_0 = 9.9435e-04
Validation binary_cross_entropy = 0.110911
Epoch 8096
Loss = 4.7157e-02, PNorm = 744.8544, GNorm = 2.9058, lr_0 = 9.9434e-04
Validation binary_cross_entropy = 0.103809
Epoch 8097
Loss = 1.2880e-02, PNorm = 744.8800, GNorm = 0.2488, lr_0 = 9.9434e-04
Validation binary_cross_entropy = 0.110674
Epoch 8098
Loss = 3.1500e-02, PNorm = 744.9039, GNorm = 2.5915, lr_0 = 9.9434e-04
Validation binary_cross_entropy = 0.109882
Epoch 8099
Loss = 2.1238e-03, PNorm = 744.9362, GNorm = 0.1140, lr_0 = 9.9434e-04
Loss = 1.7992e-02, PNorm = 744.9710, GNorm = 0.2743, lr_0 = 9.9434e-04
Validation binary_cross_entropy = 0.113513
Epoch 8100
Loss = 3.4930e-02, PNorm = 744.9933, GNorm = 0.8144, lr_0 = 9.9434e-04
Validation binary_cross_entropy = 0.103292
Epoch 8101
Loss = 4.4029e-02, PNorm = 745.0281, GNorm = 0.4754, lr_0 = 9.9434e-04
Validation binary_cross_entropy = 0.102010
Epoch 8102
Loss = 9.0843e-02, PNorm = 745.0642, GNorm = 0.1674, lr_0 = 9.9434e-04
Validation binary_cross_entropy = 0.142618
Epoch 8103
Loss = 6.9776e-02, PNorm = 745.0900, GNorm = 3.0952, lr_0 = 9.9434e-04
Validation binary_cross_entropy = 0.119243
Epoch 8104
Loss = 1.4270e-02, PNorm = 745.1195, GNorm = 1.9991, lr_0 = 9.9434e-04
Validation binary_cross_entropy = 0.104010
Epoch 8105
Loss = 2.1829e-02, PNorm = 745.1473, GNorm = 2.5366, lr_0 = 9.9434e-04
Validation binary_cross_entropy = 0.101611
Epoch 8106
Loss = 5.6235e-03, PNorm = 745.1813, GNorm = 0.1437, lr_0 = 9.9434e-04
Validation binary_cross_entropy = 0.106702
Epoch 8107
Loss = 2.6122e-02, PNorm = 745.2117, GNorm = 2.2257, lr_0 = 9.9434e-04
Validation binary_cross_entropy = 0.108163
Epoch 8108
Loss = 1.5194e-02, PNorm = 745.2382, GNorm = 0.2353, lr_0 = 9.9434e-04
Validation binary_cross_entropy = 0.111359
Epoch 8109
Loss = 1.2529e-03, PNorm = 745.2634, GNorm = 0.0886, lr_0 = 9.9434e-04
Loss = 3.7825e-02, PNorm = 745.2836, GNorm = 0.0657, lr_0 = 9.9433e-04
Validation binary_cross_entropy = 0.105972
Epoch 8110
Loss = 9.2310e-02, PNorm = 745.3227, GNorm = 8.2924, lr_0 = 9.9433e-04
Validation binary_cross_entropy = 0.102136
Epoch 8111
Loss = 6.2187e-02, PNorm = 745.3789, GNorm = 1.0571, lr_0 = 9.9433e-04
Validation binary_cross_entropy = 0.113133
Epoch 8112
Loss = 3.8465e-02, PNorm = 745.4184, GNorm = 1.3403, lr_0 = 9.9433e-04
Validation binary_cross_entropy = 0.099353
Epoch 8113
Loss = 3.7220e-02, PNorm = 745.4690, GNorm = 1.1106, lr_0 = 9.9433e-04
Validation binary_cross_entropy = 0.096858
Epoch 8114
Loss = 1.0224e-01, PNorm = 745.5278, GNorm = 0.2683, lr_0 = 9.9433e-04
Validation binary_cross_entropy = 0.089115
Epoch 8115
Loss = 2.5310e-02, PNorm = 745.5811, GNorm = 4.9651, lr_0 = 9.9433e-04
Validation binary_cross_entropy = 0.090689
Epoch 8116
Loss = 4.4840e-02, PNorm = 745.6195, GNorm = 0.7368, lr_0 = 9.9433e-04
Validation binary_cross_entropy = 0.085027
Epoch 8117
Loss = 1.2889e-02, PNorm = 745.6534, GNorm = 0.0632, lr_0 = 9.9433e-04
Validation binary_cross_entropy = 0.086792
Epoch 8118
Loss = 9.0377e-02, PNorm = 745.6931, GNorm = 10.3613, lr_0 = 9.9433e-04
Validation binary_cross_entropy = 0.082137
Epoch 8119
Loss = 5.7396e-03, PNorm = 745.7489, GNorm = 0.2078, lr_0 = 9.9433e-04
Loss = 2.9789e-02, PNorm = 745.7965, GNorm = 0.1411, lr_0 = 9.9433e-04
Validation binary_cross_entropy = 0.087958
Epoch 8120
Loss = 5.9645e-03, PNorm = 745.8304, GNorm = 0.6482, lr_0 = 9.9433e-04
Validation binary_cross_entropy = 0.096714
Epoch 8121
Loss = 1.9299e-02, PNorm = 745.8495, GNorm = 0.5069, lr_0 = 9.9433e-04
Validation binary_cross_entropy = 0.098086
Epoch 8122
Loss = 7.9334e-03, PNorm = 745.8623, GNorm = 0.0266, lr_0 = 9.9432e-04
Validation binary_cross_entropy = 0.098349
Epoch 8123
Loss = 6.5408e-02, PNorm = 745.8825, GNorm = 5.0991, lr_0 = 9.9432e-04
Validation binary_cross_entropy = 0.095431
Epoch 8124
Loss = 2.5361e-02, PNorm = 745.9096, GNorm = 1.3509, lr_0 = 9.9432e-04
Validation binary_cross_entropy = 0.099122
Epoch 8125
Loss = 4.6176e-03, PNorm = 745.9359, GNorm = 0.1418, lr_0 = 9.9432e-04
Validation binary_cross_entropy = 0.096044
Epoch 8126
Loss = 1.1504e-02, PNorm = 745.9525, GNorm = 0.0892, lr_0 = 9.9432e-04
Validation binary_cross_entropy = 0.096933
Epoch 8127
Loss = 2.3745e-02, PNorm = 745.9740, GNorm = 0.3852, lr_0 = 9.9432e-04
Validation binary_cross_entropy = 0.096798
Epoch 8128
Loss = 1.4632e-02, PNorm = 746.0062, GNorm = 1.7187, lr_0 = 9.9432e-04
Validation binary_cross_entropy = 0.118080
Epoch 8129
Loss = 7.3501e-02, PNorm = 746.0470, GNorm = 6.2635, lr_0 = 9.9432e-04
Loss = 3.6724e-02, PNorm = 746.0882, GNorm = 0.0989, lr_0 = 9.9432e-04
Validation binary_cross_entropy = 0.092516
Epoch 8130
Loss = 6.1854e-02, PNorm = 746.1314, GNorm = 2.5432, lr_0 = 9.9432e-04
Validation binary_cross_entropy = 0.097616
Epoch 8131
Loss = 4.3310e-02, PNorm = 746.1687, GNorm = 2.0932, lr_0 = 9.9432e-04
Validation binary_cross_entropy = 0.097346
Epoch 8132
Loss = 2.6213e-02, PNorm = 746.1984, GNorm = 0.2967, lr_0 = 9.9432e-04
Validation binary_cross_entropy = 0.095991
Epoch 8133
Loss = 2.9671e-02, PNorm = 746.2305, GNorm = 0.1544, lr_0 = 9.9432e-04
Validation binary_cross_entropy = 0.101540
Epoch 8134
Loss = 2.9929e-02, PNorm = 746.2617, GNorm = 5.0711, lr_0 = 9.9432e-04
Validation binary_cross_entropy = 0.099921
Epoch 8135
Loss = 2.4528e-02, PNorm = 746.2831, GNorm = 3.4659, lr_0 = 9.9432e-04
Validation binary_cross_entropy = 0.112703
Epoch 8136
Loss = 1.1007e-01, PNorm = 746.3353, GNorm = 0.8456, lr_0 = 9.9431e-04
Validation binary_cross_entropy = 0.112699
Epoch 8137
Loss = 7.8193e-02, PNorm = 746.3866, GNorm = 0.6523, lr_0 = 9.9431e-04
Validation binary_cross_entropy = 0.113052
Epoch 8138
Loss = 7.4776e-03, PNorm = 746.4180, GNorm = 0.0339, lr_0 = 9.9431e-04
Validation binary_cross_entropy = 0.101435
Epoch 8139
Loss = 3.5621e-03, PNorm = 746.4388, GNorm = 0.5028, lr_0 = 9.9431e-04
Loss = 3.1874e-01, PNorm = 746.5740, GNorm = 11.6287, lr_0 = 9.9431e-04
Validation binary_cross_entropy = 0.154184
Epoch 8140
Loss = 1.7466e-01, PNorm = 746.7174, GNorm = 2.0914, lr_0 = 9.9431e-04
Validation binary_cross_entropy = 0.091166
Epoch 8141
Loss = 1.7998e-01, PNorm = 746.8351, GNorm = 2.7996, lr_0 = 9.9431e-04
Validation binary_cross_entropy = 0.179521
Epoch 8142
Loss = 2.6676e-01, PNorm = 746.9193, GNorm = 25.7680, lr_0 = 9.9431e-04
Validation binary_cross_entropy = 0.144156
Epoch 8143
Loss = 1.9771e-01, PNorm = 747.0064, GNorm = 2.1827, lr_0 = 9.9431e-04
Validation binary_cross_entropy = 0.085585
Epoch 8144
Loss = 1.2786e-01, PNorm = 747.0848, GNorm = 5.1357, lr_0 = 9.9431e-04
Validation binary_cross_entropy = 0.149668
Epoch 8145
Loss = 1.5064e-01, PNorm = 747.1491, GNorm = 1.9674, lr_0 = 9.9431e-04
Validation binary_cross_entropy = 0.087832
Epoch 8146
Loss = 1.0476e-01, PNorm = 747.2107, GNorm = 1.3778, lr_0 = 9.9431e-04
Validation binary_cross_entropy = 0.108495
Epoch 8147
Loss = 6.1134e-02, PNorm = 747.2758, GNorm = 2.2300, lr_0 = 9.9431e-04
Validation binary_cross_entropy = 0.107259
Epoch 8148
Loss = 2.0445e-02, PNorm = 747.3312, GNorm = 0.2519, lr_0 = 9.9431e-04
Validation binary_cross_entropy = 0.090666
Epoch 8149
Loss = 9.2268e-03, PNorm = 747.3795, GNorm = 0.3993, lr_0 = 9.9431e-04
Loss = 4.4883e-02, PNorm = 747.4251, GNorm = 1.0818, lr_0 = 9.9430e-04
Validation binary_cross_entropy = 0.106900
Epoch 8150
Loss = 4.3332e-02, PNorm = 747.4634, GNorm = 1.0172, lr_0 = 9.9430e-04
Validation binary_cross_entropy = 0.090882
Epoch 8151
Loss = 7.8091e-02, PNorm = 747.5120, GNorm = 4.2775, lr_0 = 9.9430e-04
Validation binary_cross_entropy = 0.095995
Epoch 8152
Loss = 1.0976e-01, PNorm = 747.5608, GNorm = 0.2259, lr_0 = 9.9430e-04
Validation binary_cross_entropy = 0.099663
Epoch 8153
Loss = 8.3908e-02, PNorm = 747.6133, GNorm = 1.0677, lr_0 = 9.9430e-04
Validation binary_cross_entropy = 0.125011
Epoch 8154
Loss = 4.6887e-02, PNorm = 747.6519, GNorm = 1.8702, lr_0 = 9.9430e-04
Validation binary_cross_entropy = 0.100875
Epoch 8155
Loss = 5.1289e-02, PNorm = 747.6871, GNorm = 3.5775, lr_0 = 9.9430e-04
Validation binary_cross_entropy = 0.090282
Epoch 8156
Loss = 1.1987e-01, PNorm = 747.7222, GNorm = 4.0287, lr_0 = 9.9430e-04
Validation binary_cross_entropy = 0.086514
Epoch 8157
Loss = 4.7540e-02, PNorm = 747.7825, GNorm = 2.3213, lr_0 = 9.9430e-04
Validation binary_cross_entropy = 0.085979
Epoch 8158
Loss = 3.6726e-02, PNorm = 747.8303, GNorm = 1.1792, lr_0 = 9.9430e-04
Validation binary_cross_entropy = 0.101909
Epoch 8159
Loss = 5.5233e-02, PNorm = 747.8778, GNorm = 1.7840, lr_0 = 9.9430e-04
Loss = 2.0003e-02, PNorm = 747.9116, GNorm = 0.4027, lr_0 = 9.9430e-04
Validation binary_cross_entropy = 0.134820
Epoch 8160
Loss = 6.6610e-02, PNorm = 747.9330, GNorm = 0.2481, lr_0 = 9.9430e-04
Validation binary_cross_entropy = 0.099035
Epoch 8161
Loss = 6.3967e-02, PNorm = 747.9897, GNorm = 0.1065, lr_0 = 9.9430e-04
Validation binary_cross_entropy = 0.108315
Epoch 8162
Loss = 4.5909e-02, PNorm = 748.0443, GNorm = 6.9350, lr_0 = 9.9429e-04
Validation binary_cross_entropy = 0.094533
Epoch 8163
Loss = 4.2221e-02, PNorm = 748.0952, GNorm = 0.7361, lr_0 = 9.9429e-04
Validation binary_cross_entropy = 0.091995
Epoch 8164
Loss = 4.8750e-02, PNorm = 748.1435, GNorm = 0.3045, lr_0 = 9.9429e-04
Validation binary_cross_entropy = 0.100728
Epoch 8165
Loss = 3.1268e-02, PNorm = 748.1887, GNorm = 4.2593, lr_0 = 9.9429e-04
Validation binary_cross_entropy = 0.092129
Epoch 8166
Loss = 2.0746e-02, PNorm = 748.2256, GNorm = 1.7026, lr_0 = 9.9429e-04
Validation binary_cross_entropy = 0.094905
Epoch 8167
Loss = 9.0308e-03, PNorm = 748.2675, GNorm = 1.2685, lr_0 = 9.9429e-04
Validation binary_cross_entropy = 0.099267
Epoch 8168
Loss = 1.2847e-01, PNorm = 748.3038, GNorm = 2.6935, lr_0 = 9.9429e-04
Validation binary_cross_entropy = 0.099500
Epoch 8169
Loss = 2.1086e-03, PNorm = 748.3368, GNorm = 0.3175, lr_0 = 9.9429e-04
Loss = 2.8574e-02, PNorm = 748.3695, GNorm = 6.1437, lr_0 = 9.9429e-04
Validation binary_cross_entropy = 0.109872
Epoch 8170
Loss = 4.7276e-02, PNorm = 748.3961, GNorm = 0.3395, lr_0 = 9.9429e-04
Validation binary_cross_entropy = 0.110828
Epoch 8171
Loss = 4.3322e-02, PNorm = 748.4250, GNorm = 1.3125, lr_0 = 9.9429e-04
Validation binary_cross_entropy = 0.109889
Epoch 8172
Loss = 1.3072e-02, PNorm = 748.4492, GNorm = 2.0681, lr_0 = 9.9429e-04
Validation binary_cross_entropy = 0.103531
Epoch 8173
Loss = 4.5758e-02, PNorm = 748.4846, GNorm = 3.7388, lr_0 = 9.9429e-04
Validation binary_cross_entropy = 0.124956
Epoch 8174
Loss = 4.1271e-02, PNorm = 748.5241, GNorm = 3.0767, lr_0 = 9.9429e-04
Validation binary_cross_entropy = 0.128072
Epoch 8175
Loss = 2.6402e-02, PNorm = 748.5480, GNorm = 2.7345, lr_0 = 9.9429e-04
Validation binary_cross_entropy = 0.110001
Epoch 8176
Loss = 1.6860e-02, PNorm = 748.5794, GNorm = 0.8117, lr_0 = 9.9428e-04
Validation binary_cross_entropy = 0.113693
Epoch 8177
Loss = 4.6093e-03, PNorm = 748.6249, GNorm = 0.4194, lr_0 = 9.9428e-04
Validation binary_cross_entropy = 0.133374
Epoch 8178
Loss = 1.6150e-02, PNorm = 748.6735, GNorm = 3.0088, lr_0 = 9.9428e-04
Validation binary_cross_entropy = 0.138376
Epoch 8179
Loss = 9.2955e-02, PNorm = 748.7194, GNorm = 3.4426, lr_0 = 9.9428e-04
Loss = 3.8825e-02, PNorm = 748.8020, GNorm = 1.5763, lr_0 = 9.9428e-04
Validation binary_cross_entropy = 0.094818
Epoch 8180
Loss = 1.7073e-02, PNorm = 748.8734, GNorm = 0.3981, lr_0 = 9.9428e-04
Validation binary_cross_entropy = 0.110363
Epoch 8181
Loss = 5.6329e-02, PNorm = 748.9212, GNorm = 0.7558, lr_0 = 9.9428e-04
Validation binary_cross_entropy = 0.104378
Epoch 8182
Loss = 8.9142e-02, PNorm = 748.9816, GNorm = 8.9976, lr_0 = 9.9428e-04
Validation binary_cross_entropy = 0.122589
Epoch 8183
Loss = 5.6803e-02, PNorm = 749.0512, GNorm = 3.3998, lr_0 = 9.9428e-04
Validation binary_cross_entropy = 0.127337
Epoch 8184
Loss = 2.9468e-02, PNorm = 749.1026, GNorm = 1.2808, lr_0 = 9.9428e-04
Validation binary_cross_entropy = 0.115320
Epoch 8185
Loss = 8.8938e-02, PNorm = 749.1615, GNorm = 0.4443, lr_0 = 9.9428e-04
Validation binary_cross_entropy = 0.131655
Epoch 8186
Loss = 2.4347e-02, PNorm = 749.2094, GNorm = 0.5597, lr_0 = 9.9428e-04
Validation binary_cross_entropy = 0.120327
Epoch 8187
Loss = 2.2747e-02, PNorm = 749.2377, GNorm = 1.3373, lr_0 = 9.9428e-04
Validation binary_cross_entropy = 0.123372
Epoch 8188
Loss = 1.5237e-02, PNorm = 749.2769, GNorm = 0.2738, lr_0 = 9.9428e-04
Validation binary_cross_entropy = 0.115797
Epoch 8189
Loss = 1.3350e-02, PNorm = 749.3070, GNorm = 1.3767, lr_0 = 9.9427e-04
Loss = 2.3775e-02, PNorm = 749.3336, GNorm = 0.1192, lr_0 = 9.9427e-04
Validation binary_cross_entropy = 0.098141
Epoch 8190
Loss = 2.4685e-02, PNorm = 749.3716, GNorm = 0.2077, lr_0 = 9.9427e-04
Validation binary_cross_entropy = 0.105448
Epoch 8191
Loss = 8.8926e-02, PNorm = 749.3980, GNorm = 3.0265, lr_0 = 9.9427e-04
Validation binary_cross_entropy = 0.091146
Epoch 8192
Loss = 3.5892e-02, PNorm = 749.4396, GNorm = 0.2689, lr_0 = 9.9427e-04
Validation binary_cross_entropy = 0.105825
Epoch 8193
Loss = 6.1445e-02, PNorm = 749.4831, GNorm = 1.3183, lr_0 = 9.9427e-04
Validation binary_cross_entropy = 0.099182
Epoch 8194
Loss = 9.7331e-02, PNorm = 749.5382, GNorm = 0.1058, lr_0 = 9.9427e-04
Validation binary_cross_entropy = 0.096717
Epoch 8195
Loss = 7.2617e-03, PNorm = 749.6269, GNorm = 0.2332, lr_0 = 9.9427e-04
Validation binary_cross_entropy = 0.123946
Epoch 8196
Loss = 1.5710e-02, PNorm = 749.6808, GNorm = 1.2932, lr_0 = 9.9427e-04
Validation binary_cross_entropy = 0.104351
Epoch 8197
Loss = 1.8829e-02, PNorm = 749.7048, GNorm = 0.5784, lr_0 = 9.9427e-04
Validation binary_cross_entropy = 0.098616
Epoch 8198
Loss = 9.5961e-03, PNorm = 749.7314, GNorm = 0.7970, lr_0 = 9.9427e-04
Validation binary_cross_entropy = 0.103927
Epoch 8199
Loss = 3.2295e-03, PNorm = 749.7587, GNorm = 0.2714, lr_0 = 9.9427e-04
Loss = 1.6831e-02, PNorm = 749.7898, GNorm = 0.1297, lr_0 = 9.9427e-04
Validation binary_cross_entropy = 0.109849
Epoch 8200
Loss = 4.0485e-02, PNorm = 749.8190, GNorm = 0.0865, lr_0 = 9.9427e-04
Validation binary_cross_entropy = 0.119097
Epoch 8201
Loss = 5.5870e-02, PNorm = 749.8286, GNorm = 0.3009, lr_0 = 9.9427e-04
Validation binary_cross_entropy = 0.104329
Epoch 8202
Loss = 2.9860e-02, PNorm = 749.8459, GNorm = 0.1764, lr_0 = 9.9426e-04
Validation binary_cross_entropy = 0.098026
Epoch 8203
Loss = 4.1262e-02, PNorm = 749.8682, GNorm = 1.0485, lr_0 = 9.9426e-04
Validation binary_cross_entropy = 0.101696
Epoch 8204
Loss = 3.7234e-02, PNorm = 749.8991, GNorm = 0.4574, lr_0 = 9.9426e-04
Validation binary_cross_entropy = 0.102979
Epoch 8205
Loss = 1.5940e-02, PNorm = 749.9331, GNorm = 0.0930, lr_0 = 9.9426e-04
Validation binary_cross_entropy = 0.121685
Epoch 8206
Loss = 1.0875e-02, PNorm = 749.9721, GNorm = 0.1166, lr_0 = 9.9426e-04
Validation binary_cross_entropy = 0.159068
Epoch 8207
Loss = 1.3212e-02, PNorm = 749.9979, GNorm = 0.0547, lr_0 = 9.9426e-04
Validation binary_cross_entropy = 0.146094
Epoch 8208
Loss = 3.0519e-03, PNorm = 750.0127, GNorm = 0.0736, lr_0 = 9.9426e-04
Validation binary_cross_entropy = 0.113370
Epoch 8209
Loss = 1.0012e-02, PNorm = 750.0267, GNorm = 0.5160, lr_0 = 9.9426e-04
Loss = 4.5743e-02, PNorm = 750.0586, GNorm = 0.6120, lr_0 = 9.9426e-04
Validation binary_cross_entropy = 0.099640
Epoch 8210
Loss = 2.5692e-02, PNorm = 750.1001, GNorm = 1.1734, lr_0 = 9.9426e-04
Validation binary_cross_entropy = 0.097073
Epoch 8211
Loss = 1.1185e-02, PNorm = 750.1344, GNorm = 0.0732, lr_0 = 9.9426e-04
Validation binary_cross_entropy = 0.080845
Epoch 8212
Loss = 1.0569e-01, PNorm = 750.2178, GNorm = 13.8509, lr_0 = 9.9426e-04
Validation binary_cross_entropy = 0.131369
Epoch 8213
Loss = 1.2410e-01, PNorm = 750.3086, GNorm = 4.9003, lr_0 = 9.9426e-04
Validation binary_cross_entropy = 0.093699
Epoch 8214
Loss = 1.2069e-01, PNorm = 750.3680, GNorm = 8.2871, lr_0 = 9.9426e-04
Validation binary_cross_entropy = 0.090923
Epoch 8215
Loss = 5.5632e-02, PNorm = 750.4217, GNorm = 0.9679, lr_0 = 9.9426e-04
Validation binary_cross_entropy = 0.077301
Epoch 8216
Loss = 1.8667e-02, PNorm = 750.4666, GNorm = 1.4478, lr_0 = 9.9425e-04
Validation binary_cross_entropy = 0.077357
Epoch 8217
Loss = 2.4991e-02, PNorm = 750.5068, GNorm = 0.5773, lr_0 = 9.9425e-04
Validation binary_cross_entropy = 0.094296
Epoch 8218
Loss = 5.9707e-02, PNorm = 750.5423, GNorm = 5.4782, lr_0 = 9.9425e-04
Validation binary_cross_entropy = 0.082399
Epoch 8219
Loss = 1.3484e-02, PNorm = 750.5673, GNorm = 0.8953, lr_0 = 9.9425e-04
Loss = 1.3798e-02, PNorm = 750.6002, GNorm = 0.1466, lr_0 = 9.9425e-04
Validation binary_cross_entropy = 0.091101
Epoch 8220
Loss = 2.6452e-02, PNorm = 750.6303, GNorm = 3.0675, lr_0 = 9.9425e-04
Validation binary_cross_entropy = 0.108480
Epoch 8221
Loss = 4.7699e-02, PNorm = 750.6543, GNorm = 2.1541, lr_0 = 9.9425e-04
Validation binary_cross_entropy = 0.085865
Epoch 8222
Loss = 2.9749e-02, PNorm = 750.6948, GNorm = 0.9656, lr_0 = 9.9425e-04
Validation binary_cross_entropy = 0.091980
Epoch 8223
Loss = 2.6118e-02, PNorm = 750.7309, GNorm = 1.7048, lr_0 = 9.9425e-04
Validation binary_cross_entropy = 0.091939
Epoch 8224
Loss = 2.7558e-02, PNorm = 750.7577, GNorm = 1.4944, lr_0 = 9.9425e-04
Validation binary_cross_entropy = 0.088323
Epoch 8225
Loss = 6.1351e-02, PNorm = 750.7887, GNorm = 0.1743, lr_0 = 9.9425e-04
Validation binary_cross_entropy = 0.090481
Epoch 8226
Loss = 8.6003e-02, PNorm = 750.8347, GNorm = 0.1549, lr_0 = 9.9425e-04
Validation binary_cross_entropy = 0.091418
Epoch 8227
Loss = 9.2131e-03, PNorm = 750.8742, GNorm = 0.4616, lr_0 = 9.9425e-04
Validation binary_cross_entropy = 0.101748
Epoch 8228
Loss = 3.8036e-03, PNorm = 750.9127, GNorm = 0.1926, lr_0 = 9.9425e-04
Validation binary_cross_entropy = 0.115924
Epoch 8229
Loss = 2.8648e-04, PNorm = 750.9418, GNorm = 0.0237, lr_0 = 9.9424e-04
Loss = 4.7024e-02, PNorm = 750.9564, GNorm = 0.3449, lr_0 = 9.9424e-04
Validation binary_cross_entropy = 0.092424
Epoch 8230
Loss = 3.2016e-02, PNorm = 750.9807, GNorm = 3.9864, lr_0 = 9.9424e-04
Validation binary_cross_entropy = 0.090007
Epoch 8231
Loss = 3.0492e-02, PNorm = 751.0068, GNorm = 0.1486, lr_0 = 9.9424e-04
Validation binary_cross_entropy = 0.095269
Epoch 8232
Loss = 1.3477e-02, PNorm = 751.0290, GNorm = 0.2481, lr_0 = 9.9424e-04
Validation binary_cross_entropy = 0.131952
Epoch 8233
Loss = 1.9258e-02, PNorm = 751.0603, GNorm = 0.1125, lr_0 = 9.9424e-04
Validation binary_cross_entropy = 0.113189
Epoch 8234
Loss = 2.5140e-02, PNorm = 751.0943, GNorm = 0.9743, lr_0 = 9.9424e-04
Validation binary_cross_entropy = 0.119365
Epoch 8235
Loss = 8.6295e-02, PNorm = 751.1182, GNorm = 0.8806, lr_0 = 9.9424e-04
Validation binary_cross_entropy = 0.076416
Epoch 8236
Loss = 7.5774e-02, PNorm = 751.1484, GNorm = 1.8108, lr_0 = 9.9424e-04
Validation binary_cross_entropy = 0.088187
Epoch 8237
Loss = 3.3507e-02, PNorm = 751.1972, GNorm = 1.2686, lr_0 = 9.9424e-04
Validation binary_cross_entropy = 0.104111
Epoch 8238
Loss = 2.6166e-02, PNorm = 751.2390, GNorm = 1.2841, lr_0 = 9.9424e-04
Validation binary_cross_entropy = 0.096556
Epoch 8239
Loss = 9.0724e-03, PNorm = 751.2663, GNorm = 0.7709, lr_0 = 9.9424e-04
Loss = 4.0307e-02, PNorm = 751.2884, GNorm = 1.8381, lr_0 = 9.9424e-04
Validation binary_cross_entropy = 0.089543
Epoch 8240
Loss = 1.7323e-02, PNorm = 751.3226, GNorm = 0.1840, lr_0 = 9.9424e-04
Validation binary_cross_entropy = 0.092427
Epoch 8241
Loss = 1.6605e-02, PNorm = 751.3571, GNorm = 0.2791, lr_0 = 9.9424e-04
Validation binary_cross_entropy = 0.097501
Epoch 8242
Loss = 1.7600e-02, PNorm = 751.3803, GNorm = 1.1361, lr_0 = 9.9423e-04
Validation binary_cross_entropy = 0.100811
Epoch 8243
Loss = 6.9641e-02, PNorm = 751.4001, GNorm = 0.3783, lr_0 = 9.9423e-04
Validation binary_cross_entropy = 0.103033
Epoch 8244
Loss = 2.5922e-02, PNorm = 751.4212, GNorm = 0.7358, lr_0 = 9.9423e-04
Validation binary_cross_entropy = 0.098485
Epoch 8245
Loss = 4.5557e-03, PNorm = 751.4561, GNorm = 0.2416, lr_0 = 9.9423e-04
Validation binary_cross_entropy = 0.093832
Epoch 8246
Loss = 8.2608e-02, PNorm = 751.5168, GNorm = 0.7221, lr_0 = 9.9423e-04
Validation binary_cross_entropy = 0.100935
Epoch 8247
Loss = 1.4511e-02, PNorm = 751.5624, GNorm = 1.0014, lr_0 = 9.9423e-04
Validation binary_cross_entropy = 0.096775
Epoch 8248
Loss = 9.7103e-03, PNorm = 751.5866, GNorm = 0.8888, lr_0 = 9.9423e-04
Validation binary_cross_entropy = 0.082533
Epoch 8249
Loss = 7.3910e-04, PNorm = 751.6008, GNorm = 0.0772, lr_0 = 9.9423e-04
Loss = 3.9720e-02, PNorm = 751.6279, GNorm = 2.2427, lr_0 = 9.9423e-04
Validation binary_cross_entropy = 0.091064
Epoch 8250
Loss = 3.2814e-02, PNorm = 751.6637, GNorm = 0.1816, lr_0 = 9.9423e-04
Validation binary_cross_entropy = 0.098561
Epoch 8251
Loss = 4.3510e-02, PNorm = 751.6916, GNorm = 1.2736, lr_0 = 9.9423e-04
Validation binary_cross_entropy = 0.085629
Epoch 8252
Loss = 3.3059e-02, PNorm = 751.7229, GNorm = 1.4385, lr_0 = 9.9423e-04
Validation binary_cross_entropy = 0.084160
Epoch 8253
Loss = 1.3174e-02, PNorm = 751.7704, GNorm = 0.1331, lr_0 = 9.9423e-04
Validation binary_cross_entropy = 0.093631
Epoch 8254
Loss = 2.2156e-02, PNorm = 751.8095, GNorm = 0.8715, lr_0 = 9.9423e-04
Validation binary_cross_entropy = 0.088604
Epoch 8255
Loss = 1.2422e-02, PNorm = 751.8361, GNorm = 0.2710, lr_0 = 9.9422e-04
Validation binary_cross_entropy = 0.087093
Epoch 8256
Loss = 9.8713e-03, PNorm = 751.8665, GNorm = 0.1040, lr_0 = 9.9422e-04
Validation binary_cross_entropy = 0.090318
Epoch 8257
Loss = 4.4228e-02, PNorm = 751.8904, GNorm = 0.1383, lr_0 = 9.9422e-04
Validation binary_cross_entropy = 0.083937
Epoch 8258
Loss = 3.1289e-02, PNorm = 751.9217, GNorm = 2.9781, lr_0 = 9.9422e-04
Validation binary_cross_entropy = 0.088165
Epoch 8259
Loss = 2.2865e-02, PNorm = 751.9648, GNorm = 1.0458, lr_0 = 9.9422e-04
Loss = 1.1203e-02, PNorm = 752.0010, GNorm = 0.1165, lr_0 = 9.9422e-04
Validation binary_cross_entropy = 0.101763
Epoch 8260
Loss = 8.9375e-02, PNorm = 752.0255, GNorm = 28.8651, lr_0 = 9.9422e-04
Validation binary_cross_entropy = 0.111839
Epoch 8261
Loss = 2.8023e-02, PNorm = 752.0620, GNorm = 0.0736, lr_0 = 9.9422e-04
Validation binary_cross_entropy = 0.093515
Epoch 8262
Loss = 5.3207e-02, PNorm = 752.1049, GNorm = 1.6364, lr_0 = 9.9422e-04
Validation binary_cross_entropy = 0.104300
Epoch 8263
Loss = 3.1539e-02, PNorm = 752.1471, GNorm = 6.1683, lr_0 = 9.9422e-04
Validation binary_cross_entropy = 0.081656
Epoch 8264
Loss = 2.0359e-02, PNorm = 752.1826, GNorm = 0.1886, lr_0 = 9.9422e-04
Validation binary_cross_entropy = 0.077156
Epoch 8265
Loss = 1.1761e-02, PNorm = 752.2235, GNorm = 0.5685, lr_0 = 9.9422e-04
Validation binary_cross_entropy = 0.079989
Epoch 8266
Loss = 5.3490e-03, PNorm = 752.2593, GNorm = 0.2042, lr_0 = 9.9422e-04
Validation binary_cross_entropy = 0.075244
Epoch 8267
Loss = 4.0327e-02, PNorm = 752.2801, GNorm = 0.4515, lr_0 = 9.9422e-04
Validation binary_cross_entropy = 0.093200
Epoch 8268
Loss = 1.8655e-02, PNorm = 752.3341, GNorm = 0.7726, lr_0 = 9.9422e-04
Validation binary_cross_entropy = 0.122579
Epoch 8269
Loss = 2.6379e-02, PNorm = 752.3668, GNorm = 2.6392, lr_0 = 9.9421e-04
Loss = 5.4998e-02, PNorm = 752.3885, GNorm = 0.3014, lr_0 = 9.9421e-04
Validation binary_cross_entropy = 0.111517
Epoch 8270
Loss = 1.2836e-02, PNorm = 752.4214, GNorm = 1.4930, lr_0 = 9.9421e-04
Validation binary_cross_entropy = 0.099575
Epoch 8271
Loss = 1.8143e-02, PNorm = 752.4527, GNorm = 2.4856, lr_0 = 9.9421e-04
Validation binary_cross_entropy = 0.112212
Epoch 8272
Loss = 3.7056e-02, PNorm = 752.4779, GNorm = 0.0881, lr_0 = 9.9421e-04
Validation binary_cross_entropy = 0.090622
Epoch 8273
Loss = 6.6007e-02, PNorm = 752.5255, GNorm = 4.6806, lr_0 = 9.9421e-04
Validation binary_cross_entropy = 0.087687
Epoch 8274
Loss = 3.9505e-02, PNorm = 752.5865, GNorm = 0.5161, lr_0 = 9.9421e-04
Validation binary_cross_entropy = 0.091371
Epoch 8275
Loss = 1.4777e-02, PNorm = 752.6348, GNorm = 0.2400, lr_0 = 9.9421e-04
Validation binary_cross_entropy = 0.104329
Epoch 8276
Loss = 3.9264e-02, PNorm = 752.6763, GNorm = 1.4756, lr_0 = 9.9421e-04
Validation binary_cross_entropy = 0.121268
Epoch 8277
Loss = 7.2474e-02, PNorm = 752.7204, GNorm = 0.2233, lr_0 = 9.9421e-04
Validation binary_cross_entropy = 0.094025
Epoch 8278
Loss = 1.0956e-01, PNorm = 752.7533, GNorm = 0.3852, lr_0 = 9.9421e-04
Validation binary_cross_entropy = 0.087248
Epoch 8279
Loss = 7.2199e-02, PNorm = 752.7966, GNorm = 1.4411, lr_0 = 9.9421e-04
Loss = 2.4538e-02, PNorm = 752.8474, GNorm = 0.1424, lr_0 = 9.9421e-04
Validation binary_cross_entropy = 0.103729
Epoch 8280
Loss = 1.2599e-02, PNorm = 752.8928, GNorm = 0.7932, lr_0 = 9.9421e-04
Validation binary_cross_entropy = 0.154588
Epoch 8281
Loss = 1.3575e-02, PNorm = 752.9288, GNorm = 0.4587, lr_0 = 9.9421e-04
Validation binary_cross_entropy = 0.180651
Epoch 8282
Loss = 7.7418e-02, PNorm = 752.9625, GNorm = 0.0431, lr_0 = 9.9420e-04
Validation binary_cross_entropy = 0.128006
Epoch 8283
Loss = 1.7220e-02, PNorm = 752.9901, GNorm = 2.6054, lr_0 = 9.9420e-04
Validation binary_cross_entropy = 0.122347
Epoch 8284
Loss = 4.4266e-02, PNorm = 753.0212, GNorm = 3.8752, lr_0 = 9.9420e-04
Validation binary_cross_entropy = 0.105089
Epoch 8285
Loss = 5.4642e-02, PNorm = 753.0603, GNorm = 3.3397, lr_0 = 9.9420e-04
Validation binary_cross_entropy = 0.075479
Epoch 8286
Loss = 4.8706e-02, PNorm = 753.1288, GNorm = 5.9575, lr_0 = 9.9420e-04
Validation binary_cross_entropy = 0.087080
Epoch 8287
Loss = 3.1109e-02, PNorm = 753.2007, GNorm = 1.9278, lr_0 = 9.9420e-04
Validation binary_cross_entropy = 0.092927
Epoch 8288
Loss = 1.7170e-02, PNorm = 753.2507, GNorm = 0.2449, lr_0 = 9.9420e-04
Validation binary_cross_entropy = 0.093718
Epoch 8289
Loss = 1.9914e-02, PNorm = 753.2772, GNorm = 1.3791, lr_0 = 9.9420e-04
Loss = 8.3748e-03, PNorm = 753.2992, GNorm = 0.0207, lr_0 = 9.9420e-04
Validation binary_cross_entropy = 0.093869
Epoch 8290
Loss = 1.3895e-02, PNorm = 753.3277, GNorm = 0.2272, lr_0 = 9.9420e-04
Validation binary_cross_entropy = 0.111121
Epoch 8291
Loss = 2.6444e-02, PNorm = 753.3480, GNorm = 1.0782, lr_0 = 9.9420e-04
Validation binary_cross_entropy = 0.112340
Epoch 8292
Loss = 1.9799e-02, PNorm = 753.3657, GNorm = 0.0285, lr_0 = 9.9420e-04
Validation binary_cross_entropy = 0.103651
Epoch 8293
Loss = 4.1107e-02, PNorm = 753.3904, GNorm = 0.4597, lr_0 = 9.9420e-04
Validation binary_cross_entropy = 0.102834
Epoch 8294
Loss = 1.0792e-02, PNorm = 753.4255, GNorm = 0.0836, lr_0 = 9.9420e-04
Validation binary_cross_entropy = 0.112181
Epoch 8295
Loss = 4.1070e-02, PNorm = 753.4511, GNorm = 2.5714, lr_0 = 9.9419e-04
Validation binary_cross_entropy = 0.095458
Epoch 8296
Loss = 2.3842e-02, PNorm = 753.4826, GNorm = 1.9731, lr_0 = 9.9419e-04
Validation binary_cross_entropy = 0.089022
Epoch 8297
Loss = 4.8613e-02, PNorm = 753.5135, GNorm = 5.6699, lr_0 = 9.9419e-04
Validation binary_cross_entropy = 0.102596
Epoch 8298
Loss = 3.0527e-01, PNorm = 753.5492, GNorm = 8.0961, lr_0 = 9.9419e-04
Validation binary_cross_entropy = 0.073534
Epoch 8299
Loss = 7.0050e-03, PNorm = 753.6128, GNorm = 0.2819, lr_0 = 9.9419e-04
Loss = 2.8330e-02, PNorm = 753.6596, GNorm = 2.1166, lr_0 = 9.9419e-04
Validation binary_cross_entropy = 0.074518
Epoch 8300
Loss = 4.1308e-02, PNorm = 753.7005, GNorm = 2.5480, lr_0 = 9.9419e-04
Validation binary_cross_entropy = 0.081527
Epoch 8301
Loss = 2.6660e-02, PNorm = 753.7332, GNorm = 1.7475, lr_0 = 9.9419e-04
Validation binary_cross_entropy = 0.080563
Epoch 8302
Loss = 1.6963e-02, PNorm = 753.7607, GNorm = 0.1929, lr_0 = 9.9419e-04
Validation binary_cross_entropy = 0.076256
Epoch 8303
Loss = 3.0074e-02, PNorm = 753.7862, GNorm = 4.8315, lr_0 = 9.9419e-04
Validation binary_cross_entropy = 0.085021
Epoch 8304
Loss = 7.6975e-03, PNorm = 753.8198, GNorm = 0.4511, lr_0 = 9.9419e-04
Validation binary_cross_entropy = 0.097698
Epoch 8305
Loss = 5.5415e-02, PNorm = 753.8499, GNorm = 6.6198, lr_0 = 9.9419e-04
Validation binary_cross_entropy = 0.085953
Epoch 8306
Loss = 4.5540e-02, PNorm = 753.8980, GNorm = 4.3139, lr_0 = 9.9419e-04
Validation binary_cross_entropy = 0.113263
Epoch 8307
Loss = 9.6501e-02, PNorm = 753.9459, GNorm = 0.0948, lr_0 = 9.9419e-04
Validation binary_cross_entropy = 0.082017
Epoch 8308
Loss = 1.7436e-02, PNorm = 753.9823, GNorm = 0.3481, lr_0 = 9.9419e-04
Validation binary_cross_entropy = 0.085813
Epoch 8309
Loss = 4.1145e-03, PNorm = 754.0151, GNorm = 0.2582, lr_0 = 9.9418e-04
Loss = 4.0198e-02, PNorm = 754.0495, GNorm = 1.1751, lr_0 = 9.9418e-04
Validation binary_cross_entropy = 0.092085
Epoch 8310
Loss = 2.8042e-02, PNorm = 754.0780, GNorm = 1.0676, lr_0 = 9.9418e-04
Validation binary_cross_entropy = 0.084113
Epoch 8311
Loss = 1.2949e-02, PNorm = 754.1014, GNorm = 0.1302, lr_0 = 9.9418e-04
Validation binary_cross_entropy = 0.082685
Epoch 8312
Loss = 3.1604e-02, PNorm = 754.1198, GNorm = 0.0959, lr_0 = 9.9418e-04
Validation binary_cross_entropy = 0.082721
Epoch 8313
Loss = 2.5496e-01, PNorm = 754.1344, GNorm = 4.4946, lr_0 = 9.9418e-04
Validation binary_cross_entropy = 0.058376
Epoch 8314
Loss = 2.9305e-02, PNorm = 754.1788, GNorm = 0.7262, lr_0 = 9.9418e-04
Validation binary_cross_entropy = 0.071311
Epoch 8315
Loss = 2.3331e-02, PNorm = 754.2225, GNorm = 1.1536, lr_0 = 9.9418e-04
Validation binary_cross_entropy = 0.076724
Epoch 8316
Loss = 1.8366e-02, PNorm = 754.2460, GNorm = 0.1162, lr_0 = 9.9418e-04
Validation binary_cross_entropy = 0.072582
Epoch 8317
Loss = 6.0253e-02, PNorm = 754.2695, GNorm = 0.1602, lr_0 = 9.9418e-04
Validation binary_cross_entropy = 0.078035
Epoch 8318
Loss = 2.6416e-02, PNorm = 754.3167, GNorm = 1.5051, lr_0 = 9.9418e-04
Validation binary_cross_entropy = 0.085157
Epoch 8319
Loss = 1.0820e-02, PNorm = 754.3494, GNorm = 1.1785, lr_0 = 9.9418e-04
Loss = 9.7868e-03, PNorm = 754.3725, GNorm = 0.3055, lr_0 = 9.9418e-04
Validation binary_cross_entropy = 0.083309
Epoch 8320
Loss = 2.2030e-02, PNorm = 754.4113, GNorm = 0.6180, lr_0 = 9.9418e-04
Validation binary_cross_entropy = 0.092678
Epoch 8321
Loss = 3.4092e-02, PNorm = 754.4397, GNorm = 1.4626, lr_0 = 9.9418e-04
Validation binary_cross_entropy = 0.088925
Epoch 8322
Loss = 3.0533e-02, PNorm = 754.4601, GNorm = 2.5818, lr_0 = 9.9417e-04
Validation binary_cross_entropy = 0.087193
Epoch 8323
Loss = 5.9092e-03, PNorm = 754.4859, GNorm = 0.1025, lr_0 = 9.9417e-04
Validation binary_cross_entropy = 0.087526
Epoch 8324
Loss = 1.2717e-02, PNorm = 754.5029, GNorm = 0.0978, lr_0 = 9.9417e-04
Validation binary_cross_entropy = 0.083170
Epoch 8325
Loss = 1.7203e-02, PNorm = 754.5334, GNorm = 0.1342, lr_0 = 9.9417e-04
Validation binary_cross_entropy = 0.084645
Epoch 8326
Loss = 1.3792e-02, PNorm = 754.5732, GNorm = 3.0746, lr_0 = 9.9417e-04
Validation binary_cross_entropy = 0.095130
Epoch 8327
Loss = 2.5046e-02, PNorm = 754.6097, GNorm = 0.5326, lr_0 = 9.9417e-04
Validation binary_cross_entropy = 0.095243
Epoch 8328
Loss = 2.2220e-03, PNorm = 754.6364, GNorm = 0.0695, lr_0 = 9.9417e-04
Validation binary_cross_entropy = 0.098994
Epoch 8329
Loss = 2.2684e-02, PNorm = 754.6652, GNorm = 1.6211, lr_0 = 9.9417e-04
Loss = 1.7282e-02, PNorm = 754.6874, GNorm = 0.4705, lr_0 = 9.9417e-04
Validation binary_cross_entropy = 0.091962
Epoch 8330
Loss = 1.8102e-02, PNorm = 754.7152, GNorm = 0.3328, lr_0 = 9.9417e-04
Validation binary_cross_entropy = 0.092959
Epoch 8331
Loss = 2.4236e-02, PNorm = 754.7549, GNorm = 0.2543, lr_0 = 9.9417e-04
Validation binary_cross_entropy = 0.097259
Epoch 8332
Loss = 2.3278e-02, PNorm = 754.7853, GNorm = 0.1665, lr_0 = 9.9417e-04
Validation binary_cross_entropy = 0.094385
Epoch 8333
Loss = 1.4163e-02, PNorm = 754.8129, GNorm = 2.5822, lr_0 = 9.9417e-04
Validation binary_cross_entropy = 0.094082
Epoch 8334
Loss = 5.6554e-03, PNorm = 754.8478, GNorm = 0.4164, lr_0 = 9.9417e-04
Validation binary_cross_entropy = 0.097087
Epoch 8335
Loss = 7.8812e-03, PNorm = 754.8803, GNorm = 1.0079, lr_0 = 9.9416e-04
Validation binary_cross_entropy = 0.092103
Epoch 8336
Loss = 8.6809e-02, PNorm = 754.9092, GNorm = 1.9596, lr_0 = 9.9416e-04
Validation binary_cross_entropy = 0.084655
Epoch 8337
Loss = 7.2243e-02, PNorm = 754.9483, GNorm = 1.4464, lr_0 = 9.9416e-04
Validation binary_cross_entropy = 0.080447
Epoch 8338
Loss = 6.6126e-03, PNorm = 755.0026, GNorm = 0.4727, lr_0 = 9.9416e-04
Validation binary_cross_entropy = 0.081658
Epoch 8339
Loss = 7.7327e-03, PNorm = 755.0584, GNorm = 0.3335, lr_0 = 9.9416e-04
Loss = 1.0170e-02, PNorm = 755.1085, GNorm = 1.4956, lr_0 = 9.9416e-04
Validation binary_cross_entropy = 0.097863
Epoch 8340
Loss = 2.9092e-02, PNorm = 755.1374, GNorm = 0.1502, lr_0 = 9.9416e-04
Validation binary_cross_entropy = 0.091302
Epoch 8341
Loss = 2.8455e-02, PNorm = 755.1600, GNorm = 1.6823, lr_0 = 9.9416e-04
Validation binary_cross_entropy = 0.080791
Epoch 8342
Loss = 1.1954e-02, PNorm = 755.1921, GNorm = 0.3365, lr_0 = 9.9416e-04
Validation binary_cross_entropy = 0.082531
Epoch 8343
Loss = 2.4025e-02, PNorm = 755.2335, GNorm = 0.3957, lr_0 = 9.9416e-04
Validation binary_cross_entropy = 0.088385
Epoch 8344
Loss = 3.1113e-02, PNorm = 755.2800, GNorm = 0.3780, lr_0 = 9.9416e-04
Validation binary_cross_entropy = 0.088382
Epoch 8345
Loss = 4.2121e-02, PNorm = 755.3186, GNorm = 2.0328, lr_0 = 9.9416e-04
Validation binary_cross_entropy = 0.102368
Epoch 8346
Loss = 2.8793e-03, PNorm = 755.3450, GNorm = 0.0594, lr_0 = 9.9416e-04
Validation binary_cross_entropy = 0.096175
Epoch 8347
Loss = 1.9843e-02, PNorm = 755.3609, GNorm = 0.0894, lr_0 = 9.9416e-04
Validation binary_cross_entropy = 0.078678
Epoch 8348
Loss = 7.6196e-02, PNorm = 755.3868, GNorm = 0.7374, lr_0 = 9.9416e-04
Validation binary_cross_entropy = 0.080685
Epoch 8349
Loss = 1.0246e-02, PNorm = 755.4466, GNorm = 1.0597, lr_0 = 9.9415e-04
Loss = 3.3191e-02, PNorm = 755.4874, GNorm = 0.7216, lr_0 = 9.9415e-04
Validation binary_cross_entropy = 0.080937
Epoch 8350
Loss = 1.6353e-02, PNorm = 755.5176, GNorm = 0.2898, lr_0 = 9.9415e-04
Validation binary_cross_entropy = 0.074145
Epoch 8351
Loss = 2.8936e-02, PNorm = 755.5600, GNorm = 0.2789, lr_0 = 9.9415e-04
Validation binary_cross_entropy = 0.075877
Epoch 8352
Loss = 1.7584e-02, PNorm = 755.6015, GNorm = 0.1615, lr_0 = 9.9415e-04
Validation binary_cross_entropy = 0.078860
Epoch 8353
Loss = 8.1409e-03, PNorm = 755.6306, GNorm = 1.1450, lr_0 = 9.9415e-04
Validation binary_cross_entropy = 0.081706
Epoch 8354
Loss = 3.1522e-02, PNorm = 755.6600, GNorm = 2.9405, lr_0 = 9.9415e-04
Validation binary_cross_entropy = 0.086525
Epoch 8355
Loss = 1.5985e-02, PNorm = 755.6926, GNorm = 0.4307, lr_0 = 9.9415e-04
Validation binary_cross_entropy = 0.086468
Epoch 8356
Loss = 5.6618e-03, PNorm = 755.7242, GNorm = 3.2637, lr_0 = 9.9415e-04
Validation binary_cross_entropy = 0.104127
Epoch 8357
Loss = 9.6420e-02, PNorm = 755.7633, GNorm = 0.1356, lr_0 = 9.9415e-04
Validation binary_cross_entropy = 0.096539
Epoch 8358
Loss = 1.5645e-03, PNorm = 755.7988, GNorm = 0.1542, lr_0 = 9.9415e-04
Validation binary_cross_entropy = 0.105087
Epoch 8359
Loss = 3.0218e-03, PNorm = 755.8236, GNorm = 0.2049, lr_0 = 9.9415e-04
Loss = 6.4926e-02, PNorm = 755.8385, GNorm = 0.0861, lr_0 = 9.9415e-04
Validation binary_cross_entropy = 0.091050
Epoch 8360
Loss = 1.1938e-02, PNorm = 755.8557, GNorm = 0.5229, lr_0 = 9.9415e-04
Validation binary_cross_entropy = 0.085496
Epoch 8361
Loss = 2.4840e-02, PNorm = 755.8757, GNorm = 3.3845, lr_0 = 9.9414e-04
Validation binary_cross_entropy = 0.086264
Epoch 8362
Loss = 7.4236e-02, PNorm = 755.8989, GNorm = 13.2537, lr_0 = 9.9414e-04
Validation binary_cross_entropy = 0.077288
Epoch 8363
Loss = 1.7697e-02, PNorm = 755.9368, GNorm = 7.3388, lr_0 = 9.9414e-04
Validation binary_cross_entropy = 0.084948
Epoch 8364
Loss = 2.3222e-02, PNorm = 755.9618, GNorm = 0.2894, lr_0 = 9.9414e-04
Validation binary_cross_entropy = 0.081698
Epoch 8365
Loss = 3.4518e-02, PNorm = 755.9960, GNorm = 0.1491, lr_0 = 9.9414e-04
Validation binary_cross_entropy = 0.090466
Epoch 8366
Loss = 7.3238e-03, PNorm = 756.0331, GNorm = 0.1570, lr_0 = 9.9414e-04
Validation binary_cross_entropy = 0.104066
Epoch 8367
Loss = 4.1863e-03, PNorm = 756.0603, GNorm = 0.5388, lr_0 = 9.9414e-04
Validation binary_cross_entropy = 0.103930
Epoch 8368
Loss = 7.6365e-02, PNorm = 756.0934, GNorm = 5.2860, lr_0 = 9.9414e-04
Validation binary_cross_entropy = 0.117170
Epoch 8369
Loss = 7.7228e-03, PNorm = 756.1366, GNorm = 0.6714, lr_0 = 9.9414e-04
Loss = 2.7192e-02, PNorm = 756.1764, GNorm = 0.0765, lr_0 = 9.9414e-04
Validation binary_cross_entropy = 0.109637
Epoch 8370
Loss = 4.0462e-02, PNorm = 756.2167, GNorm = 6.4456, lr_0 = 9.9414e-04
Validation binary_cross_entropy = 0.101353
Epoch 8371
Loss = 4.5294e-02, PNorm = 756.2806, GNorm = 1.2284, lr_0 = 9.9414e-04
Validation binary_cross_entropy = 0.094847
Epoch 8372
Loss = 3.3338e-02, PNorm = 756.3395, GNorm = 0.2719, lr_0 = 9.9414e-04
Validation binary_cross_entropy = 0.115488
Epoch 8373
Loss = 1.4298e-02, PNorm = 756.3828, GNorm = 0.1502, lr_0 = 9.9414e-04
Validation binary_cross_entropy = 0.109299
Epoch 8374
Loss = 5.1969e-02, PNorm = 756.4179, GNorm = 0.1569, lr_0 = 9.9414e-04
Validation binary_cross_entropy = 0.093789
Epoch 8375
Loss = 1.3793e-02, PNorm = 756.4675, GNorm = 0.1329, lr_0 = 9.9413e-04
Validation binary_cross_entropy = 0.085328
Epoch 8376
Loss = 5.8401e-03, PNorm = 756.5132, GNorm = 1.2927, lr_0 = 9.9413e-04
Validation binary_cross_entropy = 0.084246
Epoch 8377
Loss = 2.0103e-02, PNorm = 756.5480, GNorm = 0.1709, lr_0 = 9.9413e-04
Validation binary_cross_entropy = 0.108648
Epoch 8378
Loss = 4.3763e-02, PNorm = 756.5856, GNorm = 0.2506, lr_0 = 9.9413e-04
Validation binary_cross_entropy = 0.100232
Epoch 8379
Loss = 1.2348e-03, PNorm = 756.6190, GNorm = 0.0923, lr_0 = 9.9413e-04
Loss = 1.6983e-02, PNorm = 756.6472, GNorm = 0.1775, lr_0 = 9.9413e-04
Validation binary_cross_entropy = 0.096002
Epoch 8380
Loss = 5.2209e-02, PNorm = 756.6684, GNorm = 0.1604, lr_0 = 9.9413e-04
Validation binary_cross_entropy = 0.098979
Epoch 8381
Loss = 4.9673e-02, PNorm = 756.7202, GNorm = 1.4883, lr_0 = 9.9413e-04
Validation binary_cross_entropy = 0.093346
Epoch 8382
Loss = 5.9799e-02, PNorm = 756.8020, GNorm = 0.2131, lr_0 = 9.9413e-04
Validation binary_cross_entropy = 0.103855
Epoch 8383
Loss = 3.6291e-02, PNorm = 756.8834, GNorm = 0.2608, lr_0 = 9.9413e-04
Validation binary_cross_entropy = 0.136351
Epoch 8384
Loss = 4.4632e-02, PNorm = 756.9297, GNorm = 0.1373, lr_0 = 9.9413e-04
Validation binary_cross_entropy = 0.119442
Epoch 8385
Loss = 9.9010e-02, PNorm = 756.9640, GNorm = 5.5593, lr_0 = 9.9413e-04
Validation binary_cross_entropy = 0.088702
Epoch 8386
Loss = 1.7859e-02, PNorm = 757.0306, GNorm = 0.9105, lr_0 = 9.9413e-04
Validation binary_cross_entropy = 0.080392
Epoch 8387
Loss = 6.1707e-02, PNorm = 757.1361, GNorm = 0.3938, lr_0 = 9.9413e-04
Validation binary_cross_entropy = 0.090160
Epoch 8388
Loss = 1.0861e-02, PNorm = 757.2161, GNorm = 0.7834, lr_0 = 9.9413e-04
Validation binary_cross_entropy = 0.087177
Epoch 8389
Loss = 2.9985e-02, PNorm = 757.2838, GNorm = 1.7679, lr_0 = 9.9412e-04
Loss = 1.4387e-02, PNorm = 757.3413, GNorm = 0.4947, lr_0 = 9.9412e-04
Validation binary_cross_entropy = 0.110703
Epoch 8390
Loss = 2.7848e-02, PNorm = 757.3726, GNorm = 1.1215, lr_0 = 9.9412e-04
Validation binary_cross_entropy = 0.103797
Epoch 8391
Loss = 6.7299e-03, PNorm = 757.4020, GNorm = 0.2479, lr_0 = 9.9412e-04
Validation binary_cross_entropy = 0.101378
Epoch 8392
Loss = 7.5471e-02, PNorm = 757.4318, GNorm = 6.0826, lr_0 = 9.9412e-04
Validation binary_cross_entropy = 0.088216
Epoch 8393
Loss = 3.9838e-02, PNorm = 757.4767, GNorm = 4.4905, lr_0 = 9.9412e-04
Validation binary_cross_entropy = 0.090791
Epoch 8394
Loss = 8.1407e-02, PNorm = 757.5484, GNorm = 3.7342, lr_0 = 9.9412e-04
Validation binary_cross_entropy = 0.114280
Epoch 8395
Loss = 3.3682e-02, PNorm = 757.6090, GNorm = 0.3871, lr_0 = 9.9412e-04
Validation binary_cross_entropy = 0.125234
Epoch 8396
Loss = 2.2074e-02, PNorm = 757.6746, GNorm = 1.3982, lr_0 = 9.9412e-04
Validation binary_cross_entropy = 0.082220
Epoch 8397
Loss = 1.6960e-01, PNorm = 757.7496, GNorm = 5.0638, lr_0 = 9.9412e-04
Validation binary_cross_entropy = 0.120881
Epoch 8398
Loss = 1.4161e-01, PNorm = 757.8221, GNorm = 1.3635, lr_0 = 9.9412e-04
Validation binary_cross_entropy = 0.152268
Epoch 8399
Loss = 2.3039e-01, PNorm = 757.8990, GNorm = 8.1374, lr_0 = 9.9412e-04
Loss = 4.3399e-02, PNorm = 757.9604, GNorm = 0.1073, lr_0 = 9.9412e-04
Validation binary_cross_entropy = 0.105987
Epoch 8400
Loss = 1.7042e-02, PNorm = 758.0055, GNorm = 0.9663, lr_0 = 9.9412e-04
Validation binary_cross_entropy = 0.096704
Epoch 8401
Loss = 8.4787e-02, PNorm = 758.0443, GNorm = 2.5082, lr_0 = 9.9411e-04
Validation binary_cross_entropy = 0.098527
Epoch 8402
Loss = 4.8336e-02, PNorm = 758.0748, GNorm = 0.7600, lr_0 = 9.9411e-04
Validation binary_cross_entropy = 0.079753
Epoch 8403
Loss = 1.3345e-01, PNorm = 758.1278, GNorm = 0.5905, lr_0 = 9.9411e-04
Validation binary_cross_entropy = 0.106561
Epoch 8404
Loss = 3.9360e-02, PNorm = 758.1851, GNorm = 0.1962, lr_0 = 9.9411e-04
Validation binary_cross_entropy = 0.077836
Epoch 8405
Loss = 2.3020e-02, PNorm = 758.2310, GNorm = 1.3101, lr_0 = 9.9411e-04
Validation binary_cross_entropy = 0.079635
Epoch 8406
Loss = 6.4404e-02, PNorm = 758.2909, GNorm = 3.1539, lr_0 = 9.9411e-04
Validation binary_cross_entropy = 0.088112
Epoch 8407
Loss = 6.4742e-02, PNorm = 758.3397, GNorm = 1.4533, lr_0 = 9.9411e-04
Validation binary_cross_entropy = 0.089896
Epoch 8408
Loss = 5.4227e-02, PNorm = 758.3805, GNorm = 3.5562, lr_0 = 9.9411e-04
Validation binary_cross_entropy = 0.096807
Epoch 8409
Loss = 2.0333e+00, PNorm = 758.4183, GNorm = 43.7548, lr_0 = 9.9411e-04
Loss = 5.7600e-02, PNorm = 758.5044, GNorm = 0.0650, lr_0 = 9.9411e-04
Validation binary_cross_entropy = 0.090560
Epoch 8410
Loss = 8.0593e-02, PNorm = 758.5922, GNorm = 0.6023, lr_0 = 9.9411e-04
Validation binary_cross_entropy = 0.136803
Epoch 8411
Loss = 1.1745e-01, PNorm = 758.6539, GNorm = 0.5782, lr_0 = 9.9411e-04
Validation binary_cross_entropy = 0.061556
Epoch 8412
Loss = 6.5723e-02, PNorm = 758.7697, GNorm = 1.5183, lr_0 = 9.9411e-04
Validation binary_cross_entropy = 0.102248
Epoch 8413
Loss = 4.3830e-02, PNorm = 758.8618, GNorm = 2.2509, lr_0 = 9.9411e-04
Validation binary_cross_entropy = 0.096788
Epoch 8414
Loss = 3.9343e-02, PNorm = 758.9116, GNorm = 0.1001, lr_0 = 9.9411e-04
Validation binary_cross_entropy = 0.083138
Epoch 8415
Loss = 3.3518e-02, PNorm = 758.9701, GNorm = 0.7077, lr_0 = 9.9410e-04
Validation binary_cross_entropy = 0.074679
Epoch 8416
Loss = 4.4128e-02, PNorm = 759.0200, GNorm = 0.3360, lr_0 = 9.9410e-04
Validation binary_cross_entropy = 0.074042
Epoch 8417
Loss = 5.2020e-02, PNorm = 759.0638, GNorm = 1.5955, lr_0 = 9.9410e-04
Validation binary_cross_entropy = 0.068559
Epoch 8418
Loss = 4.1287e-02, PNorm = 759.0967, GNorm = 0.3189, lr_0 = 9.9410e-04
Validation binary_cross_entropy = 0.059875
Epoch 8419
Loss = 1.2366e-02, PNorm = 759.1687, GNorm = 0.6633, lr_0 = 9.9410e-04
Loss = 6.0263e-02, PNorm = 759.2329, GNorm = 0.3885, lr_0 = 9.9410e-04
Validation binary_cross_entropy = 0.086400
Epoch 8420
Loss = 4.5712e-02, PNorm = 759.2831, GNorm = 1.5154, lr_0 = 9.9410e-04
Validation binary_cross_entropy = 0.069606
Epoch 8421
Loss = 5.0561e-02, PNorm = 759.3219, GNorm = 0.2344, lr_0 = 9.9410e-04
Validation binary_cross_entropy = 0.075448
Epoch 8422
Loss = 3.8745e-02, PNorm = 759.3558, GNorm = 0.7007, lr_0 = 9.9410e-04
Validation binary_cross_entropy = 0.080297
Epoch 8423
Loss = 2.7620e-02, PNorm = 759.3967, GNorm = 0.4502, lr_0 = 9.9410e-04
Validation binary_cross_entropy = 0.096195
Epoch 8424
Loss = 4.4046e-02, PNorm = 759.4337, GNorm = 1.8589, lr_0 = 9.9410e-04
Validation binary_cross_entropy = 0.087066
Epoch 8425
Loss = 2.3959e-02, PNorm = 759.4638, GNorm = 5.8464, lr_0 = 9.9410e-04
Validation binary_cross_entropy = 0.089814
Epoch 8426
Loss = 5.6835e-02, PNorm = 759.4967, GNorm = 4.2770, lr_0 = 9.9410e-04
Validation binary_cross_entropy = 0.097679
Epoch 8427
Loss = 6.3634e-02, PNorm = 759.5465, GNorm = 0.8584, lr_0 = 9.9410e-04
Validation binary_cross_entropy = 0.102808
Epoch 8428
Loss = 3.1803e-02, PNorm = 759.5750, GNorm = 3.4661, lr_0 = 9.9409e-04
Validation binary_cross_entropy = 0.101557
Epoch 8429
Loss = 2.3386e-02, PNorm = 759.6025, GNorm = 1.5749, lr_0 = 9.9409e-04
Loss = 1.0492e-02, PNorm = 759.6340, GNorm = 0.3618, lr_0 = 9.9409e-04
Validation binary_cross_entropy = 0.106011
Epoch 8430
Loss = 3.4335e-02, PNorm = 759.6576, GNorm = 0.9477, lr_0 = 9.9409e-04
Validation binary_cross_entropy = 0.109721
Epoch 8431
Loss = 1.3177e-02, PNorm = 759.6713, GNorm = 0.1577, lr_0 = 9.9409e-04
Validation binary_cross_entropy = 0.098732
Epoch 8432
Loss = 9.0814e-03, PNorm = 759.6835, GNorm = 0.2704, lr_0 = 9.9409e-04
Validation binary_cross_entropy = 0.100659
Epoch 8433
Loss = 5.6348e-02, PNorm = 759.7049, GNorm = 0.0437, lr_0 = 9.9409e-04
Validation binary_cross_entropy = 0.119455
Epoch 8434
Loss = 1.4388e-02, PNorm = 759.7275, GNorm = 1.9069, lr_0 = 9.9409e-04
Validation binary_cross_entropy = 0.103396
Epoch 8435
Loss = 7.3727e-02, PNorm = 759.7503, GNorm = 1.7426, lr_0 = 9.9409e-04
Validation binary_cross_entropy = 0.098644
Epoch 8436
Loss = 3.1834e-02, PNorm = 759.7814, GNorm = 1.2716, lr_0 = 9.9409e-04
Validation binary_cross_entropy = 0.095522
Epoch 8437
Loss = 4.3390e-02, PNorm = 759.8095, GNorm = 1.4111, lr_0 = 9.9409e-04
Validation binary_cross_entropy = 0.094355
Epoch 8438
Loss = 3.0722e-03, PNorm = 759.8316, GNorm = 0.2241, lr_0 = 9.9409e-04
Validation binary_cross_entropy = 0.096934
Epoch 8439
Loss = 1.0093e-01, PNorm = 759.8568, GNorm = 2.0366, lr_0 = 9.9409e-04
Loss = 1.4479e-02, PNorm = 759.8807, GNorm = 0.7674, lr_0 = 9.9409e-04
Validation binary_cross_entropy = 0.097741
Epoch 8440
Loss = 2.3457e-02, PNorm = 759.9029, GNorm = 0.1381, lr_0 = 9.9409e-04
Validation binary_cross_entropy = 0.088356
Epoch 8441
Loss = 3.1231e-02, PNorm = 759.9425, GNorm = 0.3208, lr_0 = 9.9408e-04
Validation binary_cross_entropy = 0.095645
Epoch 8442
Loss = 2.1914e-02, PNorm = 759.9829, GNorm = 0.3053, lr_0 = 9.9408e-04
Validation binary_cross_entropy = 0.101549
Epoch 8443
Loss = 1.5175e-02, PNorm = 760.0171, GNorm = 2.1510, lr_0 = 9.9408e-04
Validation binary_cross_entropy = 0.100869
Epoch 8444
Loss = 4.2433e-02, PNorm = 760.0427, GNorm = 0.0886, lr_0 = 9.9408e-04
Validation binary_cross_entropy = 0.094421
Epoch 8445
Loss = 5.2887e-02, PNorm = 760.0755, GNorm = 1.4432, lr_0 = 9.9408e-04
Validation binary_cross_entropy = 0.098911
Epoch 8446
Loss = 1.7671e-02, PNorm = 760.1063, GNorm = 1.1986, lr_0 = 9.9408e-04
Validation binary_cross_entropy = 0.091125
Epoch 8447
Loss = 3.1719e-02, PNorm = 760.1278, GNorm = 3.2551, lr_0 = 9.9408e-04
Validation binary_cross_entropy = 0.089585
Epoch 8448
Loss = 3.1213e-02, PNorm = 760.1643, GNorm = 8.0050, lr_0 = 9.9408e-04
Validation binary_cross_entropy = 0.101123
Epoch 8449
Loss = 1.5536e-01, PNorm = 760.1978, GNorm = 9.4979, lr_0 = 9.9408e-04
Loss = 5.4293e-02, PNorm = 760.2270, GNorm = 0.8356, lr_0 = 9.9408e-04
Validation binary_cross_entropy = 0.092209
Epoch 8450
Loss = 6.0533e-02, PNorm = 760.2696, GNorm = 15.0805, lr_0 = 9.9408e-04
Validation binary_cross_entropy = 0.092624
Epoch 8451
Loss = 2.0715e-02, PNorm = 760.3451, GNorm = 0.2068, lr_0 = 9.9408e-04
Validation binary_cross_entropy = 0.093408
Epoch 8452
Loss = 2.8614e-02, PNorm = 760.4164, GNorm = 0.1364, lr_0 = 9.9408e-04
Validation binary_cross_entropy = 0.114974
Epoch 8453
Loss = 1.5077e-01, PNorm = 760.4653, GNorm = 11.9113, lr_0 = 9.9408e-04
Validation binary_cross_entropy = 0.101213
Epoch 8454
Loss = 7.9148e-02, PNorm = 760.5155, GNorm = 0.1694, lr_0 = 9.9408e-04
Validation binary_cross_entropy = 0.084237
Epoch 8455
Loss = 4.0679e-02, PNorm = 760.5655, GNorm = 5.2934, lr_0 = 9.9407e-04
Validation binary_cross_entropy = 0.091823
Epoch 8456
Loss = 4.3480e-02, PNorm = 760.6316, GNorm = 4.5728, lr_0 = 9.9407e-04
Validation binary_cross_entropy = 0.097199
Epoch 8457
Loss = 7.0338e-03, PNorm = 760.6942, GNorm = 1.1461, lr_0 = 9.9407e-04
Validation binary_cross_entropy = 0.104163
Epoch 8458
Loss = 1.5664e-02, PNorm = 760.7400, GNorm = 0.9208, lr_0 = 9.9407e-04
Validation binary_cross_entropy = 0.101343
Epoch 8459
Loss = 6.2629e-03, PNorm = 760.7737, GNorm = 0.5565, lr_0 = 9.9407e-04
Loss = 2.1363e-02, PNorm = 760.8011, GNorm = 1.5006, lr_0 = 9.9407e-04
Validation binary_cross_entropy = 0.099907
Epoch 8460
Loss = 3.5056e-02, PNorm = 760.8209, GNorm = 2.0219, lr_0 = 9.9407e-04
Validation binary_cross_entropy = 0.097259
Epoch 8461
Loss = 5.0459e-02, PNorm = 760.8519, GNorm = 1.3975, lr_0 = 9.9407e-04
Validation binary_cross_entropy = 0.099769
Epoch 8462
Loss = 4.8490e-02, PNorm = 760.8893, GNorm = 0.1536, lr_0 = 9.9407e-04
Validation binary_cross_entropy = 0.105486
Epoch 8463
Loss = 6.0752e-02, PNorm = 760.9348, GNorm = 1.3522, lr_0 = 9.9407e-04
Validation binary_cross_entropy = 0.119436
Epoch 8464
Loss = 1.0645e-01, PNorm = 760.9693, GNorm = 10.4132, lr_0 = 9.9407e-04
Validation binary_cross_entropy = 0.103759
Epoch 8465
Loss = 4.9491e-02, PNorm = 761.0195, GNorm = 2.7325, lr_0 = 9.9407e-04
Validation binary_cross_entropy = 0.161677
Epoch 8466
Loss = 6.9925e-02, PNorm = 761.0818, GNorm = 2.0472, lr_0 = 9.9407e-04
Validation binary_cross_entropy = 0.114966
Epoch 8467
Loss = 5.9036e-03, PNorm = 761.1305, GNorm = 0.1648, lr_0 = 9.9407e-04
Validation binary_cross_entropy = 0.091159
Epoch 8468
Loss = 5.9230e-03, PNorm = 761.1761, GNorm = 0.1834, lr_0 = 9.9406e-04
Validation binary_cross_entropy = 0.098368
Epoch 8469
Loss = 6.6788e-02, PNorm = 761.2175, GNorm = 2.0395, lr_0 = 9.9406e-04
Loss = 2.4897e-02, PNorm = 761.2509, GNorm = 0.0270, lr_0 = 9.9406e-04
Validation binary_cross_entropy = 0.153194
Epoch 8470
Loss = 5.3820e-03, PNorm = 761.2800, GNorm = 0.0073, lr_0 = 9.9406e-04
Validation binary_cross_entropy = 0.378718
Epoch 8471
Loss = 1.9747e-02, PNorm = 761.3137, GNorm = 0.5603, lr_0 = 9.9406e-04
Validation binary_cross_entropy = 0.165222
Epoch 8472
Loss = 1.2746e-01, PNorm = 761.3585, GNorm = 0.2809, lr_0 = 9.9406e-04
Validation binary_cross_entropy = 0.163079
Epoch 8473
Loss = 7.9568e-02, PNorm = 761.3977, GNorm = 1.9910, lr_0 = 9.9406e-04
Validation binary_cross_entropy = 0.093515
Epoch 8474
Loss = 4.0445e-02, PNorm = 761.4300, GNorm = 2.1676, lr_0 = 9.9406e-04
Validation binary_cross_entropy = 0.081150
Epoch 8475
Loss = 2.7380e-02, PNorm = 761.4678, GNorm = 0.5109, lr_0 = 9.9406e-04
Validation binary_cross_entropy = 0.095036
Epoch 8476
Loss = 5.3509e-02, PNorm = 761.4962, GNorm = 2.4239, lr_0 = 9.9406e-04
Validation binary_cross_entropy = 0.093605
Epoch 8477
Loss = 2.8579e-02, PNorm = 761.5205, GNorm = 2.3795, lr_0 = 9.9406e-04
Validation binary_cross_entropy = 0.085015
Epoch 8478
Loss = 5.1007e-02, PNorm = 761.5433, GNorm = 3.2950, lr_0 = 9.9406e-04
Validation binary_cross_entropy = 0.090346
Epoch 8479
Loss = 3.9596e-03, PNorm = 761.5817, GNorm = 0.2158, lr_0 = 9.9406e-04
Loss = 1.4671e-02, PNorm = 761.6096, GNorm = 5.3343, lr_0 = 9.9406e-04
Validation binary_cross_entropy = 0.089391
Epoch 8480
Loss = 1.0737e-02, PNorm = 761.6380, GNorm = 0.1531, lr_0 = 9.9406e-04
Validation binary_cross_entropy = 0.099461
Epoch 8481
Loss = 7.9723e-03, PNorm = 761.6602, GNorm = 1.2905, lr_0 = 9.9405e-04
Validation binary_cross_entropy = 0.108434
Epoch 8482
Loss = 1.3948e-02, PNorm = 761.6825, GNorm = 0.2221, lr_0 = 9.9405e-04
Validation binary_cross_entropy = 0.112355
Epoch 8483
Loss = 6.1744e-02, PNorm = 761.7076, GNorm = 1.0624, lr_0 = 9.9405e-04
Validation binary_cross_entropy = 0.106359
Epoch 8484
Loss = 3.5916e-02, PNorm = 761.7375, GNorm = 1.4929, lr_0 = 9.9405e-04
Validation binary_cross_entropy = 0.093505
Epoch 8485
Loss = 4.4572e-02, PNorm = 761.8009, GNorm = 0.8920, lr_0 = 9.9405e-04
Validation binary_cross_entropy = 0.127653
Epoch 8486
Loss = 9.2618e-02, PNorm = 761.8542, GNorm = 2.1278, lr_0 = 9.9405e-04
Validation binary_cross_entropy = 0.075608
Epoch 8487
Loss = 3.2420e-02, PNorm = 761.8953, GNorm = 0.4256, lr_0 = 9.9405e-04
Validation binary_cross_entropy = 0.074682
Epoch 8488
Loss = 8.1315e-03, PNorm = 761.9565, GNorm = 0.3375, lr_0 = 9.9405e-04
Validation binary_cross_entropy = 0.092457
Epoch 8489
Loss = 5.3076e-03, PNorm = 762.0228, GNorm = 0.6380, lr_0 = 9.9405e-04
Loss = 6.3430e-02, PNorm = 762.0596, GNorm = 1.7786, lr_0 = 9.9405e-04
Validation binary_cross_entropy = 0.078753
Epoch 8490
Loss = 7.9940e-02, PNorm = 762.0955, GNorm = 0.3377, lr_0 = 9.9405e-04
Validation binary_cross_entropy = 0.068164
Epoch 8491
Loss = 5.4141e-02, PNorm = 762.1659, GNorm = 1.2871, lr_0 = 9.9405e-04
Validation binary_cross_entropy = 0.076240
Epoch 8492
Loss = 4.1738e-02, PNorm = 762.2267, GNorm = 0.2575, lr_0 = 9.9405e-04
Validation binary_cross_entropy = 0.076018
Epoch 8493
Loss = 3.9502e-02, PNorm = 762.2734, GNorm = 0.1445, lr_0 = 9.9405e-04
Validation binary_cross_entropy = 0.074476
Epoch 8494
Loss = 2.3024e-02, PNorm = 762.3116, GNorm = 1.9698, lr_0 = 9.9404e-04
Validation binary_cross_entropy = 0.074289
Epoch 8495
Loss = 1.0258e-02, PNorm = 762.3581, GNorm = 0.1255, lr_0 = 9.9404e-04
Validation binary_cross_entropy = 0.092842
Epoch 8496
Loss = 4.7423e-02, PNorm = 762.3969, GNorm = 0.8287, lr_0 = 9.9404e-04
Validation binary_cross_entropy = 0.077023
Epoch 8497
Loss = 1.4940e-02, PNorm = 762.4209, GNorm = 1.6612, lr_0 = 9.9404e-04
Validation binary_cross_entropy = 0.076401
Epoch 8498
Loss = 1.1734e-02, PNorm = 762.4492, GNorm = 0.3463, lr_0 = 9.9404e-04
Validation binary_cross_entropy = 0.078533
Epoch 8499
Loss = 3.6122e-03, PNorm = 762.4892, GNorm = 0.3768, lr_0 = 9.9404e-04
Loss = 1.7257e-02, PNorm = 762.5155, GNorm = 0.2412, lr_0 = 9.9404e-04
Validation binary_cross_entropy = 0.082020
Epoch 8500
Loss = 5.0570e-02, PNorm = 762.5262, GNorm = 0.0487, lr_0 = 9.9404e-04
Validation binary_cross_entropy = 0.073611
Epoch 8501
Loss = 1.5647e-02, PNorm = 762.5478, GNorm = 1.4409, lr_0 = 9.9404e-04
Validation binary_cross_entropy = 0.075211
Epoch 8502
Loss = 2.6442e-02, PNorm = 762.5826, GNorm = 5.6217, lr_0 = 9.9404e-04
Validation binary_cross_entropy = 0.080358
Epoch 8503
Loss = 1.4769e-02, PNorm = 762.6122, GNorm = 0.0813, lr_0 = 9.9404e-04
Validation binary_cross_entropy = 0.086284
Epoch 8504
Loss = 4.7163e-02, PNorm = 762.6473, GNorm = 1.4418, lr_0 = 9.9404e-04
Validation binary_cross_entropy = 0.101930
Epoch 8505
Loss = 2.1491e-02, PNorm = 762.6857, GNorm = 1.4949, lr_0 = 9.9404e-04
Validation binary_cross_entropy = 0.121748
Epoch 8506
Loss = 1.2605e-01, PNorm = 762.7124, GNorm = 2.8235, lr_0 = 9.9404e-04
Validation binary_cross_entropy = 0.086792
Epoch 8507
Loss = 2.4777e-02, PNorm = 762.7414, GNorm = 1.0806, lr_0 = 9.9404e-04
Validation binary_cross_entropy = 0.076172
Epoch 8508
Loss = 2.6614e-02, PNorm = 762.7669, GNorm = 2.4531, lr_0 = 9.9403e-04
Validation binary_cross_entropy = 0.077488
Epoch 8509
Loss = 1.1865e-02, PNorm = 762.7997, GNorm = 1.4063, lr_0 = 9.9403e-04
Loss = 2.8702e-02, PNorm = 762.8323, GNorm = 0.2613, lr_0 = 9.9403e-04
Validation binary_cross_entropy = 0.074169
Epoch 8510
Loss = 4.3971e-02, PNorm = 762.8677, GNorm = 0.9549, lr_0 = 9.9403e-04
Validation binary_cross_entropy = 0.074104
Epoch 8511
Loss = 4.2004e-03, PNorm = 762.9004, GNorm = 0.1032, lr_0 = 9.9403e-04
Validation binary_cross_entropy = 0.077940
Epoch 8512
Loss = 5.2716e-02, PNorm = 762.9189, GNorm = 0.5143, lr_0 = 9.9403e-04
Validation binary_cross_entropy = 0.075973
Epoch 8513
Loss = 1.2370e-02, PNorm = 762.9400, GNorm = 1.2316, lr_0 = 9.9403e-04
Validation binary_cross_entropy = 0.171530
Epoch 8514
Loss = 7.7499e-02, PNorm = 762.9709, GNorm = 0.5125, lr_0 = 9.9403e-04
Validation binary_cross_entropy = 0.081590
Epoch 8515
Loss = 1.8876e-02, PNorm = 763.0088, GNorm = 0.0891, lr_0 = 9.9403e-04
Validation binary_cross_entropy = 0.087954
Epoch 8516
Loss = 4.3312e-02, PNorm = 763.0432, GNorm = 2.6134, lr_0 = 9.9403e-04
Validation binary_cross_entropy = 0.080063
Epoch 8517
Loss = 1.3420e-02, PNorm = 763.0943, GNorm = 1.1885, lr_0 = 9.9403e-04
Validation binary_cross_entropy = 0.081148
Epoch 8518
Loss = 8.5358e-03, PNorm = 763.1345, GNorm = 0.2878, lr_0 = 9.9403e-04
Validation binary_cross_entropy = 0.080422
Epoch 8519
Loss = 4.3427e-03, PNorm = 763.1572, GNorm = 0.2560, lr_0 = 9.9403e-04
Loss = 1.0347e-02, PNorm = 763.1837, GNorm = 0.1815, lr_0 = 9.9403e-04
Validation binary_cross_entropy = 0.081611
Epoch 8520
Loss = 2.1906e-02, PNorm = 763.2071, GNorm = 0.2027, lr_0 = 9.9403e-04
Validation binary_cross_entropy = 0.082425
Epoch 8521
Loss = 2.9661e-01, PNorm = 763.2637, GNorm = 5.4494, lr_0 = 9.9402e-04
Validation binary_cross_entropy = 0.066229
Epoch 8522
Loss = 7.7955e-02, PNorm = 763.3647, GNorm = 2.7445, lr_0 = 9.9402e-04
Validation binary_cross_entropy = 0.098642
Epoch 8523
Loss = 1.0781e-01, PNorm = 763.4302, GNorm = 13.0140, lr_0 = 9.9402e-04
Validation binary_cross_entropy = 0.098541
Epoch 8524
Loss = 1.0303e-01, PNorm = 763.5002, GNorm = 2.6512, lr_0 = 9.9402e-04
Validation binary_cross_entropy = 0.073756
Epoch 8525
Loss = 1.1418e-01, PNorm = 763.5571, GNorm = 13.7480, lr_0 = 9.9402e-04
Validation binary_cross_entropy = 0.074581
Epoch 8526
Loss = 9.9649e-02, PNorm = 763.6511, GNorm = 1.2667, lr_0 = 9.9402e-04
Validation binary_cross_entropy = 0.135202
Epoch 8527
Loss = 2.4765e-02, PNorm = 763.7266, GNorm = 1.2859, lr_0 = 9.9402e-04
Validation binary_cross_entropy = 0.110096
Epoch 8528
Loss = 3.5509e-02, PNorm = 763.7789, GNorm = 2.1688, lr_0 = 9.9402e-04
Validation binary_cross_entropy = 0.104755
Epoch 8529
Loss = 9.8500e-02, PNorm = 763.8340, GNorm = 4.1502, lr_0 = 9.9402e-04
Loss = 4.4816e-02, PNorm = 763.8879, GNorm = 0.5469, lr_0 = 9.9402e-04
Validation binary_cross_entropy = 0.093303
Epoch 8530
Loss = 3.5520e-02, PNorm = 763.9354, GNorm = 1.1778, lr_0 = 9.9402e-04
Validation binary_cross_entropy = 0.090055
Epoch 8531
Loss = 4.0112e-02, PNorm = 763.9777, GNorm = 1.0848, lr_0 = 9.9402e-04
Validation binary_cross_entropy = 0.093648
Epoch 8532
Loss = 1.7766e-02, PNorm = 764.0149, GNorm = 0.1859, lr_0 = 9.9402e-04
Validation binary_cross_entropy = 0.103931
Epoch 8533
Loss = 3.0890e-02, PNorm = 764.0583, GNorm = 0.5945, lr_0 = 9.9402e-04
Validation binary_cross_entropy = 0.105692
Epoch 8534
Loss = 4.4355e-02, PNorm = 764.1095, GNorm = 13.6793, lr_0 = 9.9401e-04
Validation binary_cross_entropy = 0.099154
Epoch 8535
Loss = 5.0741e-02, PNorm = 764.1896, GNorm = 0.8419, lr_0 = 9.9401e-04
Validation binary_cross_entropy = 0.112017
Epoch 8536
Loss = 1.9149e-02, PNorm = 764.2575, GNorm = 1.9906, lr_0 = 9.9401e-04
Validation binary_cross_entropy = 0.101381
Epoch 8537
Loss = 2.0831e-02, PNorm = 764.3055, GNorm = 2.3250, lr_0 = 9.9401e-04
Validation binary_cross_entropy = 0.095803
Epoch 8538
Loss = 1.3619e-01, PNorm = 764.3422, GNorm = 3.6864, lr_0 = 9.9401e-04
Validation binary_cross_entropy = 0.083714
Epoch 8539
Loss = 2.2527e-02, PNorm = 764.3870, GNorm = 1.1787, lr_0 = 9.9401e-04
Loss = 6.4654e-02, PNorm = 764.4364, GNorm = 0.3648, lr_0 = 9.9401e-04
Validation binary_cross_entropy = 0.081744
Epoch 8540
Loss = 4.7478e-02, PNorm = 764.4874, GNorm = 0.4717, lr_0 = 9.9401e-04
Validation binary_cross_entropy = 0.085935
Epoch 8541
Loss = 2.1925e-02, PNorm = 764.5364, GNorm = 2.0579, lr_0 = 9.9401e-04
Validation binary_cross_entropy = 0.098075
Epoch 8542
Loss = 2.4732e-02, PNorm = 764.5688, GNorm = 0.7076, lr_0 = 9.9401e-04
Validation binary_cross_entropy = 0.097763
Epoch 8543
Loss = 9.8816e-02, PNorm = 764.6065, GNorm = 4.4382, lr_0 = 9.9401e-04
Validation binary_cross_entropy = 0.101006
Epoch 8544
Loss = 1.3354e-02, PNorm = 764.6621, GNorm = 0.9431, lr_0 = 9.9401e-04
Validation binary_cross_entropy = 0.112500
Epoch 8545
Loss = 2.5332e-01, PNorm = 764.7098, GNorm = 3.4899, lr_0 = 9.9401e-04
Validation binary_cross_entropy = 0.087744
Epoch 8546
Loss = 2.3206e-02, PNorm = 764.7628, GNorm = 1.0248, lr_0 = 9.9401e-04
Validation binary_cross_entropy = 0.079076
Epoch 8547
Loss = 5.6502e-02, PNorm = 764.8191, GNorm = 0.3839, lr_0 = 9.9401e-04
Validation binary_cross_entropy = 0.091424
Epoch 8548
Loss = 8.1423e-02, PNorm = 764.8781, GNorm = 1.2481, lr_0 = 9.9400e-04
Validation binary_cross_entropy = 0.075380
Epoch 8549
Loss = 3.3016e-03, PNorm = 764.9313, GNorm = 0.3381, lr_0 = 9.9400e-04
Loss = 5.8396e-02, PNorm = 764.9767, GNorm = 2.5744, lr_0 = 9.9400e-04
Validation binary_cross_entropy = 0.090910
Epoch 8550
Loss = 3.8606e-02, PNorm = 765.0105, GNorm = 1.2306, lr_0 = 9.9400e-04
Validation binary_cross_entropy = 0.072193
Epoch 8551
Loss = 1.8050e-02, PNorm = 765.0427, GNorm = 1.8467, lr_0 = 9.9400e-04
Validation binary_cross_entropy = 0.076284
Epoch 8552
Loss = 2.1451e-02, PNorm = 765.0736, GNorm = 0.8367, lr_0 = 9.9400e-04
Validation binary_cross_entropy = 0.081965
Epoch 8553
Loss = 6.8073e-03, PNorm = 765.0970, GNorm = 0.0912, lr_0 = 9.9400e-04
Validation binary_cross_entropy = 0.086811
Epoch 8554
Loss = 9.6256e-02, PNorm = 765.1118, GNorm = 0.7692, lr_0 = 9.9400e-04
Validation binary_cross_entropy = 0.068828
Epoch 8555
Loss = 3.5060e-02, PNorm = 765.1487, GNorm = 0.3667, lr_0 = 9.9400e-04
Validation binary_cross_entropy = 0.070076
Epoch 8556
Loss = 1.5243e-02, PNorm = 765.1800, GNorm = 0.7651, lr_0 = 9.9400e-04
Validation binary_cross_entropy = 0.070902
Epoch 8557
Loss = 1.4349e-02, PNorm = 765.2070, GNorm = 1.2354, lr_0 = 9.9400e-04
Validation binary_cross_entropy = 0.094181
Epoch 8558
Loss = 3.6109e-03, PNorm = 765.2556, GNorm = 0.1338, lr_0 = 9.9400e-04
Validation binary_cross_entropy = 0.104255
Epoch 8559
Loss = 4.0156e-03, PNorm = 765.2961, GNorm = 0.2342, lr_0 = 9.9400e-04
Loss = 1.8191e-02, PNorm = 765.3253, GNorm = 3.7035, lr_0 = 9.9400e-04
Validation binary_cross_entropy = 0.101801
Epoch 8560
Loss = 2.1343e-02, PNorm = 765.3871, GNorm = 1.4705, lr_0 = 9.9400e-04
Validation binary_cross_entropy = 0.124148
Epoch 8561
Loss = 4.6249e-02, PNorm = 765.4328, GNorm = 1.1091, lr_0 = 9.9399e-04
Validation binary_cross_entropy = 0.129217
Epoch 8562
Loss = 4.7310e-02, PNorm = 765.4598, GNorm = 0.1137, lr_0 = 9.9399e-04
Validation binary_cross_entropy = 0.109458
Epoch 8563
Loss = 8.6052e-03, PNorm = 765.4927, GNorm = 0.0926, lr_0 = 9.9399e-04
Validation binary_cross_entropy = 0.107174
Epoch 8564
Loss = 7.4925e-02, PNorm = 765.5327, GNorm = 5.7685, lr_0 = 9.9399e-04
Validation binary_cross_entropy = 0.104776
Epoch 8565
Loss = 5.7270e-02, PNorm = 765.5724, GNorm = 0.3103, lr_0 = 9.9399e-04
Validation binary_cross_entropy = 0.089826
Epoch 8566
Loss = 1.5305e-02, PNorm = 765.6095, GNorm = 2.0337, lr_0 = 9.9399e-04
Validation binary_cross_entropy = 0.100219
Epoch 8567
Loss = 2.0376e-03, PNorm = 765.6637, GNorm = 0.0707, lr_0 = 9.9399e-04
Validation binary_cross_entropy = 0.109512
Epoch 8568
Loss = 5.4946e-02, PNorm = 765.6932, GNorm = 2.1292, lr_0 = 9.9399e-04
Validation binary_cross_entropy = 0.093304
Epoch 8569
Loss = 1.7300e-02, PNorm = 765.7190, GNorm = 0.7646, lr_0 = 9.9399e-04
Loss = 1.9331e-02, PNorm = 765.7559, GNorm = 2.6705, lr_0 = 9.9399e-04
Validation binary_cross_entropy = 0.119397
Epoch 8570
Loss = 7.1695e-02, PNorm = 765.7938, GNorm = 0.1738, lr_0 = 9.9399e-04
Validation binary_cross_entropy = 0.112664
Epoch 8571
Loss = 5.6151e-03, PNorm = 765.8266, GNorm = 0.2069, lr_0 = 9.9399e-04
Validation binary_cross_entropy = 0.105274
Epoch 8572
Loss = 2.7078e-02, PNorm = 765.8522, GNorm = 1.0087, lr_0 = 9.9399e-04
Validation binary_cross_entropy = 0.100563
Epoch 8573
Loss = 3.5617e-02, PNorm = 765.8858, GNorm = 0.1771, lr_0 = 9.9399e-04
Validation binary_cross_entropy = 0.108432
Epoch 8574
Loss = 5.8190e-03, PNorm = 765.9220, GNorm = 0.0663, lr_0 = 9.9398e-04
Validation binary_cross_entropy = 0.133173
Epoch 8575
Loss = 2.2219e-02, PNorm = 765.9749, GNorm = 4.8635, lr_0 = 9.9398e-04
Validation binary_cross_entropy = 0.137319
Epoch 8576
Loss = 2.1151e-02, PNorm = 766.0345, GNorm = 0.8539, lr_0 = 9.9398e-04
Validation binary_cross_entropy = 0.120452
Epoch 8577
Loss = 2.9715e-02, PNorm = 766.0806, GNorm = 1.2617, lr_0 = 9.9398e-04
Validation binary_cross_entropy = 0.093781
Epoch 8578
Loss = 4.7949e-02, PNorm = 766.1245, GNorm = 0.9263, lr_0 = 9.9398e-04
Validation binary_cross_entropy = 0.091416
Epoch 8579
Loss = 7.6734e-03, PNorm = 766.1706, GNorm = 0.5737, lr_0 = 9.9398e-04
Loss = 3.5016e-02, PNorm = 766.2121, GNorm = 0.9836, lr_0 = 9.9398e-04
Validation binary_cross_entropy = 0.092399
Epoch 8580
Loss = 3.4654e-02, PNorm = 766.2421, GNorm = 3.2864, lr_0 = 9.9398e-04
Validation binary_cross_entropy = 0.093015
Epoch 8581
Loss = 3.7495e-02, PNorm = 766.2815, GNorm = 0.2698, lr_0 = 9.9398e-04
Validation binary_cross_entropy = 0.099054
Epoch 8582
Loss = 2.3744e-02, PNorm = 766.3130, GNorm = 0.0671, lr_0 = 9.9398e-04
Validation binary_cross_entropy = 0.124795
Epoch 8583
Loss = 1.4752e-01, PNorm = 766.3373, GNorm = 1.9929, lr_0 = 9.9398e-04
Validation binary_cross_entropy = 0.074878
Epoch 8584
Loss = 2.1518e-02, PNorm = 766.3719, GNorm = 0.5623, lr_0 = 9.9398e-04
Validation binary_cross_entropy = 0.071030
Epoch 8585
Loss = 3.0234e-02, PNorm = 766.4166, GNorm = 0.4145, lr_0 = 9.9398e-04
Validation binary_cross_entropy = 0.082550
Epoch 8586
Loss = 3.3949e-03, PNorm = 766.4590, GNorm = 0.6805, lr_0 = 9.9398e-04
Validation binary_cross_entropy = 0.084673
Epoch 8587
Loss = 4.5295e-02, PNorm = 766.4869, GNorm = 3.6207, lr_0 = 9.9398e-04
Validation binary_cross_entropy = 0.074840
Epoch 8588
Loss = 6.0006e-03, PNorm = 766.5303, GNorm = 0.6354, lr_0 = 9.9397e-04
Validation binary_cross_entropy = 0.082146
Epoch 8589
Loss = 7.7059e-03, PNorm = 766.5737, GNorm = 0.3619, lr_0 = 9.9397e-04
Loss = 3.1924e-02, PNorm = 766.6044, GNorm = 5.5941, lr_0 = 9.9397e-04
Validation binary_cross_entropy = 0.098420
Epoch 8590
Loss = 6.1342e-03, PNorm = 766.6336, GNorm = 0.6959, lr_0 = 9.9397e-04
Validation binary_cross_entropy = 0.120193
Epoch 8591
Loss = 4.6044e-02, PNorm = 766.6499, GNorm = 0.7765, lr_0 = 9.9397e-04
Validation binary_cross_entropy = 0.096047
Epoch 8592
Loss = 1.8100e-01, PNorm = 766.6719, GNorm = 0.3128, lr_0 = 9.9397e-04
Validation binary_cross_entropy = 0.078259
Epoch 8593
Loss = 4.5360e-02, PNorm = 766.7152, GNorm = 3.9994, lr_0 = 9.9397e-04
Validation binary_cross_entropy = 0.093199
Epoch 8594
Loss = 5.4930e-02, PNorm = 766.7571, GNorm = 1.4243, lr_0 = 9.9397e-04
Validation binary_cross_entropy = 0.081889
Epoch 8595
Loss = 2.3761e-02, PNorm = 766.7994, GNorm = 2.2474, lr_0 = 9.9397e-04
Validation binary_cross_entropy = 0.087579
Epoch 8596
Loss = 3.4877e-02, PNorm = 766.8386, GNorm = 0.0816, lr_0 = 9.9397e-04
Validation binary_cross_entropy = 0.091711
Epoch 8597
Loss = 1.0959e-02, PNorm = 766.8704, GNorm = 1.2100, lr_0 = 9.9397e-04
Validation binary_cross_entropy = 0.096000
Epoch 8598
Loss = 3.2701e-03, PNorm = 766.8914, GNorm = 0.0116, lr_0 = 9.9397e-04
Validation binary_cross_entropy = 0.077473
Epoch 8599
Loss = 2.3347e-03, PNorm = 766.9098, GNorm = 0.1871, lr_0 = 9.9397e-04
Loss = 3.0089e-02, PNorm = 766.9432, GNorm = 0.9107, lr_0 = 9.9397e-04
Validation binary_cross_entropy = 0.079594
Epoch 8600
Loss = 4.0199e-02, PNorm = 766.9783, GNorm = 1.8118, lr_0 = 9.9396e-04
Validation binary_cross_entropy = 0.081014
Epoch 8601
Loss = 9.2780e-03, PNorm = 767.0078, GNorm = 0.2248, lr_0 = 9.9396e-04
Validation binary_cross_entropy = 0.090139
Epoch 8602
Loss = 1.6828e-02, PNorm = 767.0303, GNorm = 0.1696, lr_0 = 9.9396e-04
Validation binary_cross_entropy = 0.226999
Epoch 8603
Loss = 9.7162e-02, PNorm = 767.0437, GNorm = 0.2800, lr_0 = 9.9396e-04
Validation binary_cross_entropy = 0.088616
Epoch 8604
Loss = 2.4079e-02, PNorm = 767.0951, GNorm = 0.1017, lr_0 = 9.9396e-04
Validation binary_cross_entropy = 0.107953
Epoch 8605
Loss = 1.3323e-02, PNorm = 767.1450, GNorm = 0.4047, lr_0 = 9.9396e-04
Validation binary_cross_entropy = 0.066984
Epoch 8606
Loss = 3.2274e-02, PNorm = 767.2041, GNorm = 3.3969, lr_0 = 9.9396e-04
Validation binary_cross_entropy = 0.085492
Epoch 8607
Loss = 3.9431e-02, PNorm = 767.3031, GNorm = 4.5224, lr_0 = 9.9396e-04
Validation binary_cross_entropy = 0.087785
Epoch 8608
Loss = 1.0953e-01, PNorm = 767.3855, GNorm = 4.7363, lr_0 = 9.9396e-04
Validation binary_cross_entropy = 0.100171
Epoch 8609
Loss = 1.2216e-01, PNorm = 767.4531, GNorm = 4.6111, lr_0 = 9.9396e-04
Loss = 5.2913e-02, PNorm = 767.5046, GNorm = 0.3160, lr_0 = 9.9396e-04
Validation binary_cross_entropy = 0.126459
Epoch 8610
Loss = 5.6463e-02, PNorm = 767.5447, GNorm = 11.1037, lr_0 = 9.9396e-04
Validation binary_cross_entropy = 0.139893
Epoch 8611
Loss = 7.4180e-02, PNorm = 767.6062, GNorm = 0.6464, lr_0 = 9.9396e-04
Validation binary_cross_entropy = 0.088217
Epoch 8612
Loss = 4.8872e-02, PNorm = 767.6622, GNorm = 0.6852, lr_0 = 9.9396e-04
Validation binary_cross_entropy = 0.074126
Epoch 8613
Loss = 4.5619e-02, PNorm = 767.7117, GNorm = 0.5338, lr_0 = 9.9396e-04
Validation binary_cross_entropy = 0.119960
Epoch 8614
Loss = 3.1242e-02, PNorm = 767.7561, GNorm = 0.6707, lr_0 = 9.9395e-04
Validation binary_cross_entropy = 0.095251
Epoch 8615
Loss = 3.7192e-02, PNorm = 767.7842, GNorm = 3.3631, lr_0 = 9.9395e-04
Validation binary_cross_entropy = 0.090167
Epoch 8616
Loss = 4.6920e-02, PNorm = 767.8167, GNorm = 2.1232, lr_0 = 9.9395e-04
Validation binary_cross_entropy = 0.088458
Epoch 8617
Loss = 7.7327e-03, PNorm = 767.8410, GNorm = 0.2419, lr_0 = 9.9395e-04
Validation binary_cross_entropy = 0.084691
Epoch 8618
Loss = 1.9122e-02, PNorm = 767.8658, GNorm = 0.2557, lr_0 = 9.9395e-04
Validation binary_cross_entropy = 0.083629
Epoch 8619
Loss = 4.2033e-02, PNorm = 767.8912, GNorm = 3.4901, lr_0 = 9.9395e-04
Loss = 1.4301e-02, PNorm = 767.9310, GNorm = 0.5704, lr_0 = 9.9395e-04
Validation binary_cross_entropy = 0.097503
Epoch 8620
Loss = 8.8629e-02, PNorm = 767.9707, GNorm = 1.3731, lr_0 = 9.9395e-04
Validation binary_cross_entropy = 0.107757
Epoch 8621
Loss = 2.2622e-02, PNorm = 768.0013, GNorm = 0.2395, lr_0 = 9.9395e-04
Validation binary_cross_entropy = 0.126816
Epoch 8622
Loss = 5.8272e-02, PNorm = 768.0359, GNorm = 0.1675, lr_0 = 9.9395e-04
Validation binary_cross_entropy = 0.123535
Epoch 8623
Loss = 7.9287e-02, PNorm = 768.0719, GNorm = 1.0854, lr_0 = 9.9395e-04
Validation binary_cross_entropy = 0.076335
Epoch 8624
Loss = 7.4189e-02, PNorm = 768.1261, GNorm = 0.4901, lr_0 = 9.9395e-04
Validation binary_cross_entropy = 0.087165
Epoch 8625
Loss = 3.9595e-02, PNorm = 768.1822, GNorm = 2.8157, lr_0 = 9.9395e-04
Validation binary_cross_entropy = 0.087894
Epoch 8626
Loss = 9.5336e-03, PNorm = 768.2210, GNorm = 0.2101, lr_0 = 9.9395e-04
Validation binary_cross_entropy = 0.089595
Epoch 8627
Loss = 8.1330e-02, PNorm = 768.2548, GNorm = 0.2190, lr_0 = 9.9395e-04
Validation binary_cross_entropy = 0.087773
Epoch 8628
Loss = 4.8043e-03, PNorm = 768.2892, GNorm = 0.1210, lr_0 = 9.9394e-04
Validation binary_cross_entropy = 0.091221
Epoch 8629
Loss = 9.8405e-03, PNorm = 768.3365, GNorm = 0.3099, lr_0 = 9.9394e-04
Loss = 6.5928e-02, PNorm = 768.3771, GNorm = 0.1999, lr_0 = 9.9394e-04
Validation binary_cross_entropy = 0.096387
Epoch 8630
Loss = 2.0797e-01, PNorm = 768.4300, GNorm = 0.7486, lr_0 = 9.9394e-04
Validation binary_cross_entropy = 0.077316
Epoch 8631
Loss = 6.8792e-02, PNorm = 768.5028, GNorm = 2.8714, lr_0 = 9.9394e-04
Validation binary_cross_entropy = 0.080468
Epoch 8632
Loss = 1.8332e-02, PNorm = 768.5572, GNorm = 1.2161, lr_0 = 9.9394e-04
Validation binary_cross_entropy = 0.078415
Epoch 8633
Loss = 1.6970e-02, PNorm = 768.5985, GNorm = 0.8701, lr_0 = 9.9394e-04
Validation binary_cross_entropy = 0.086466
Epoch 8634
Loss = 4.0262e-02, PNorm = 768.6266, GNorm = 0.1127, lr_0 = 9.9394e-04
Validation binary_cross_entropy = 0.081738
Epoch 8635
Loss = 1.1120e-02, PNorm = 768.6444, GNorm = 1.3210, lr_0 = 9.9394e-04
Validation binary_cross_entropy = 0.080693
Epoch 8636
Loss = 6.5955e-02, PNorm = 768.6680, GNorm = 7.0485, lr_0 = 9.9394e-04
Validation binary_cross_entropy = 0.089568
Epoch 8637
Loss = 2.3946e-02, PNorm = 768.6969, GNorm = 1.1820, lr_0 = 9.9394e-04
Validation binary_cross_entropy = 0.081979
Epoch 8638
Loss = 4.3774e-02, PNorm = 768.7191, GNorm = 1.5635, lr_0 = 9.9394e-04
Validation binary_cross_entropy = 0.076981
Epoch 8639
Loss = 1.2084e-01, PNorm = 768.7512, GNorm = 6.0130, lr_0 = 9.9394e-04
Loss = 3.8560e-02, PNorm = 768.7963, GNorm = 5.9052, lr_0 = 9.9394e-04
Validation binary_cross_entropy = 0.096184
Epoch 8640
Loss = 1.1293e-01, PNorm = 768.8368, GNorm = 7.7367, lr_0 = 9.9393e-04
Validation binary_cross_entropy = 0.090056
Epoch 8641
Loss = 3.8278e-02, PNorm = 768.8744, GNorm = 0.6780, lr_0 = 9.9393e-04
Validation binary_cross_entropy = 0.070783
Epoch 8642
Loss = 1.5776e-02, PNorm = 768.9173, GNorm = 0.6529, lr_0 = 9.9393e-04
Validation binary_cross_entropy = 0.081052
Epoch 8643
Loss = 1.1365e-02, PNorm = 768.9600, GNorm = 0.3170, lr_0 = 9.9393e-04
Validation binary_cross_entropy = 0.103505
Epoch 8644
Loss = 9.0352e-03, PNorm = 768.9905, GNorm = 0.1948, lr_0 = 9.9393e-04
Validation binary_cross_entropy = 0.166607
Epoch 8645
Loss = 3.3139e-02, PNorm = 769.0129, GNorm = 0.1411, lr_0 = 9.9393e-04
Validation binary_cross_entropy = 0.149961
Epoch 8646
Loss = 4.2860e-01, PNorm = 769.0281, GNorm = 30.0510, lr_0 = 9.9393e-04
Validation binary_cross_entropy = 0.092020
Epoch 8647
Loss = 3.3330e-02, PNorm = 769.0763, GNorm = 4.6007, lr_0 = 9.9393e-04
Validation binary_cross_entropy = 0.109898
Epoch 8648
Loss = 2.4305e-02, PNorm = 769.1440, GNorm = 2.1918, lr_0 = 9.9393e-04
Validation binary_cross_entropy = 0.127024
Epoch 8649
Loss = 8.3376e-02, PNorm = 769.1829, GNorm = 2.5723, lr_0 = 9.9393e-04
Loss = 5.9731e-02, PNorm = 769.2144, GNorm = 0.4668, lr_0 = 9.9393e-04
Validation binary_cross_entropy = 0.109679
Epoch 8650
Loss = 4.2755e-02, PNorm = 769.2406, GNorm = 0.0596, lr_0 = 9.9393e-04
Validation binary_cross_entropy = 0.079425
Epoch 8651
Loss = 4.8242e-02, PNorm = 769.2716, GNorm = 1.3025, lr_0 = 9.9393e-04
Validation binary_cross_entropy = 0.086458
Epoch 8652
Loss = 3.3372e-02, PNorm = 769.3007, GNorm = 1.7241, lr_0 = 9.9393e-04
Validation binary_cross_entropy = 0.092720
Epoch 8653
Loss = 5.4116e-02, PNorm = 769.3289, GNorm = 0.9932, lr_0 = 9.9393e-04
Validation binary_cross_entropy = 0.095026
Epoch 8654
Loss = 2.9618e-02, PNorm = 769.3605, GNorm = 4.0182, lr_0 = 9.9392e-04
Validation binary_cross_entropy = 0.091939
Epoch 8655
Loss = 3.5630e-02, PNorm = 769.3969, GNorm = 1.1677, lr_0 = 9.9392e-04
Validation binary_cross_entropy = 0.077406
Epoch 8656
Loss = 5.5864e-02, PNorm = 769.4446, GNorm = 1.1421, lr_0 = 9.9392e-04
Validation binary_cross_entropy = 0.094261
Epoch 8657
Loss = 1.2502e-02, PNorm = 769.4893, GNorm = 1.6672, lr_0 = 9.9392e-04
Validation binary_cross_entropy = 0.095884
Epoch 8658
Loss = 4.9401e-02, PNorm = 769.5224, GNorm = 1.6287, lr_0 = 9.9392e-04
Validation binary_cross_entropy = 0.080042
Epoch 8659
Loss = 6.5814e-02, PNorm = 769.5467, GNorm = 2.0662, lr_0 = 9.9392e-04
Loss = 1.9822e-02, PNorm = 769.5831, GNorm = 1.1115, lr_0 = 9.9392e-04
Validation binary_cross_entropy = 0.092254
Epoch 8660
Loss = 2.6913e-02, PNorm = 769.6095, GNorm = 0.3455, lr_0 = 9.9392e-04
Validation binary_cross_entropy = 0.093015
Epoch 8661
Loss = 2.9657e-02, PNorm = 769.6272, GNorm = 0.3481, lr_0 = 9.9392e-04
Validation binary_cross_entropy = 0.086190
Epoch 8662
Loss = 1.6021e-01, PNorm = 769.6537, GNorm = 0.3661, lr_0 = 9.9392e-04
Validation binary_cross_entropy = 0.073226
Epoch 8663
Loss = 2.2117e-02, PNorm = 769.6907, GNorm = 0.2831, lr_0 = 9.9392e-04
Validation binary_cross_entropy = 0.087802
Epoch 8664
Loss = 1.7881e-02, PNorm = 769.7192, GNorm = 1.8634, lr_0 = 9.9392e-04
Validation binary_cross_entropy = 0.087612
Epoch 8665
Loss = 8.2714e-03, PNorm = 769.7383, GNorm = 0.1415, lr_0 = 9.9392e-04
Validation binary_cross_entropy = 0.094686
Epoch 8666
Loss = 1.1172e-02, PNorm = 769.7574, GNorm = 0.1248, lr_0 = 9.9392e-04
Validation binary_cross_entropy = 0.096145
Epoch 8667
Loss = 1.6172e-02, PNorm = 769.7772, GNorm = 1.0942, lr_0 = 9.9391e-04
Validation binary_cross_entropy = 0.102097
Epoch 8668
Loss = 2.3440e-02, PNorm = 769.7935, GNorm = 2.6917, lr_0 = 9.9391e-04
Validation binary_cross_entropy = 0.090643
Epoch 8669
Loss = 8.8456e-03, PNorm = 769.8076, GNorm = 0.9326, lr_0 = 9.9391e-04
Loss = 1.5195e-02, PNorm = 769.8252, GNorm = 0.2623, lr_0 = 9.9391e-04
Validation binary_cross_entropy = 0.096743
Epoch 8670
Loss = 3.8587e-02, PNorm = 769.8340, GNorm = 1.1187, lr_0 = 9.9391e-04
Validation binary_cross_entropy = 0.086171
Epoch 8671
Loss = 5.9067e-02, PNorm = 769.8654, GNorm = 1.3246, lr_0 = 9.9391e-04
Validation binary_cross_entropy = 0.111760
Epoch 8672
Loss = 3.8527e-02, PNorm = 769.8965, GNorm = 3.2269, lr_0 = 9.9391e-04
Validation binary_cross_entropy = 0.128614
Epoch 8673
Loss = 4.0252e-02, PNorm = 769.9126, GNorm = 0.7908, lr_0 = 9.9391e-04
Validation binary_cross_entropy = 0.107566
Epoch 8674
Loss = 4.2452e-02, PNorm = 769.9292, GNorm = 0.1113, lr_0 = 9.9391e-04
Validation binary_cross_entropy = 0.085156
Epoch 8675
Loss = 1.4002e-02, PNorm = 769.9501, GNorm = 0.1071, lr_0 = 9.9391e-04
Validation binary_cross_entropy = 0.088281
Epoch 8676
Loss = 1.9086e-02, PNorm = 769.9785, GNorm = 0.2157, lr_0 = 9.9391e-04
Validation binary_cross_entropy = 0.085544
Epoch 8677
Loss = 2.4135e-02, PNorm = 770.0109, GNorm = 0.1805, lr_0 = 9.9391e-04
Validation binary_cross_entropy = 0.086550
Epoch 8678
Loss = 4.4555e-03, PNorm = 770.0452, GNorm = 0.1882, lr_0 = 9.9391e-04
Validation binary_cross_entropy = 0.083927
Epoch 8679
Loss = 2.9986e-03, PNorm = 770.0718, GNorm = 0.1579, lr_0 = 9.9391e-04
Loss = 1.4292e-02, PNorm = 770.1022, GNorm = 1.0024, lr_0 = 9.9391e-04
Validation binary_cross_entropy = 0.086301
Epoch 8680
Loss = 2.9646e-02, PNorm = 770.1288, GNorm = 2.7851, lr_0 = 9.9390e-04
Validation binary_cross_entropy = 0.088176
Epoch 8681
Loss = 5.8022e-02, PNorm = 770.1566, GNorm = 1.1553, lr_0 = 9.9390e-04
Validation binary_cross_entropy = 0.084245
Epoch 8682
Loss = 3.0775e-02, PNorm = 770.1951, GNorm = 2.9519, lr_0 = 9.9390e-04
Validation binary_cross_entropy = 0.079813
Epoch 8683
Loss = 1.0706e-02, PNorm = 770.2315, GNorm = 1.5860, lr_0 = 9.9390e-04
Validation binary_cross_entropy = 0.089181
Epoch 8684
Loss = 2.7425e-02, PNorm = 770.2710, GNorm = 0.0940, lr_0 = 9.9390e-04
Validation binary_cross_entropy = 0.109204
Epoch 8685
Loss = 1.0100e-02, PNorm = 770.3157, GNorm = 0.8144, lr_0 = 9.9390e-04
Validation binary_cross_entropy = 0.140645
Epoch 8686
Loss = 5.0857e-03, PNorm = 770.3573, GNorm = 0.0285, lr_0 = 9.9390e-04
Validation binary_cross_entropy = 0.204413
Epoch 8687
Loss = 5.4246e-02, PNorm = 770.3902, GNorm = 7.0097, lr_0 = 9.9390e-04
Validation binary_cross_entropy = 0.240144
Epoch 8688
Loss = 9.8553e-02, PNorm = 770.4269, GNorm = 0.0201, lr_0 = 9.9390e-04
Validation binary_cross_entropy = 0.187228
Epoch 8689
Loss = 8.9055e-03, PNorm = 770.4609, GNorm = 0.7031, lr_0 = 9.9390e-04
Loss = 4.1238e-02, PNorm = 770.4968, GNorm = 4.4835, lr_0 = 9.9390e-04
Validation binary_cross_entropy = 0.162638
Epoch 8690
Loss = 6.0379e-02, PNorm = 770.5449, GNorm = 0.9906, lr_0 = 9.9390e-04
Validation binary_cross_entropy = 0.171810
Epoch 8691
Loss = 1.5339e-02, PNorm = 770.5882, GNorm = 0.0883, lr_0 = 9.9390e-04
Validation binary_cross_entropy = 0.161589
Epoch 8692
Loss = 1.3764e-02, PNorm = 770.6156, GNorm = 0.5269, lr_0 = 9.9390e-04
Validation binary_cross_entropy = 0.150770
Epoch 8693
Loss = 2.1867e-02, PNorm = 770.6472, GNorm = 0.0910, lr_0 = 9.9390e-04
Validation binary_cross_entropy = 0.136111
Epoch 8694
Loss = 2.0721e-01, PNorm = 770.6894, GNorm = 6.3455, lr_0 = 9.9389e-04
Validation binary_cross_entropy = 0.099215
Epoch 8695
Loss = 3.5906e-02, PNorm = 770.7565, GNorm = 0.7299, lr_0 = 9.9389e-04
Validation binary_cross_entropy = 0.100060
Epoch 8696
Loss = 1.0892e-02, PNorm = 770.8050, GNorm = 0.3734, lr_0 = 9.9389e-04
Validation binary_cross_entropy = 0.096398
Epoch 8697
Loss = 9.1435e-03, PNorm = 770.8460, GNorm = 0.5409, lr_0 = 9.9389e-04
Validation binary_cross_entropy = 0.097386
Epoch 8698
Loss = 3.6209e-02, PNorm = 770.8843, GNorm = 2.0705, lr_0 = 9.9389e-04
Validation binary_cross_entropy = 0.111831
Epoch 8699
Loss = 1.6182e-02, PNorm = 770.9285, GNorm = 1.0247, lr_0 = 9.9389e-04
Loss = 7.7455e-02, PNorm = 770.9580, GNorm = 2.4377, lr_0 = 9.9389e-04
Validation binary_cross_entropy = 0.097151
Epoch 8700
Loss = 2.0932e-02, PNorm = 770.9935, GNorm = 0.3538, lr_0 = 9.9389e-04
Validation binary_cross_entropy = 0.088061
Epoch 8701
Loss = 4.5574e-02, PNorm = 771.0484, GNorm = 7.3203, lr_0 = 9.9389e-04
Validation binary_cross_entropy = 0.096999
Epoch 8702
Loss = 2.7472e-02, PNorm = 771.1004, GNorm = 1.0449, lr_0 = 9.9389e-04
Validation binary_cross_entropy = 0.103441
Epoch 8703
Loss = 1.8086e-02, PNorm = 771.1398, GNorm = 0.2966, lr_0 = 9.9389e-04
Validation binary_cross_entropy = 0.124251
Epoch 8704
Loss = 1.2160e-02, PNorm = 771.1718, GNorm = 0.2520, lr_0 = 9.9389e-04
Validation binary_cross_entropy = 0.121422
Epoch 8705
Loss = 3.3510e-01, PNorm = 771.2106, GNorm = 0.6810, lr_0 = 9.9389e-04
Validation binary_cross_entropy = 0.070027
Epoch 8706
Loss = 4.2540e-02, PNorm = 771.3017, GNorm = 0.1625, lr_0 = 9.9389e-04
Validation binary_cross_entropy = 0.109084
Epoch 8707
Loss = 7.0706e-02, PNorm = 771.3723, GNorm = 2.1936, lr_0 = 9.9388e-04
Validation binary_cross_entropy = 0.082286
Epoch 8708
Loss = 1.1513e-02, PNorm = 771.4232, GNorm = 0.4857, lr_0 = 9.9388e-04
Validation binary_cross_entropy = 0.089089
Epoch 8709
Loss = 3.4391e-02, PNorm = 771.4725, GNorm = 1.2814, lr_0 = 9.9388e-04
Loss = 2.5541e-02, PNorm = 771.5086, GNorm = 0.2478, lr_0 = 9.9388e-04
Validation binary_cross_entropy = 0.095543
Epoch 8710
Loss = 4.8079e-02, PNorm = 771.5388, GNorm = 0.1783, lr_0 = 9.9388e-04
Validation binary_cross_entropy = 0.087809
Epoch 8711
Loss = 3.7568e-02, PNorm = 771.5797, GNorm = 1.2013, lr_0 = 9.9388e-04
Validation binary_cross_entropy = 0.123663
Epoch 8712
Loss = 8.4502e-02, PNorm = 771.6091, GNorm = 0.2890, lr_0 = 9.9388e-04
Validation binary_cross_entropy = 0.098155
Epoch 8713
Loss = 4.2592e-02, PNorm = 771.6324, GNorm = 1.3317, lr_0 = 9.9388e-04
Validation binary_cross_entropy = 0.073886
Epoch 8714
Loss = 5.0988e-02, PNorm = 771.6653, GNorm = 1.0499, lr_0 = 9.9388e-04
Validation binary_cross_entropy = 0.082107
Epoch 8715
Loss = 7.4284e-02, PNorm = 771.7102, GNorm = 12.8147, lr_0 = 9.9388e-04
Validation binary_cross_entropy = 0.076932
Epoch 8716
Loss = 4.0898e-01, PNorm = 771.7737, GNorm = 0.8330, lr_0 = 9.9388e-04
Validation binary_cross_entropy = 0.076230
Epoch 8717
Loss = 5.9371e-02, PNorm = 771.8949, GNorm = 1.3016, lr_0 = 9.9388e-04
Validation binary_cross_entropy = 0.150008
Epoch 8718
Loss = 7.5183e-02, PNorm = 772.0049, GNorm = 2.4067, lr_0 = 9.9388e-04
Validation binary_cross_entropy = 0.113514
Epoch 8719
Loss = 1.5258e-02, PNorm = 772.0822, GNorm = 1.3289, lr_0 = 9.9388e-04
Loss = 6.4448e-02, PNorm = 772.1489, GNorm = 2.2095, lr_0 = 9.9388e-04
Validation binary_cross_entropy = 0.155915
Epoch 8720
Loss = 1.1539e-01, PNorm = 772.1969, GNorm = 2.9416, lr_0 = 9.9387e-04
Validation binary_cross_entropy = 0.137374
Epoch 8721
Loss = 3.4912e-02, PNorm = 772.2344, GNorm = 2.0566, lr_0 = 9.9387e-04
Validation binary_cross_entropy = 0.114451
Epoch 8722
Loss = 5.8394e-02, PNorm = 772.2786, GNorm = 1.4386, lr_0 = 9.9387e-04
Validation binary_cross_entropy = 0.134120
Epoch 8723
Loss = 4.6487e-02, PNorm = 772.3123, GNorm = 2.0281, lr_0 = 9.9387e-04
Validation binary_cross_entropy = 0.114802
Epoch 8724
Loss = 1.7199e-02, PNorm = 772.3356, GNorm = 0.3776, lr_0 = 9.9387e-04
Validation binary_cross_entropy = 0.108719
Epoch 8725
Loss = 3.7531e-02, PNorm = 772.3704, GNorm = 3.6561, lr_0 = 9.9387e-04
Validation binary_cross_entropy = 0.128579
Epoch 8726
Loss = 2.4178e-02, PNorm = 772.4117, GNorm = 0.6729, lr_0 = 9.9387e-04
Validation binary_cross_entropy = 0.129219
Epoch 8727
Loss = 8.8480e-03, PNorm = 772.4514, GNorm = 0.0872, lr_0 = 9.9387e-04
Validation binary_cross_entropy = 0.123007
Epoch 8728
Loss = 4.8016e-03, PNorm = 772.4843, GNorm = 0.3906, lr_0 = 9.9387e-04
Validation binary_cross_entropy = 0.136265
Epoch 8729
Loss = 1.5703e-03, PNorm = 772.5195, GNorm = 0.1156, lr_0 = 9.9387e-04
Loss = 4.0811e-02, PNorm = 772.5565, GNorm = 2.7144, lr_0 = 9.9387e-04
Validation binary_cross_entropy = 0.129581
Epoch 8730
Loss = 6.7097e-02, PNorm = 772.6196, GNorm = 1.2414, lr_0 = 9.9387e-04
Validation binary_cross_entropy = 0.110020
Epoch 8731
Loss = 4.5954e-02, PNorm = 772.7256, GNorm = 0.6538, lr_0 = 9.9387e-04
Validation binary_cross_entropy = 0.128779
Epoch 8732
Loss = 2.1079e-02, PNorm = 772.7952, GNorm = 1.0182, lr_0 = 9.9387e-04
Validation binary_cross_entropy = 0.133324
Epoch 8733
Loss = 5.9674e-02, PNorm = 772.8396, GNorm = 1.4816, lr_0 = 9.9386e-04
Validation binary_cross_entropy = 0.119173
Epoch 8734
Loss = 1.4568e-02, PNorm = 772.8749, GNorm = 0.9146, lr_0 = 9.9386e-04
Validation binary_cross_entropy = 0.122568
Epoch 8735
Loss = 2.1995e-02, PNorm = 772.9122, GNorm = 0.0997, lr_0 = 9.9386e-04
Validation binary_cross_entropy = 0.137349
Epoch 8736
Loss = 1.9584e-02, PNorm = 772.9511, GNorm = 0.4848, lr_0 = 9.9386e-04
Validation binary_cross_entropy = 0.257602
Epoch 8737
Loss = 1.8867e-01, PNorm = 773.0222, GNorm = 11.8776, lr_0 = 9.9386e-04
Validation binary_cross_entropy = 0.147402
Epoch 8738
Loss = 4.8673e-03, PNorm = 773.0900, GNorm = 0.3769, lr_0 = 9.9386e-04
Validation binary_cross_entropy = 0.167371
Epoch 8739
Loss = 3.1328e-03, PNorm = 773.1635, GNorm = 0.1542, lr_0 = 9.9386e-04
Loss = 7.4265e-02, PNorm = 773.2334, GNorm = 0.1224, lr_0 = 9.9386e-04
Validation binary_cross_entropy = 0.158056
Epoch 8740
Loss = 4.5283e-02, PNorm = 773.2832, GNorm = 7.3332, lr_0 = 9.9386e-04
Validation binary_cross_entropy = 0.124445
Epoch 8741
Loss = 3.7964e-02, PNorm = 773.3406, GNorm = 0.4770, lr_0 = 9.9386e-04
Validation binary_cross_entropy = 0.123911
Epoch 8742
Loss = 1.5123e-02, PNorm = 773.3954, GNorm = 0.2017, lr_0 = 9.9386e-04
Validation binary_cross_entropy = 0.195632
Epoch 8743
Loss = 3.9124e-02, PNorm = 773.4423, GNorm = 0.1142, lr_0 = 9.9386e-04
Validation binary_cross_entropy = 0.129925
Epoch 8744
Loss = 7.9155e-02, PNorm = 773.4882, GNorm = 0.3484, lr_0 = 9.9386e-04
Validation binary_cross_entropy = 0.123620
Epoch 8745
Loss = 4.1571e-01, PNorm = 773.5636, GNorm = 1.5645, lr_0 = 9.9386e-04
Validation binary_cross_entropy = 0.152975
Epoch 8746
Loss = 1.5391e-02, PNorm = 773.6410, GNorm = 2.2526, lr_0 = 9.9386e-04
Validation binary_cross_entropy = 0.173641
Epoch 8747
Loss = 2.0042e-01, PNorm = 773.7121, GNorm = 0.2996, lr_0 = 9.9385e-04
Validation binary_cross_entropy = 0.112923
Epoch 8748
Loss = 3.0393e-02, PNorm = 773.7757, GNorm = 5.7039, lr_0 = 9.9385e-04
Validation binary_cross_entropy = 0.125016
Epoch 8749
Loss = 2.7606e-01, PNorm = 773.8495, GNorm = 5.5842, lr_0 = 9.9385e-04
Loss = 4.4809e-02, PNorm = 773.9075, GNorm = 0.3364, lr_0 = 9.9385e-04
Validation binary_cross_entropy = 0.115344
Epoch 8750
Loss = 8.3937e-02, PNorm = 773.9728, GNorm = 1.6999, lr_0 = 9.9385e-04
Validation binary_cross_entropy = 0.100014
Epoch 8751
Loss = 2.7149e-02, PNorm = 774.0408, GNorm = 1.0323, lr_0 = 9.9385e-04
Validation binary_cross_entropy = 0.094195
Epoch 8752
Loss = 7.9610e-02, PNorm = 774.0925, GNorm = 3.2238, lr_0 = 9.9385e-04
Validation binary_cross_entropy = 0.106535
Epoch 8753
Loss = 8.9970e-02, PNorm = 774.1585, GNorm = 9.4417, lr_0 = 9.9385e-04
Validation binary_cross_entropy = 0.133627
Epoch 8754
Loss = 7.7022e-02, PNorm = 774.2564, GNorm = 2.6905, lr_0 = 9.9385e-04
Validation binary_cross_entropy = 0.159244
Epoch 8755
Loss = 9.4093e-02, PNorm = 774.3379, GNorm = 2.2085, lr_0 = 9.9385e-04
Validation binary_cross_entropy = 0.136088
Epoch 8756
Loss = 9.6521e-02, PNorm = 774.3975, GNorm = 6.0507, lr_0 = 9.9385e-04
Validation binary_cross_entropy = 0.128572
Epoch 8757
Loss = 4.5090e-02, PNorm = 774.4612, GNorm = 4.0809, lr_0 = 9.9385e-04
Validation binary_cross_entropy = 0.131322
Epoch 8758
Loss = 1.3536e-02, PNorm = 774.5092, GNorm = 0.3178, lr_0 = 9.9385e-04
Validation binary_cross_entropy = 0.109274
Epoch 8759
Loss = 3.4190e-03, PNorm = 774.5951, GNorm = 0.2206, lr_0 = 9.9385e-04
Loss = 1.2364e-01, PNorm = 774.6894, GNorm = 4.0120, lr_0 = 9.9385e-04
Validation binary_cross_entropy = 0.143181
Epoch 8760
Loss = 5.6895e-02, PNorm = 774.7497, GNorm = 1.2647, lr_0 = 9.9384e-04
Validation binary_cross_entropy = 0.119127
Epoch 8761
Loss = 2.8968e-02, PNorm = 774.7909, GNorm = 4.0256, lr_0 = 9.9384e-04
Validation binary_cross_entropy = 0.124554
Epoch 8762
Loss = 7.5705e-02, PNorm = 774.8327, GNorm = 1.2430, lr_0 = 9.9384e-04
Validation binary_cross_entropy = 0.124324
Epoch 8763
Loss = 4.9929e-02, PNorm = 774.8706, GNorm = 1.1790, lr_0 = 9.9384e-04
Validation binary_cross_entropy = 0.105086
Epoch 8764
Loss = 1.8688e-01, PNorm = 774.9161, GNorm = 18.4550, lr_0 = 9.9384e-04
Validation binary_cross_entropy = 0.103280
Epoch 8765
Loss = 6.0674e-02, PNorm = 774.9725, GNorm = 5.7998, lr_0 = 9.9384e-04
Validation binary_cross_entropy = 0.107558
Epoch 8766
Loss = 5.8482e-02, PNorm = 775.0341, GNorm = 3.8061, lr_0 = 9.9384e-04
Validation binary_cross_entropy = 0.104881
Epoch 8767
Loss = 2.9007e-02, PNorm = 775.0731, GNorm = 0.5832, lr_0 = 9.9384e-04
Validation binary_cross_entropy = 0.095174
Epoch 8768
Loss = 3.0002e-03, PNorm = 775.1121, GNorm = 0.1034, lr_0 = 9.9384e-04
Validation binary_cross_entropy = 0.090044
Epoch 8769
Loss = 1.3419e-02, PNorm = 775.1633, GNorm = 1.5681, lr_0 = 9.9384e-04
Loss = 6.8025e-02, PNorm = 775.2061, GNorm = 0.6559, lr_0 = 9.9384e-04
Validation binary_cross_entropy = 0.115141
Epoch 8770
Loss = 2.3490e-02, PNorm = 775.2388, GNorm = 1.7159, lr_0 = 9.9384e-04
Validation binary_cross_entropy = 0.116924
Epoch 8771
Loss = 2.5824e-02, PNorm = 775.2651, GNorm = 0.2430, lr_0 = 9.9384e-04
Validation binary_cross_entropy = 0.109315
Epoch 8772
Loss = 5.2363e-02, PNorm = 775.2834, GNorm = 0.8462, lr_0 = 9.9384e-04
Validation binary_cross_entropy = 0.096186
Epoch 8773
Loss = 4.1809e-02, PNorm = 775.3043, GNorm = 5.0926, lr_0 = 9.9383e-04
Validation binary_cross_entropy = 0.085733
Epoch 8774
Loss = 5.2433e-02, PNorm = 775.3405, GNorm = 9.2138, lr_0 = 9.9383e-04
Validation binary_cross_entropy = 0.090555
Epoch 8775
Loss = 2.9412e-02, PNorm = 775.3729, GNorm = 0.3444, lr_0 = 9.9383e-04
Validation binary_cross_entropy = 0.091765
Epoch 8776
Loss = 1.9545e-02, PNorm = 775.4049, GNorm = 0.4868, lr_0 = 9.9383e-04
Validation binary_cross_entropy = 0.086412
Epoch 8777
Loss = 1.8135e-02, PNorm = 775.4329, GNorm = 2.0099, lr_0 = 9.9383e-04
Validation binary_cross_entropy = 0.102090
Epoch 8778
Loss = 3.2733e-02, PNorm = 775.4692, GNorm = 0.5637, lr_0 = 9.9383e-04
Validation binary_cross_entropy = 0.112456
Epoch 8779
Loss = 5.5282e-03, PNorm = 775.5005, GNorm = 0.2112, lr_0 = 9.9383e-04
Loss = 5.6440e-02, PNorm = 775.5193, GNorm = 3.4928, lr_0 = 9.9383e-04
Validation binary_cross_entropy = 0.100654
Epoch 8780
Loss = 4.7180e-02, PNorm = 775.5379, GNorm = 0.4192, lr_0 = 9.9383e-04
Validation binary_cross_entropy = 0.087895
Epoch 8781
Loss = 1.9830e-02, PNorm = 775.5696, GNorm = 0.2069, lr_0 = 9.9383e-04
Validation binary_cross_entropy = 0.090641
Epoch 8782
Loss = 2.4636e-02, PNorm = 775.5920, GNorm = 4.1599, lr_0 = 9.9383e-04
Validation binary_cross_entropy = 0.089118
Epoch 8783
Loss = 6.2236e-02, PNorm = 775.6101, GNorm = 3.6453, lr_0 = 9.9383e-04
Validation binary_cross_entropy = 0.087624
Epoch 8784
Loss = 1.3344e-02, PNorm = 775.6325, GNorm = 0.9780, lr_0 = 9.9383e-04
Validation binary_cross_entropy = 0.088762
Epoch 8785
Loss = 2.9361e-02, PNorm = 775.6566, GNorm = 0.3695, lr_0 = 9.9383e-04
Validation binary_cross_entropy = 0.091492
Epoch 8786
Loss = 1.7921e-02, PNorm = 775.6857, GNorm = 1.2398, lr_0 = 9.9383e-04
Validation binary_cross_entropy = 0.114984
Epoch 8787
Loss = 4.8805e-02, PNorm = 775.7122, GNorm = 1.5168, lr_0 = 9.9382e-04
Validation binary_cross_entropy = 0.090230
Epoch 8788
Loss = 6.0120e-03, PNorm = 775.7325, GNorm = 0.3392, lr_0 = 9.9382e-04
Validation binary_cross_entropy = 0.088456
Epoch 8789
Loss = 4.6041e-02, PNorm = 775.7784, GNorm = 3.9912, lr_0 = 9.9382e-04
Loss = 2.5548e-02, PNorm = 775.8353, GNorm = 0.8716, lr_0 = 9.9382e-04
Validation binary_cross_entropy = 0.115075
Epoch 8790
Loss = 2.7870e-02, PNorm = 775.8633, GNorm = 1.5370, lr_0 = 9.9382e-04
Validation binary_cross_entropy = 0.103362
Epoch 8791
Loss = 6.5201e-02, PNorm = 775.8754, GNorm = 2.6455, lr_0 = 9.9382e-04
Validation binary_cross_entropy = 0.095290
Epoch 8792
Loss = 1.7784e-02, PNorm = 775.8945, GNorm = 0.1497, lr_0 = 9.9382e-04
Validation binary_cross_entropy = 0.098812
Epoch 8793
Loss = 1.6157e-02, PNorm = 775.9210, GNorm = 0.3112, lr_0 = 9.9382e-04
Validation binary_cross_entropy = 0.112890
Epoch 8794
Loss = 6.8522e-02, PNorm = 775.9467, GNorm = 0.7797, lr_0 = 9.9382e-04
Validation binary_cross_entropy = 0.130690
Epoch 8795
Loss = 4.9389e-02, PNorm = 775.9670, GNorm = 0.7892, lr_0 = 9.9382e-04
Validation binary_cross_entropy = 0.105259
Epoch 8796
Loss = 4.5441e-02, PNorm = 775.9919, GNorm = 0.4968, lr_0 = 9.9382e-04
Validation binary_cross_entropy = 0.109069
Epoch 8797
Loss = 9.6969e-03, PNorm = 776.0239, GNorm = 0.3250, lr_0 = 9.9382e-04
Validation binary_cross_entropy = 0.118432
Epoch 8798
Loss = 2.9659e-03, PNorm = 776.0475, GNorm = 0.0554, lr_0 = 9.9382e-04
Validation binary_cross_entropy = 0.132249
Epoch 8799
Loss = 1.6552e-03, PNorm = 776.0625, GNorm = 0.0790, lr_0 = 9.9382e-04
Loss = 2.6578e-02, PNorm = 776.0758, GNorm = 0.1207, lr_0 = 9.9382e-04
Validation binary_cross_entropy = 0.180624
Epoch 8800
Loss = 1.5209e-02, PNorm = 776.0905, GNorm = 0.9677, lr_0 = 9.9381e-04
Validation binary_cross_entropy = 0.132007
Epoch 8801
Loss = 8.3625e-02, PNorm = 776.1426, GNorm = 0.0875, lr_0 = 9.9381e-04
Validation binary_cross_entropy = 0.120874
Epoch 8802
Loss = 1.0194e-02, PNorm = 776.1877, GNorm = 0.2886, lr_0 = 9.9381e-04
Validation binary_cross_entropy = 0.118674
Epoch 8803
Loss = 2.7671e-02, PNorm = 776.2148, GNorm = 1.8846, lr_0 = 9.9381e-04
Validation binary_cross_entropy = 0.112209
Epoch 8804
Loss = 2.4698e-02, PNorm = 776.2348, GNorm = 0.1492, lr_0 = 9.9381e-04
Validation binary_cross_entropy = 0.106837
Epoch 8805
Loss = 3.4593e-02, PNorm = 776.2558, GNorm = 5.2802, lr_0 = 9.9381e-04
Validation binary_cross_entropy = 0.109684
Epoch 8806
Loss = 1.9802e+00, PNorm = 776.3085, GNorm = 0.2683, lr_0 = 9.9381e-04
Validation binary_cross_entropy = 0.093915
Epoch 8807
Loss = 1.1497e-01, PNorm = 776.4785, GNorm = 4.7206, lr_0 = 9.9381e-04
Validation binary_cross_entropy = 0.097236
Epoch 8808
Loss = 1.0348e-01, PNorm = 776.6099, GNorm = 4.7285, lr_0 = 9.9381e-04
Validation binary_cross_entropy = 0.199243
Epoch 8809
Loss = 7.0373e-02, PNorm = 776.7431, GNorm = 3.7968, lr_0 = 9.9381e-04
Loss = 1.6134e-01, PNorm = 776.8396, GNorm = 4.2399, lr_0 = 9.9381e-04
Validation binary_cross_entropy = 0.110672
Epoch 8810
Loss = 9.2833e-02, PNorm = 776.9259, GNorm = 3.1285, lr_0 = 9.9381e-04
Validation binary_cross_entropy = 0.138538
Epoch 8811
Loss = 7.2014e-02, PNorm = 776.9891, GNorm = 1.1117, lr_0 = 9.9381e-04
Validation binary_cross_entropy = 0.106117
Epoch 8812
Loss = 8.7382e-02, PNorm = 777.0408, GNorm = 2.4387, lr_0 = 9.9381e-04
Validation binary_cross_entropy = 0.144129
Epoch 8813
Loss = 1.1350e-01, PNorm = 777.0847, GNorm = 4.0131, lr_0 = 9.9380e-04
Validation binary_cross_entropy = 0.110719
Epoch 8814
Loss = 6.1028e-02, PNorm = 777.1247, GNorm = 2.9105, lr_0 = 9.9380e-04
Validation binary_cross_entropy = 0.101934
Epoch 8815
Loss = 4.0884e-02, PNorm = 777.1656, GNorm = 0.3468, lr_0 = 9.9380e-04
Validation binary_cross_entropy = 0.114552
Epoch 8816
Loss = 1.3038e-01, PNorm = 777.1988, GNorm = 2.8622, lr_0 = 9.9380e-04
Validation binary_cross_entropy = 0.115203
Epoch 8817
Loss = 8.2512e-02, PNorm = 777.2352, GNorm = 1.6226, lr_0 = 9.9380e-04
Validation binary_cross_entropy = 0.100863
Epoch 8818
Loss = 7.8711e-02, PNorm = 777.2694, GNorm = 5.6598, lr_0 = 9.9380e-04
Validation binary_cross_entropy = 0.090954
Epoch 8819
Loss = 2.4267e-02, PNorm = 777.3042, GNorm = 1.2761, lr_0 = 9.9380e-04
Loss = 4.0708e-02, PNorm = 777.3471, GNorm = 1.8259, lr_0 = 9.9380e-04
Validation binary_cross_entropy = 0.100620
Epoch 8820
Loss = 2.8118e-02, PNorm = 777.3827, GNorm = 1.7856, lr_0 = 9.9380e-04
Validation binary_cross_entropy = 0.106677
Epoch 8821
Loss = 3.1523e-02, PNorm = 777.4130, GNorm = 0.2961, lr_0 = 9.9380e-04
Validation binary_cross_entropy = 0.107380
Epoch 8822
Loss = 2.8187e-02, PNorm = 777.4529, GNorm = 0.4012, lr_0 = 9.9380e-04
Validation binary_cross_entropy = 0.104637
Epoch 8823
Loss = 1.4976e-01, PNorm = 777.5450, GNorm = 8.7382, lr_0 = 9.9380e-04
Validation binary_cross_entropy = 0.148069
Epoch 8824
Loss = 2.0504e-01, PNorm = 777.6469, GNorm = 30.4355, lr_0 = 9.9380e-04
Validation binary_cross_entropy = 0.146114
Epoch 8825
Loss = 1.1380e-01, PNorm = 777.7245, GNorm = 9.7312, lr_0 = 9.9380e-04
Validation binary_cross_entropy = 0.102656
Epoch 8826
Loss = 2.5449e-01, PNorm = 777.7779, GNorm = 17.8459, lr_0 = 9.9380e-04
Validation binary_cross_entropy = 0.144747
Epoch 8827
Loss = 1.0987e-01, PNorm = 777.8312, GNorm = 12.6791, lr_0 = 9.9379e-04
Validation binary_cross_entropy = 0.135816
Epoch 8828
Loss = 4.5515e-01, PNorm = 777.8920, GNorm = 24.7404, lr_0 = 9.9379e-04
Validation binary_cross_entropy = 0.246028
Epoch 8829
Loss = 1.5535e-01, PNorm = 777.9944, GNorm = 6.3276, lr_0 = 9.9379e-04
Loss = 1.3473e-01, PNorm = 778.0583, GNorm = 0.1849, lr_0 = 9.9379e-04
Validation binary_cross_entropy = 0.124492
Epoch 8830
Loss = 6.9599e-02, PNorm = 778.1150, GNorm = 2.7781, lr_0 = 9.9379e-04
Validation binary_cross_entropy = 0.106990
Epoch 8831
Loss = 6.6816e-02, PNorm = 778.1767, GNorm = 8.0952, lr_0 = 9.9379e-04
Validation binary_cross_entropy = 0.150697
Epoch 8832
Loss = 8.5657e-02, PNorm = 778.2154, GNorm = 2.3347, lr_0 = 9.9379e-04
Validation binary_cross_entropy = 0.102615
Epoch 8833
Loss = 6.2476e-02, PNorm = 778.2603, GNorm = 0.5475, lr_0 = 9.9379e-04
Validation binary_cross_entropy = 0.102341
Epoch 8834
Loss = 9.6634e-02, PNorm = 778.3087, GNorm = 2.8245, lr_0 = 9.9379e-04
Validation binary_cross_entropy = 0.095883
Epoch 8835
Loss = 4.6471e-02, PNorm = 778.3519, GNorm = 7.3436, lr_0 = 9.9379e-04
Validation binary_cross_entropy = 0.115686
Epoch 8836
Loss = 3.9510e-02, PNorm = 778.3924, GNorm = 3.3129, lr_0 = 9.9379e-04
Validation binary_cross_entropy = 0.100943
Epoch 8837
Loss = 1.4368e-02, PNorm = 778.4193, GNorm = 0.2714, lr_0 = 9.9379e-04
Validation binary_cross_entropy = 0.094053
Epoch 8838
Loss = 3.3453e-02, PNorm = 778.4457, GNorm = 1.4121, lr_0 = 9.9379e-04
Validation binary_cross_entropy = 0.091572
Epoch 8839
Loss = 1.6836e-02, PNorm = 778.4816, GNorm = 0.6228, lr_0 = 9.9379e-04
Loss = 2.9464e-02, PNorm = 778.5169, GNorm = 0.6711, lr_0 = 9.9378e-04
Validation binary_cross_entropy = 0.104835
Epoch 8840
Loss = 3.0546e-02, PNorm = 778.5364, GNorm = 1.0204, lr_0 = 9.9378e-04
Validation binary_cross_entropy = 0.102777
Epoch 8841
Loss = 3.0950e-02, PNorm = 778.5562, GNorm = 0.1578, lr_0 = 9.9378e-04
Validation binary_cross_entropy = 0.104547
Epoch 8842
Loss = 1.0556e-02, PNorm = 778.5822, GNorm = 0.3506, lr_0 = 9.9378e-04
Validation binary_cross_entropy = 0.108523
Epoch 8843
Loss = 5.8667e-02, PNorm = 778.6017, GNorm = 1.4767, lr_0 = 9.9378e-04
Validation binary_cross_entropy = 0.104698
Epoch 8844
Loss = 3.1817e-02, PNorm = 778.6217, GNorm = 8.0497, lr_0 = 9.9378e-04
Validation binary_cross_entropy = 0.109158
Epoch 8845
Loss = 1.8873e-02, PNorm = 778.6528, GNorm = 1.8612, lr_0 = 9.9378e-04
Validation binary_cross_entropy = 0.104409
Epoch 8846
Loss = 7.2454e-02, PNorm = 778.6834, GNorm = 0.3105, lr_0 = 9.9378e-04
Validation binary_cross_entropy = 0.095778
Epoch 8847
Loss = 2.2633e-02, PNorm = 778.7262, GNorm = 0.2111, lr_0 = 9.9378e-04
Validation binary_cross_entropy = 0.102474
Epoch 8848
Loss = 5.2067e-02, PNorm = 778.7728, GNorm = 1.7600, lr_0 = 9.9378e-04
Validation binary_cross_entropy = 0.118061
Epoch 8849
Loss = 9.3117e-02, PNorm = 778.8182, GNorm = 6.2309, lr_0 = 9.9378e-04
Loss = 2.3779e-02, PNorm = 778.8463, GNorm = 1.1616, lr_0 = 9.9378e-04
Validation binary_cross_entropy = 0.114537
Epoch 8850
Loss = 2.4706e-02, PNorm = 778.8672, GNorm = 0.9826, lr_0 = 9.9378e-04
Validation binary_cross_entropy = 0.115196
Epoch 8851
Loss = 4.0163e-02, PNorm = 778.8844, GNorm = 0.1309, lr_0 = 9.9378e-04
Validation binary_cross_entropy = 0.103129
Epoch 8852
Loss = 1.2197e-01, PNorm = 778.9075, GNorm = 34.6874, lr_0 = 9.9378e-04
Validation binary_cross_entropy = 0.118846
Epoch 8853
Loss = 8.7202e-02, PNorm = 778.9474, GNorm = 6.0232, lr_0 = 9.9377e-04
Validation binary_cross_entropy = 0.115829
Epoch 8854
Loss = 2.1252e-02, PNorm = 778.9746, GNorm = 2.0149, lr_0 = 9.9377e-04
Validation binary_cross_entropy = 0.091349
Epoch 8855
Loss = 5.2565e-02, PNorm = 779.0207, GNorm = 2.5179, lr_0 = 9.9377e-04
Validation binary_cross_entropy = 0.096203
Epoch 8856
Loss = 1.4488e-02, PNorm = 779.0698, GNorm = 0.3511, lr_0 = 9.9377e-04
Validation binary_cross_entropy = 0.113689
Epoch 8857
Loss = 4.5101e-02, PNorm = 779.1051, GNorm = 7.5404, lr_0 = 9.9377e-04
Validation binary_cross_entropy = 0.105290
Epoch 8858
Loss = 5.1196e-02, PNorm = 779.1364, GNorm = 5.5335, lr_0 = 9.9377e-04
Validation binary_cross_entropy = 0.096937
Epoch 8859
Loss = 6.7077e-02, PNorm = 779.1713, GNorm = 2.5647, lr_0 = 9.9377e-04
Loss = 5.5590e-02, PNorm = 779.2053, GNorm = 17.0477, lr_0 = 9.9377e-04
Validation binary_cross_entropy = 0.094047
Epoch 8860
Loss = 2.5292e-02, PNorm = 779.2443, GNorm = 0.2165, lr_0 = 9.9377e-04
Validation binary_cross_entropy = 0.096754
Epoch 8861
Loss = 3.4927e-02, PNorm = 779.2917, GNorm = 0.1612, lr_0 = 9.9377e-04
Validation binary_cross_entropy = 0.100219
Epoch 8862
Loss = 9.4493e-02, PNorm = 779.3315, GNorm = 8.8577, lr_0 = 9.9377e-04
Validation binary_cross_entropy = 0.112255
Epoch 8863
Loss = 2.0774e-02, PNorm = 779.3746, GNorm = 1.2513, lr_0 = 9.9377e-04
Validation binary_cross_entropy = 0.118010
Epoch 8864
Loss = 7.6287e-02, PNorm = 779.4096, GNorm = 35.4786, lr_0 = 9.9377e-04
Validation binary_cross_entropy = 0.093186
Epoch 8865
Loss = 2.3522e-02, PNorm = 779.4800, GNorm = 0.2654, lr_0 = 9.9377e-04
Validation binary_cross_entropy = 0.095649
Epoch 8866
Loss = 5.3189e-02, PNorm = 779.5420, GNorm = 5.1918, lr_0 = 9.9377e-04
Validation binary_cross_entropy = 0.104808
Epoch 8867
Loss = 8.9775e-03, PNorm = 779.5852, GNorm = 1.2303, lr_0 = 9.9376e-04
Validation binary_cross_entropy = 0.122509
Epoch 8868
Loss = 3.1267e-03, PNorm = 779.6344, GNorm = 0.4612, lr_0 = 9.9376e-04
Validation binary_cross_entropy = 0.109615
Epoch 8869
Loss = 5.9049e-03, PNorm = 779.6678, GNorm = 0.5649, lr_0 = 9.9376e-04
Loss = 3.9478e-02, PNorm = 779.6946, GNorm = 1.0964, lr_0 = 9.9376e-04
Validation binary_cross_entropy = 0.090338
Epoch 8870
Loss = 5.4334e-02, PNorm = 779.7343, GNorm = 0.8695, lr_0 = 9.9376e-04
Validation binary_cross_entropy = 0.092120
Epoch 8871
Loss = 5.1103e-02, PNorm = 779.8046, GNorm = 0.2062, lr_0 = 9.9376e-04
Validation binary_cross_entropy = 0.114138
Epoch 8872
Loss = 5.7205e-02, PNorm = 779.8480, GNorm = 4.7505, lr_0 = 9.9376e-04
Validation binary_cross_entropy = 0.097519
Epoch 8873
Loss = 4.0307e-02, PNorm = 779.8794, GNorm = 3.3457, lr_0 = 9.9376e-04
Validation binary_cross_entropy = 0.091936
Epoch 8874
Loss = 5.0324e-02, PNorm = 779.9094, GNorm = 5.7216, lr_0 = 9.9376e-04
Validation binary_cross_entropy = 0.092672
Epoch 8875
Loss = 2.7509e-02, PNorm = 779.9474, GNorm = 0.4066, lr_0 = 9.9376e-04
Validation binary_cross_entropy = 0.097621
Epoch 8876
Loss = 1.6181e-02, PNorm = 779.9789, GNorm = 1.9391, lr_0 = 9.9376e-04
Validation binary_cross_entropy = 0.081054
Epoch 8877
Loss = 3.8919e-02, PNorm = 780.0206, GNorm = 3.4188, lr_0 = 9.9376e-04
Validation binary_cross_entropy = 0.102598
Epoch 8878
Loss = 4.2198e-02, PNorm = 780.1129, GNorm = 1.6032, lr_0 = 9.9376e-04
Validation binary_cross_entropy = 0.097948
Epoch 8879
Loss = 2.5042e-01, PNorm = 780.1758, GNorm = 14.9750, lr_0 = 9.9376e-04
Loss = 2.4837e-02, PNorm = 780.2263, GNorm = 0.4771, lr_0 = 9.9375e-04
Validation binary_cross_entropy = 0.108350
Epoch 8880
Loss = 4.1256e-02, PNorm = 780.2681, GNorm = 1.6271, lr_0 = 9.9375e-04
Validation binary_cross_entropy = 0.112359
Epoch 8881
Loss = 6.2790e-02, PNorm = 780.3011, GNorm = 1.1384, lr_0 = 9.9375e-04
Validation binary_cross_entropy = 0.101927
Epoch 8882
Loss = 1.2665e-01, PNorm = 780.3483, GNorm = 6.3432, lr_0 = 9.9375e-04
Validation binary_cross_entropy = 0.098796
Epoch 8883
Loss = 2.7145e-02, PNorm = 780.4203, GNorm = 0.3359, lr_0 = 9.9375e-04
Validation binary_cross_entropy = 0.106780
Epoch 8884
Loss = 3.1350e-02, PNorm = 780.4668, GNorm = 0.7896, lr_0 = 9.9375e-04
Validation binary_cross_entropy = 0.123007
Epoch 8885
Loss = 2.8905e-02, PNorm = 780.5066, GNorm = 2.3305, lr_0 = 9.9375e-04
Validation binary_cross_entropy = 0.123384
Epoch 8886
Loss = 2.5207e-02, PNorm = 780.5349, GNorm = 0.2562, lr_0 = 9.9375e-04
Validation binary_cross_entropy = 0.122546
Epoch 8887
Loss = 2.6576e-02, PNorm = 780.5529, GNorm = 3.7715, lr_0 = 9.9375e-04
Validation binary_cross_entropy = 0.103850
Epoch 8888
Loss = 2.7021e-03, PNorm = 780.5743, GNorm = 0.1994, lr_0 = 9.9375e-04
Validation binary_cross_entropy = 0.103231
Epoch 8889
Loss = 2.0430e-01, PNorm = 780.6043, GNorm = 11.8840, lr_0 = 9.9375e-04
Loss = 4.4467e-02, PNorm = 780.6405, GNorm = 1.5636, lr_0 = 9.9375e-04
Validation binary_cross_entropy = 0.117018
Epoch 8890
Loss = 2.2681e-02, PNorm = 780.6685, GNorm = 0.4462, lr_0 = 9.9375e-04
Validation binary_cross_entropy = 0.105071
Epoch 8891
Loss = 4.1619e-02, PNorm = 780.6961, GNorm = 2.6318, lr_0 = 9.9375e-04
Validation binary_cross_entropy = 0.098574
Epoch 8892
Loss = 8.6251e-02, PNorm = 780.7313, GNorm = 3.7332, lr_0 = 9.9375e-04
Validation binary_cross_entropy = 0.250549
Epoch 8893
Loss = 9.3721e-02, PNorm = 780.7906, GNorm = 4.3027, lr_0 = 9.9374e-04
Validation binary_cross_entropy = 0.114584
Epoch 8894
Loss = 4.5650e-02, PNorm = 780.8683, GNorm = 0.1877, lr_0 = 9.9374e-04
Validation binary_cross_entropy = 0.148652
Epoch 8895
Loss = 3.8922e-03, PNorm = 780.9246, GNorm = 0.3628, lr_0 = 9.9374e-04
Validation binary_cross_entropy = 0.165442
Epoch 8896
Loss = 8.2618e-02, PNorm = 780.9558, GNorm = 0.3554, lr_0 = 9.9374e-04
Validation binary_cross_entropy = 0.162622
Epoch 8897
Loss = 3.5348e-03, PNorm = 780.9920, GNorm = 1.4006, lr_0 = 9.9374e-04
Validation binary_cross_entropy = 0.155438
Epoch 8898
Loss = 2.1888e-02, PNorm = 781.0175, GNorm = 1.6238, lr_0 = 9.9374e-04
Validation binary_cross_entropy = 0.128570
Epoch 8899
Loss = 3.0904e-03, PNorm = 781.0372, GNorm = 0.1853, lr_0 = 9.9374e-04
Loss = 5.8258e-02, PNorm = 781.0641, GNorm = 2.0883, lr_0 = 9.9374e-04
Validation binary_cross_entropy = 0.119659
Epoch 8900
Loss = 3.3796e-02, PNorm = 781.1011, GNorm = 0.5820, lr_0 = 9.9374e-04
Validation binary_cross_entropy = 0.139765
Epoch 8901
Loss = 5.6330e-02, PNorm = 781.1373, GNorm = 0.8348, lr_0 = 9.9374e-04
Validation binary_cross_entropy = 0.115144
Epoch 8902
Loss = 1.3306e-01, PNorm = 781.1787, GNorm = 1.5127, lr_0 = 9.9374e-04
Validation binary_cross_entropy = 0.113014
Epoch 8903
Loss = 3.7877e-02, PNorm = 781.2250, GNorm = 1.5499, lr_0 = 9.9374e-04
Validation binary_cross_entropy = 0.108528
Epoch 8904
Loss = 8.2076e-03, PNorm = 781.2684, GNorm = 0.5447, lr_0 = 9.9374e-04
Validation binary_cross_entropy = 0.115628
Epoch 8905
Loss = 4.0047e-02, PNorm = 781.2996, GNorm = 0.2053, lr_0 = 9.9374e-04
Validation binary_cross_entropy = 0.130732
Epoch 8906
Loss = 3.8682e-02, PNorm = 781.3272, GNorm = 1.5028, lr_0 = 9.9373e-04
Validation binary_cross_entropy = 0.120564
Epoch 8907
Loss = 8.3136e-02, PNorm = 781.3550, GNorm = 0.3172, lr_0 = 9.9373e-04
Validation binary_cross_entropy = 0.102830
Epoch 8908
Loss = 6.8599e-03, PNorm = 781.3829, GNorm = 0.4258, lr_0 = 9.9373e-04
Validation binary_cross_entropy = 0.106885
Epoch 8909
Loss = 1.4939e-01, PNorm = 781.4200, GNorm = 7.4902, lr_0 = 9.9373e-04
Loss = 5.7365e-02, PNorm = 781.4649, GNorm = 3.9055, lr_0 = 9.9373e-04
Validation binary_cross_entropy = 0.140181
Epoch 8910
Loss = 3.4267e-02, PNorm = 781.4987, GNorm = 0.1666, lr_0 = 9.9373e-04
Validation binary_cross_entropy = 0.126579
Epoch 8911
Loss = 1.0816e-01, PNorm = 781.5198, GNorm = 14.4710, lr_0 = 9.9373e-04
Validation binary_cross_entropy = 0.086650
Epoch 8912
Loss = 4.3137e-02, PNorm = 781.5493, GNorm = 1.6156, lr_0 = 9.9373e-04
Validation binary_cross_entropy = 0.076221
Epoch 8913
Loss = 2.2141e-02, PNorm = 781.5962, GNorm = 3.8555, lr_0 = 9.9373e-04
Validation binary_cross_entropy = 0.081722
Epoch 8914
Loss = 4.1520e-02, PNorm = 781.6411, GNorm = 0.7480, lr_0 = 9.9373e-04
Validation binary_cross_entropy = 0.081343
Epoch 8915
Loss = 2.1991e-02, PNorm = 781.6745, GNorm = 2.6795, lr_0 = 9.9373e-04
Validation binary_cross_entropy = 0.084711
Epoch 8916
Loss = 2.6160e-02, PNorm = 781.7166, GNorm = 2.8872, lr_0 = 9.9373e-04
Validation binary_cross_entropy = 0.091817
Epoch 8917
Loss = 7.1339e-02, PNorm = 781.7460, GNorm = 3.6991, lr_0 = 9.9373e-04
Validation binary_cross_entropy = 0.087693
Epoch 8918
Loss = 1.0592e-02, PNorm = 781.7790, GNorm = 0.3129, lr_0 = 9.9373e-04
Validation binary_cross_entropy = 0.102917
Epoch 8919
Loss = 1.8764e-02, PNorm = 781.8086, GNorm = 2.4763, lr_0 = 9.9373e-04
Loss = 1.0300e-02, PNorm = 781.8311, GNorm = 0.2178, lr_0 = 9.9372e-04
Validation binary_cross_entropy = 0.103826
Epoch 8920
Loss = 4.6396e-02, PNorm = 781.8474, GNorm = 1.6030, lr_0 = 9.9372e-04
Validation binary_cross_entropy = 0.094636
Epoch 8921
Loss = 1.3833e-02, PNorm = 781.8659, GNorm = 0.2651, lr_0 = 9.9372e-04
Validation binary_cross_entropy = 0.096272
Epoch 8922
Loss = 4.5804e-03, PNorm = 781.8840, GNorm = 0.0719, lr_0 = 9.9372e-04
Validation binary_cross_entropy = 0.115817
Epoch 8923
Loss = 7.8226e-03, PNorm = 781.9194, GNorm = 1.0511, lr_0 = 9.9372e-04
Validation binary_cross_entropy = 0.122634
Epoch 8924
Loss = 2.2609e-02, PNorm = 781.9470, GNorm = 0.1327, lr_0 = 9.9372e-04
Validation binary_cross_entropy = 0.112794
Epoch 8925
Loss = 9.7542e-03, PNorm = 781.9700, GNorm = 2.3389, lr_0 = 9.9372e-04
Validation binary_cross_entropy = 0.127210
Epoch 8926
Loss = 1.1359e-02, PNorm = 782.0029, GNorm = 1.1395, lr_0 = 9.9372e-04
Validation binary_cross_entropy = 0.137354
Epoch 8927
Loss = 2.4674e-02, PNorm = 782.0270, GNorm = 2.5395, lr_0 = 9.9372e-04
Validation binary_cross_entropy = 0.108296
Epoch 8928
Loss = 3.7472e-03, PNorm = 782.0454, GNorm = 0.2232, lr_0 = 9.9372e-04
Validation binary_cross_entropy = 0.097719
Epoch 8929
Loss = 4.5899e-02, PNorm = 782.0658, GNorm = 2.4229, lr_0 = 9.9372e-04
Loss = 9.6917e-02, PNorm = 782.1168, GNorm = 1.7746, lr_0 = 9.9372e-04
Validation binary_cross_entropy = 0.085912
Epoch 8930
Loss = 3.4134e-02, PNorm = 782.1879, GNorm = 1.9309, lr_0 = 9.9372e-04
Validation binary_cross_entropy = 0.101857
Epoch 8931
Loss = 5.5551e-02, PNorm = 782.2250, GNorm = 0.0624, lr_0 = 9.9372e-04
Validation binary_cross_entropy = 0.122210
Epoch 8932
Loss = 5.1516e-02, PNorm = 782.2441, GNorm = 0.3522, lr_0 = 9.9372e-04
Validation binary_cross_entropy = 0.095060
Epoch 8933
Loss = 5.5992e-02, PNorm = 782.2767, GNorm = 0.4366, lr_0 = 9.9371e-04
Validation binary_cross_entropy = 0.081678
Epoch 8934
Loss = 7.2176e-02, PNorm = 782.3328, GNorm = 0.9802, lr_0 = 9.9371e-04
Validation binary_cross_entropy = 0.094495
Epoch 8935
Loss = 2.1222e-02, PNorm = 782.3922, GNorm = 0.1757, lr_0 = 9.9371e-04
Validation binary_cross_entropy = 0.099092
Epoch 8936
Loss = 6.7644e-02, PNorm = 782.4198, GNorm = 1.5007, lr_0 = 9.9371e-04
Validation binary_cross_entropy = 0.087399
Epoch 8937
Loss = 3.6879e-03, PNorm = 782.4534, GNorm = 0.2772, lr_0 = 9.9371e-04
Validation binary_cross_entropy = 0.095386
Epoch 8938
Loss = 6.0599e-03, PNorm = 782.4944, GNorm = 0.3021, lr_0 = 9.9371e-04
Validation binary_cross_entropy = 0.100821
Epoch 8939
Loss = 4.5081e-04, PNorm = 782.5346, GNorm = 0.0632, lr_0 = 9.9371e-04
Loss = 2.5582e-02, PNorm = 782.5584, GNorm = 1.2517, lr_0 = 9.9371e-04
Validation binary_cross_entropy = 0.098784
Epoch 8940
Loss = 1.7157e-02, PNorm = 782.5774, GNorm = 2.4789, lr_0 = 9.9371e-04
Validation binary_cross_entropy = 0.096332
Epoch 8941
Loss = 2.4957e-02, PNorm = 782.5863, GNorm = 0.4095, lr_0 = 9.9371e-04
Validation binary_cross_entropy = 0.090260
Epoch 8942
Loss = 3.0064e-01, PNorm = 782.6204, GNorm = 31.0664, lr_0 = 9.9371e-04
Validation binary_cross_entropy = 0.086497
Epoch 8943
Loss = 3.7900e-02, PNorm = 782.6787, GNorm = 0.0629, lr_0 = 9.9371e-04
Validation binary_cross_entropy = 0.089956
Epoch 8944
Loss = 2.2015e-01, PNorm = 782.7327, GNorm = 20.5499, lr_0 = 9.9371e-04
Validation binary_cross_entropy = 0.088181
Epoch 8945
Loss = 3.0879e-02, PNorm = 782.7886, GNorm = 1.2718, lr_0 = 9.9371e-04
Validation binary_cross_entropy = 0.094786
Epoch 8946
Loss = 1.8325e-02, PNorm = 782.8328, GNorm = 2.3607, lr_0 = 9.9370e-04
Validation binary_cross_entropy = 0.090075
Epoch 8947
Loss = 1.2537e-02, PNorm = 782.8615, GNorm = 0.7428, lr_0 = 9.9370e-04
Validation binary_cross_entropy = 0.122251
Epoch 8948
Loss = 1.4928e-01, PNorm = 782.8876, GNorm = 4.4889, lr_0 = 9.9370e-04
Validation binary_cross_entropy = 0.090271
Epoch 8949
Loss = 1.3724e-02, PNorm = 782.9061, GNorm = 1.9849, lr_0 = 9.9370e-04
Loss = 5.6576e-02, PNorm = 782.9360, GNorm = 3.7166, lr_0 = 9.9370e-04
Validation binary_cross_entropy = 0.095189
Epoch 8950
Loss = 3.1059e-02, PNorm = 782.9684, GNorm = 0.8844, lr_0 = 9.9370e-04
Validation binary_cross_entropy = 0.094127
Epoch 8951
Loss = 2.4349e-02, PNorm = 782.9929, GNorm = 1.3319, lr_0 = 9.9370e-04
Validation binary_cross_entropy = 0.111772
Epoch 8952
Loss = 2.6937e-02, PNorm = 783.0188, GNorm = 0.1343, lr_0 = 9.9370e-04
Validation binary_cross_entropy = 0.105858
Epoch 8953
Loss = 6.9346e-02, PNorm = 783.0493, GNorm = 3.0911, lr_0 = 9.9370e-04
Validation binary_cross_entropy = 0.085155
Epoch 8954
Loss = 2.8839e-02, PNorm = 783.1004, GNorm = 0.3595, lr_0 = 9.9370e-04
Validation binary_cross_entropy = 0.112165
Epoch 8955
Loss = 3.3293e-02, PNorm = 783.1376, GNorm = 0.2255, lr_0 = 9.9370e-04
Validation binary_cross_entropy = 0.102798
Epoch 8956
Loss = 1.9739e-01, PNorm = 783.1605, GNorm = 1.9750, lr_0 = 9.9370e-04
Validation binary_cross_entropy = 0.081407
Epoch 8957
Loss = 1.5401e-01, PNorm = 783.1901, GNorm = 1.0556, lr_0 = 9.9370e-04
Validation binary_cross_entropy = 0.084186
Epoch 8958
Loss = 1.1137e-01, PNorm = 783.2241, GNorm = 5.3942, lr_0 = 9.9370e-04
Validation binary_cross_entropy = 0.090531
Epoch 8959
Loss = 6.9428e-03, PNorm = 783.2631, GNorm = 0.2661, lr_0 = 9.9370e-04
Loss = 2.9309e-02, PNorm = 783.3005, GNorm = 1.7898, lr_0 = 9.9369e-04
Validation binary_cross_entropy = 0.114106
Epoch 8960
Loss = 2.7586e-02, PNorm = 783.3299, GNorm = 1.4956, lr_0 = 9.9369e-04
Validation binary_cross_entropy = 0.082219
Epoch 8961
Loss = 2.8919e-02, PNorm = 783.3530, GNorm = 1.6258, lr_0 = 9.9369e-04
Validation binary_cross_entropy = 0.092564
Epoch 8962
Loss = 2.5481e-02, PNorm = 783.3842, GNorm = 0.1423, lr_0 = 9.9369e-04
Validation binary_cross_entropy = 0.119315
Epoch 8963
Loss = 2.2633e-02, PNorm = 783.4088, GNorm = 0.0292, lr_0 = 9.9369e-04
Validation binary_cross_entropy = 0.130428
Epoch 8964
Loss = 2.5983e-02, PNorm = 783.4244, GNorm = 0.1457, lr_0 = 9.9369e-04
Validation binary_cross_entropy = 0.131695
Epoch 8965
Loss = 7.7338e-03, PNorm = 783.4441, GNorm = 0.0419, lr_0 = 9.9369e-04
Validation binary_cross_entropy = 0.119816
Epoch 8966
Loss = 4.5456e-02, PNorm = 783.4558, GNorm = 0.8949, lr_0 = 9.9369e-04
Validation binary_cross_entropy = 0.109461
Epoch 8967
Loss = 1.6550e-01, PNorm = 783.4791, GNorm = 8.9719, lr_0 = 9.9369e-04
Validation binary_cross_entropy = 0.113860
Epoch 8968
Loss = 2.5807e-03, PNorm = 783.5151, GNorm = 0.0628, lr_0 = 9.9369e-04
Validation binary_cross_entropy = 0.138926
Epoch 8969
Loss = 9.1684e-02, PNorm = 783.5458, GNorm = 7.6211, lr_0 = 9.9369e-04
Loss = 2.9567e-02, PNorm = 783.5681, GNorm = 1.4512, lr_0 = 9.9369e-04
Validation binary_cross_entropy = 0.106217
Epoch 8970
Loss = 3.1534e-02, PNorm = 783.5936, GNorm = 0.7310, lr_0 = 9.9369e-04
Validation binary_cross_entropy = 0.109000
Epoch 8971
Loss = 1.0206e-02, PNorm = 783.6177, GNorm = 0.8035, lr_0 = 9.9369e-04
Validation binary_cross_entropy = 0.110484
Epoch 8972
Loss = 2.5119e-02, PNorm = 783.6450, GNorm = 2.8911, lr_0 = 9.9369e-04
Validation binary_cross_entropy = 0.114618
Epoch 8973
Loss = 1.7051e-02, PNorm = 783.6695, GNorm = 0.9939, lr_0 = 9.9368e-04
Validation binary_cross_entropy = 0.106277
Epoch 8974
Loss = 1.2689e-01, PNorm = 783.7200, GNorm = 1.5382, lr_0 = 9.9368e-04
Validation binary_cross_entropy = 0.096252
Epoch 8975
Loss = 4.0521e-02, PNorm = 783.7759, GNorm = 1.3024, lr_0 = 9.9368e-04
Validation binary_cross_entropy = 0.092302
Epoch 8976
Loss = 3.1551e-02, PNorm = 783.8206, GNorm = 1.4442, lr_0 = 9.9368e-04
Validation binary_cross_entropy = 0.089667
Epoch 8977
Loss = 1.1703e-01, PNorm = 783.8578, GNorm = 0.2012, lr_0 = 9.9368e-04
Validation binary_cross_entropy = 0.109853
Epoch 8978
Loss = 2.4840e-02, PNorm = 783.8890, GNorm = 1.2131, lr_0 = 9.9368e-04
Validation binary_cross_entropy = 0.102999
Epoch 8979
Loss = 6.0555e-03, PNorm = 783.9177, GNorm = 0.1924, lr_0 = 9.9368e-04
Loss = 3.2820e-02, PNorm = 783.9508, GNorm = 1.0620, lr_0 = 9.9368e-04
Validation binary_cross_entropy = 0.099712
Epoch 8980
Loss = 3.8903e-02, PNorm = 783.9824, GNorm = 1.3993, lr_0 = 9.9368e-04
Validation binary_cross_entropy = 0.094001
Epoch 8981
Loss = 6.5261e-02, PNorm = 784.0163, GNorm = 2.0664, lr_0 = 9.9368e-04
Validation binary_cross_entropy = 0.121044
Epoch 8982
Loss = 9.3758e-02, PNorm = 784.0504, GNorm = 4.0451, lr_0 = 9.9368e-04
Validation binary_cross_entropy = 0.099814
Epoch 8983
Loss = 2.4456e-02, PNorm = 784.0874, GNorm = 0.6173, lr_0 = 9.9368e-04
Validation binary_cross_entropy = 0.087043
Epoch 8984
Loss = 4.4732e-02, PNorm = 784.1164, GNorm = 1.0313, lr_0 = 9.9368e-04
Validation binary_cross_entropy = 0.080592
Epoch 8985
Loss = 6.1719e-02, PNorm = 784.1415, GNorm = 0.3256, lr_0 = 9.9368e-04
Validation binary_cross_entropy = 0.077815
Epoch 8986
Loss = 3.7939e-02, PNorm = 784.1749, GNorm = 0.5519, lr_0 = 9.9367e-04
Validation binary_cross_entropy = 0.092186
Epoch 8987
Loss = 3.3453e-02, PNorm = 784.2093, GNorm = 1.0255, lr_0 = 9.9367e-04
Validation binary_cross_entropy = 0.101146
Epoch 8988
Loss = 6.2348e-02, PNorm = 784.2449, GNorm = 2.4599, lr_0 = 9.9367e-04
Validation binary_cross_entropy = 0.096820
Epoch 8989
Loss = 1.8949e-02, PNorm = 784.2693, GNorm = 1.8575, lr_0 = 9.9367e-04
Loss = 2.8579e-02, PNorm = 784.2946, GNorm = 0.9951, lr_0 = 9.9367e-04
Validation binary_cross_entropy = 0.098599
Epoch 8990
Loss = 2.3448e-02, PNorm = 784.3213, GNorm = 0.2777, lr_0 = 9.9367e-04
Validation binary_cross_entropy = 0.100069
Epoch 8991
Loss = 2.3285e-02, PNorm = 784.3484, GNorm = 1.4480, lr_0 = 9.9367e-04
Validation binary_cross_entropy = 0.116289
Epoch 8992
Loss = 1.3334e-01, PNorm = 784.3917, GNorm = 0.2468, lr_0 = 9.9367e-04
Validation binary_cross_entropy = 0.088484
Epoch 8993
Loss = 1.8403e-02, PNorm = 784.4446, GNorm = 0.7771, lr_0 = 9.9367e-04
Validation binary_cross_entropy = 0.086170
Epoch 8994
Loss = 3.3197e-02, PNorm = 784.4885, GNorm = 1.4548, lr_0 = 9.9367e-04
Validation binary_cross_entropy = 0.092352
Epoch 8995
Loss = 3.6553e-02, PNorm = 784.5190, GNorm = 0.0247, lr_0 = 9.9367e-04
Validation binary_cross_entropy = 0.100213
Epoch 8996
Loss = 6.2669e-02, PNorm = 784.5533, GNorm = 0.0258, lr_0 = 9.9367e-04
Validation binary_cross_entropy = 0.096163
Epoch 8997
Loss = 1.9890e-02, PNorm = 784.5804, GNorm = 1.0660, lr_0 = 9.9367e-04
Validation binary_cross_entropy = 0.091694
Epoch 8998
Loss = 2.3673e-02, PNorm = 784.6035, GNorm = 2.0932, lr_0 = 9.9367e-04
Validation binary_cross_entropy = 0.091466
Epoch 8999
Loss = 3.0074e-03, PNorm = 784.6246, GNorm = 0.2073, lr_0 = 9.9367e-04
Loss = 2.6682e-02, PNorm = 784.6425, GNorm = 0.5278, lr_0 = 9.9366e-04
Validation binary_cross_entropy = 0.089253
Epoch 9000
Loss = 1.1128e-02, PNorm = 784.6727, GNorm = 0.1138, lr_0 = 9.9366e-04
Validation binary_cross_entropy = 0.092440
Epoch 9001
Loss = 1.4461e-02, PNorm = 784.7002, GNorm = 0.1200, lr_0 = 9.9366e-04
Validation binary_cross_entropy = 0.099761
Epoch 9002
Loss = 2.6661e-02, PNorm = 784.7174, GNorm = 5.2613, lr_0 = 9.9366e-04
Validation binary_cross_entropy = 0.095988
Epoch 9003
Loss = 8.2936e-02, PNorm = 784.7224, GNorm = 13.5329, lr_0 = 9.9366e-04
Validation binary_cross_entropy = 0.085106
Epoch 9004
Loss = 4.3806e-02, PNorm = 784.7543, GNorm = 0.9415, lr_0 = 9.9366e-04
Validation binary_cross_entropy = 0.077804
Epoch 9005
Loss = 7.0706e-02, PNorm = 784.8152, GNorm = 0.3023, lr_0 = 9.9366e-04
Validation binary_cross_entropy = 0.088583
Epoch 9006
Loss = 3.0618e-02, PNorm = 784.8776, GNorm = 2.4543, lr_0 = 9.9366e-04
Validation binary_cross_entropy = 0.085312
Epoch 9007
Loss = 8.3020e-02, PNorm = 784.9164, GNorm = 23.1091, lr_0 = 9.9366e-04
Validation binary_cross_entropy = 0.088000
Epoch 9008
Loss = 4.5103e-02, PNorm = 784.9492, GNorm = 2.9872, lr_0 = 9.9366e-04
Validation binary_cross_entropy = 0.091793
Epoch 9009
Loss = 7.0184e-03, PNorm = 784.9913, GNorm = 0.5919, lr_0 = 9.9366e-04
Loss = 1.2272e-02, PNorm = 785.0335, GNorm = 0.1439, lr_0 = 9.9366e-04
Validation binary_cross_entropy = 0.098384
Epoch 9010
Loss = 4.8642e-02, PNorm = 785.0679, GNorm = 9.3922, lr_0 = 9.9366e-04
Validation binary_cross_entropy = 0.104241
Epoch 9011
Loss = 1.2705e-02, PNorm = 785.1162, GNorm = 0.1147, lr_0 = 9.9366e-04
Validation binary_cross_entropy = 0.131309
Epoch 9012
Loss = 5.5421e-02, PNorm = 785.1577, GNorm = 1.5390, lr_0 = 9.9365e-04
Validation binary_cross_entropy = 0.107602
Epoch 9013
Loss = 1.8881e-02, PNorm = 785.1930, GNorm = 0.9200, lr_0 = 9.9365e-04
Validation binary_cross_entropy = 0.100814
Epoch 9014
Loss = 2.3187e-02, PNorm = 785.2241, GNorm = 0.1265, lr_0 = 9.9365e-04
Validation binary_cross_entropy = 0.111729
Epoch 9015
Loss = 4.4446e-02, PNorm = 785.2540, GNorm = 0.1797, lr_0 = 9.9365e-04
Validation binary_cross_entropy = 0.078021
Epoch 9016
Loss = 1.5265e-01, PNorm = 785.3998, GNorm = 3.2658, lr_0 = 9.9365e-04
Validation binary_cross_entropy = 0.245747
Epoch 9017
Loss = 1.7547e-01, PNorm = 785.5765, GNorm = 4.8848, lr_0 = 9.9365e-04
Validation binary_cross_entropy = 0.120756
Epoch 9018
Loss = 3.0647e-02, PNorm = 785.6711, GNorm = 1.0113, lr_0 = 9.9365e-04
Validation binary_cross_entropy = 0.112972
Epoch 9019
Loss = 1.9632e-02, PNorm = 785.7478, GNorm = 1.2476, lr_0 = 9.9365e-04
Loss = 4.8555e-02, PNorm = 785.8021, GNorm = 0.2482, lr_0 = 9.9365e-04
Validation binary_cross_entropy = 0.122227
Epoch 9020
Loss = 7.0158e-02, PNorm = 785.8378, GNorm = 2.2978, lr_0 = 9.9365e-04
Validation binary_cross_entropy = 0.093183
Epoch 9021
Loss = 4.8048e-02, PNorm = 785.8842, GNorm = 0.2269, lr_0 = 9.9365e-04
Validation binary_cross_entropy = 0.097917
Epoch 9022
Loss = 4.5273e-02, PNorm = 785.9189, GNorm = 1.4509, lr_0 = 9.9365e-04
Validation binary_cross_entropy = 0.094332
Epoch 9023
Loss = 4.3301e-02, PNorm = 785.9538, GNorm = 0.7524, lr_0 = 9.9365e-04
Validation binary_cross_entropy = 0.103934
Epoch 9024
Loss = 3.0980e-02, PNorm = 785.9914, GNorm = 1.3945, lr_0 = 9.9365e-04
Validation binary_cross_entropy = 0.109979
Epoch 9025
Loss = 2.3051e-02, PNorm = 786.0247, GNorm = 0.1761, lr_0 = 9.9365e-04
Validation binary_cross_entropy = 0.108252
Epoch 9026
Loss = 2.6904e-02, PNorm = 786.0520, GNorm = 1.1001, lr_0 = 9.9364e-04
Validation binary_cross_entropy = 0.103891
Epoch 9027
Loss = 2.0281e-02, PNorm = 786.0813, GNorm = 1.3424, lr_0 = 9.9364e-04
Validation binary_cross_entropy = 0.117705
Epoch 9028
Loss = 4.1757e-02, PNorm = 786.1032, GNorm = 1.1730, lr_0 = 9.9364e-04
Validation binary_cross_entropy = 0.091243
Epoch 9029
Loss = 6.3423e-03, PNorm = 786.1215, GNorm = 0.2125, lr_0 = 9.9364e-04
Loss = 2.5988e-02, PNorm = 786.1496, GNorm = 0.4365, lr_0 = 9.9364e-04
Validation binary_cross_entropy = 0.130665
Epoch 9030
Loss = 4.6633e-02, PNorm = 786.1697, GNorm = 4.2567, lr_0 = 9.9364e-04
Validation binary_cross_entropy = 0.093641
Epoch 9031
Loss = 1.8292e-01, PNorm = 786.2185, GNorm = 0.5690, lr_0 = 9.9364e-04
Validation binary_cross_entropy = 0.098265
Epoch 9032
Loss = 4.7429e-02, PNorm = 786.2857, GNorm = 2.6079, lr_0 = 9.9364e-04
Validation binary_cross_entropy = 0.111641
Epoch 9033
Loss = 3.7258e-02, PNorm = 786.3437, GNorm = 2.1572, lr_0 = 9.9364e-04
Validation binary_cross_entropy = 0.099977
Epoch 9034
Loss = 4.7361e-02, PNorm = 786.3862, GNorm = 1.4710, lr_0 = 9.9364e-04
Validation binary_cross_entropy = 0.094201
Epoch 9035
Loss = 7.8875e-02, PNorm = 786.4403, GNorm = 5.4651, lr_0 = 9.9364e-04
Validation binary_cross_entropy = 0.110297
Epoch 9036
Loss = 9.4457e-02, PNorm = 786.4884, GNorm = 11.4718, lr_0 = 9.9364e-04
Validation binary_cross_entropy = 0.105853
Epoch 9037
Loss = 4.1362e-02, PNorm = 786.5191, GNorm = 1.8879, lr_0 = 9.9364e-04
Validation binary_cross_entropy = 0.088218
Epoch 9038
Loss = 1.5038e-02, PNorm = 786.5408, GNorm = 0.7106, lr_0 = 9.9364e-04
Validation binary_cross_entropy = 0.089315
Epoch 9039
Loss = 1.1032e-02, PNorm = 786.5727, GNorm = 0.8510, lr_0 = 9.9364e-04
Loss = 1.6826e-02, PNorm = 786.6058, GNorm = 0.5186, lr_0 = 9.9363e-04
Validation binary_cross_entropy = 0.097720
Epoch 9040
Loss = 2.7535e-02, PNorm = 786.6286, GNorm = 0.1385, lr_0 = 9.9363e-04
Validation binary_cross_entropy = 0.094755
Epoch 9041
Loss = 1.9372e-02, PNorm = 786.6546, GNorm = 0.4164, lr_0 = 9.9363e-04
Validation binary_cross_entropy = 0.104150
Epoch 9042
Loss = 6.1404e-02, PNorm = 786.6906, GNorm = 0.1598, lr_0 = 9.9363e-04
Validation binary_cross_entropy = 0.103370
Epoch 9043
Loss = 2.7782e-02, PNorm = 786.7276, GNorm = 0.0943, lr_0 = 9.9363e-04
Validation binary_cross_entropy = 0.097638
Epoch 9044
Loss = 2.2697e-02, PNorm = 786.7596, GNorm = 0.1113, lr_0 = 9.9363e-04
Validation binary_cross_entropy = 0.238362
Epoch 9045
Loss = 1.3351e-02, PNorm = 786.7943, GNorm = 0.1645, lr_0 = 9.9363e-04
Validation binary_cross_entropy = 0.107293
Epoch 9046
Loss = 1.3002e-02, PNorm = 786.8301, GNorm = 1.3731, lr_0 = 9.9363e-04
Validation binary_cross_entropy = 0.093222
Epoch 9047
Loss = 1.2959e-02, PNorm = 786.8599, GNorm = 0.8306, lr_0 = 9.9363e-04
Validation binary_cross_entropy = 0.114699
Epoch 9048
Loss = 1.0089e-01, PNorm = 786.9015, GNorm = 1.9315, lr_0 = 9.9363e-04
Validation binary_cross_entropy = 0.093325
Epoch 9049
Loss = 2.3953e-03, PNorm = 786.9490, GNorm = 0.1933, lr_0 = 9.9363e-04
Loss = 1.5648e-02, PNorm = 786.9879, GNorm = 0.6080, lr_0 = 9.9363e-04
Validation binary_cross_entropy = 0.089527
Epoch 9050
Loss = 5.5589e-02, PNorm = 787.0163, GNorm = 0.2461, lr_0 = 9.9363e-04
Validation binary_cross_entropy = 0.091376
Epoch 9051
Loss = 8.4307e-02, PNorm = 787.0545, GNorm = 1.5668, lr_0 = 9.9363e-04
Validation binary_cross_entropy = 0.102038
Epoch 9052
Loss = 1.3334e-02, PNorm = 787.0885, GNorm = 0.7267, lr_0 = 9.9362e-04
Validation binary_cross_entropy = 0.123675
Epoch 9053
Loss = 4.2766e-02, PNorm = 787.1124, GNorm = 0.7020, lr_0 = 9.9362e-04
Validation binary_cross_entropy = 0.104734
Epoch 9054
Loss = 6.3132e-02, PNorm = 787.1452, GNorm = 1.2058, lr_0 = 9.9362e-04
Validation binary_cross_entropy = 0.087293
Epoch 9055
Loss = 5.7914e-02, PNorm = 787.1901, GNorm = 0.1698, lr_0 = 9.9362e-04
Validation binary_cross_entropy = 0.079273
Epoch 9056
Loss = 4.2693e-02, PNorm = 787.2317, GNorm = 0.8787, lr_0 = 9.9362e-04
Validation binary_cross_entropy = 0.075418
Epoch 9057
Loss = 1.1567e-01, PNorm = 787.2797, GNorm = 4.3645, lr_0 = 9.9362e-04
Validation binary_cross_entropy = 0.082875
Epoch 9058
Loss = 8.1578e-03, PNorm = 787.3375, GNorm = 0.6935, lr_0 = 9.9362e-04
Validation binary_cross_entropy = 0.084009
Epoch 9059
Loss = 2.0838e-02, PNorm = 787.3767, GNorm = 2.4820, lr_0 = 9.9362e-04
Loss = 3.4749e-02, PNorm = 787.4174, GNorm = 0.3204, lr_0 = 9.9362e-04
Validation binary_cross_entropy = 0.091906
Epoch 9060
Loss = 4.1057e-02, PNorm = 787.4533, GNorm = 1.5922, lr_0 = 9.9362e-04
Validation binary_cross_entropy = 0.134163
Epoch 9061
Loss = 3.4596e-02, PNorm = 787.4827, GNorm = 0.0838, lr_0 = 9.9362e-04
Validation binary_cross_entropy = 0.114305
Epoch 9062
Loss = 7.3882e-02, PNorm = 787.5167, GNorm = 0.1771, lr_0 = 9.9362e-04
Validation binary_cross_entropy = 0.140166
Epoch 9063
Loss = 3.8013e-02, PNorm = 787.5569, GNorm = 0.1850, lr_0 = 9.9362e-04
Validation binary_cross_entropy = 0.107962
Epoch 9064
Loss = 5.6304e-02, PNorm = 787.5880, GNorm = 8.0858, lr_0 = 9.9362e-04
Validation binary_cross_entropy = 0.105747
Epoch 9065
Loss = 2.7157e-02, PNorm = 787.6460, GNorm = 0.5172, lr_0 = 9.9362e-04
Validation binary_cross_entropy = 0.115533
Epoch 9066
Loss = 1.5014e-02, PNorm = 787.6972, GNorm = 0.2497, lr_0 = 9.9361e-04
Validation binary_cross_entropy = 0.118058
Epoch 9067
Loss = 7.2293e-03, PNorm = 787.7295, GNorm = 0.1937, lr_0 = 9.9361e-04
Validation binary_cross_entropy = 0.121028
Epoch 9068
Loss = 3.5770e-02, PNorm = 787.7501, GNorm = 2.9993, lr_0 = 9.9361e-04
Validation binary_cross_entropy = 0.116648
Epoch 9069
Loss = 2.3525e-03, PNorm = 787.7727, GNorm = 0.1727, lr_0 = 9.9361e-04
Loss = 5.1058e-02, PNorm = 787.8044, GNorm = 0.2816, lr_0 = 9.9361e-04
Validation binary_cross_entropy = 0.117310
Epoch 9070
Loss = 5.2367e-02, PNorm = 787.8379, GNorm = 0.9551, lr_0 = 9.9361e-04
Validation binary_cross_entropy = 0.109113
Epoch 9071
Loss = 3.8522e-02, PNorm = 787.8975, GNorm = 0.5341, lr_0 = 9.9361e-04
Validation binary_cross_entropy = 0.114932
Epoch 9072
Loss = 2.5366e-02, PNorm = 787.9583, GNorm = 1.4916, lr_0 = 9.9361e-04
Validation binary_cross_entropy = 0.122955
Epoch 9073
Loss = 2.8045e-02, PNorm = 787.9865, GNorm = 0.7202, lr_0 = 9.9361e-04
Validation binary_cross_entropy = 0.109302
Epoch 9074
Loss = 1.5664e-02, PNorm = 788.0337, GNorm = 0.3403, lr_0 = 9.9361e-04
Validation binary_cross_entropy = 0.099605
Epoch 9075
Loss = 3.6124e-02, PNorm = 788.0873, GNorm = 0.6598, lr_0 = 9.9361e-04
Validation binary_cross_entropy = 0.099003
Epoch 9076
Loss = 2.9108e-02, PNorm = 788.1450, GNorm = 0.1958, lr_0 = 9.9361e-04
Validation binary_cross_entropy = 0.107588
Epoch 9077
Loss = 7.9063e-02, PNorm = 788.1853, GNorm = 1.9303, lr_0 = 9.9361e-04
Validation binary_cross_entropy = 0.106419
Epoch 9078
Loss = 5.5094e-03, PNorm = 788.2111, GNorm = 0.1970, lr_0 = 9.9361e-04
Validation binary_cross_entropy = 0.102273
Epoch 9079
Loss = 1.6127e-02, PNorm = 788.2405, GNorm = 1.0886, lr_0 = 9.9360e-04
Loss = 3.3083e-02, PNorm = 788.2884, GNorm = 0.0651, lr_0 = 9.9360e-04
Validation binary_cross_entropy = 0.110032
Epoch 9080
Loss = 4.0752e-02, PNorm = 788.3231, GNorm = 0.0723, lr_0 = 9.9360e-04
Validation binary_cross_entropy = 0.103614
Epoch 9081
Loss = 3.0734e-02, PNorm = 788.3586, GNorm = 1.4074, lr_0 = 9.9360e-04
Validation binary_cross_entropy = 0.098910
Epoch 9082
Loss = 4.1889e-02, PNorm = 788.4114, GNorm = 4.8276, lr_0 = 9.9360e-04
Validation binary_cross_entropy = 0.104009
Epoch 9083
Loss = 2.3550e-02, PNorm = 788.4578, GNorm = 1.4405, lr_0 = 9.9360e-04
Validation binary_cross_entropy = 0.103954
Epoch 9084
Loss = 6.8953e-03, PNorm = 788.4921, GNorm = 1.8952, lr_0 = 9.9360e-04
Validation binary_cross_entropy = 0.104742
Epoch 9085
Loss = 9.6950e-03, PNorm = 788.5101, GNorm = 1.2622, lr_0 = 9.9360e-04
Validation binary_cross_entropy = 0.108038
Epoch 9086
Loss = 1.8766e-02, PNorm = 788.5253, GNorm = 0.1573, lr_0 = 9.9360e-04
Validation binary_cross_entropy = 0.150365
Epoch 9087
Loss = 5.4356e-02, PNorm = 788.5619, GNorm = 0.1345, lr_0 = 9.9360e-04
Validation binary_cross_entropy = 0.103340
Epoch 9088
Loss = 3.9136e-02, PNorm = 788.6156, GNorm = 3.6056, lr_0 = 9.9360e-04
Validation binary_cross_entropy = 0.103769
Epoch 9089
Loss = 3.4417e-03, PNorm = 788.6698, GNorm = 0.3052, lr_0 = 9.9360e-04
Loss = 6.0434e-02, PNorm = 788.7069, GNorm = 0.5914, lr_0 = 9.9360e-04
Validation binary_cross_entropy = 0.102187
Epoch 9090
Loss = 2.2718e-02, PNorm = 788.7367, GNorm = 3.5823, lr_0 = 9.9360e-04
Validation binary_cross_entropy = 0.095673
Epoch 9091
Loss = 2.5980e-02, PNorm = 788.7662, GNorm = 0.1381, lr_0 = 9.9360e-04
Validation binary_cross_entropy = 0.098483
Epoch 9092
Loss = 4.8166e-02, PNorm = 788.7965, GNorm = 1.8581, lr_0 = 9.9359e-04
Validation binary_cross_entropy = 0.094810
Epoch 9093
Loss = 1.7603e-01, PNorm = 788.8475, GNorm = 1.4527, lr_0 = 9.9359e-04
Validation binary_cross_entropy = 0.146283
Epoch 9094
Loss = 4.6276e-02, PNorm = 788.8938, GNorm = 1.6749, lr_0 = 9.9359e-04
Validation binary_cross_entropy = 0.092461
Epoch 9095
Loss = 3.0550e-01, PNorm = 788.9277, GNorm = 1.1145, lr_0 = 9.9359e-04
Validation binary_cross_entropy = 0.065716
Epoch 9096
Loss = 8.8314e-02, PNorm = 789.0836, GNorm = 2.6008, lr_0 = 9.9359e-04
Validation binary_cross_entropy = 0.207958
Epoch 9097
Loss = 1.1797e-01, PNorm = 789.1955, GNorm = 1.4605, lr_0 = 9.9359e-04
Validation binary_cross_entropy = 0.142971
Epoch 9098
Loss = 1.7969e-01, PNorm = 789.2746, GNorm = 4.0514, lr_0 = 9.9359e-04
Validation binary_cross_entropy = 0.109826
Epoch 9099
Loss = 1.4529e-01, PNorm = 789.3370, GNorm = 9.4692, lr_0 = 9.9359e-04
Loss = 7.6831e-02, PNorm = 789.3882, GNorm = 4.7508, lr_0 = 9.9359e-04
Validation binary_cross_entropy = 0.082799
Epoch 9100
Loss = 9.2513e-02, PNorm = 789.4442, GNorm = 2.4146, lr_0 = 9.9359e-04
Validation binary_cross_entropy = 0.102654
Epoch 9101
Loss = 4.0229e-02, PNorm = 789.5051, GNorm = 0.5083, lr_0 = 9.9359e-04
Validation binary_cross_entropy = 0.108407
Epoch 9102
Loss = 2.2625e-02, PNorm = 789.5470, GNorm = 1.9801, lr_0 = 9.9359e-04
Validation binary_cross_entropy = 0.099462
Epoch 9103
Loss = 3.9630e-02, PNorm = 789.5798, GNorm = 0.3922, lr_0 = 9.9359e-04
Validation binary_cross_entropy = 0.317206
Epoch 9104
Loss = 9.0472e-02, PNorm = 789.6127, GNorm = 0.2378, lr_0 = 9.9359e-04
Validation binary_cross_entropy = 0.113843
Epoch 9105
Loss = 3.4324e-02, PNorm = 789.6639, GNorm = 2.5426, lr_0 = 9.9359e-04
Validation binary_cross_entropy = 0.148036
Epoch 9106
Loss = 4.0535e-02, PNorm = 789.7071, GNorm = 1.1541, lr_0 = 9.9358e-04
Validation binary_cross_entropy = 0.152915
Epoch 9107
Loss = 3.9272e-02, PNorm = 789.7444, GNorm = 0.8801, lr_0 = 9.9358e-04
Validation binary_cross_entropy = 0.228262
Epoch 9108
Loss = 2.6922e-02, PNorm = 789.7850, GNorm = 1.8696, lr_0 = 9.9358e-04
Validation binary_cross_entropy = 0.113385
Epoch 9109
Loss = 1.4974e-03, PNorm = 789.8313, GNorm = 0.1466, lr_0 = 9.9358e-04
Loss = 1.5424e-02, PNorm = 789.8852, GNorm = 1.5538, lr_0 = 9.9358e-04
Validation binary_cross_entropy = 0.112252
Epoch 9110
Loss = 3.3388e-02, PNorm = 789.9204, GNorm = 0.2582, lr_0 = 9.9358e-04
Validation binary_cross_entropy = 0.117453
Epoch 9111
Loss = 6.2056e-02, PNorm = 789.9723, GNorm = 1.3185, lr_0 = 9.9358e-04
Validation binary_cross_entropy = 0.122019
Epoch 9112
Loss = 1.4374e-01, PNorm = 790.0317, GNorm = 0.2129, lr_0 = 9.9358e-04
Validation binary_cross_entropy = 0.089137
Epoch 9113
Loss = 3.8248e-02, PNorm = 790.0870, GNorm = 0.8075, lr_0 = 9.9358e-04
Validation binary_cross_entropy = 0.096681
Epoch 9114
Loss = 2.9074e-02, PNorm = 790.1398, GNorm = 0.4907, lr_0 = 9.9358e-04
Validation binary_cross_entropy = 0.106188
Epoch 9115
Loss = 2.6686e-02, PNorm = 790.1890, GNorm = 4.4864, lr_0 = 9.9358e-04
Validation binary_cross_entropy = 0.189207
Epoch 9116
Loss = 2.0252e-01, PNorm = 790.2569, GNorm = 0.0509, lr_0 = 9.9358e-04
Validation binary_cross_entropy = 0.122305
Epoch 9117
Loss = 1.3651e-01, PNorm = 790.3263, GNorm = 5.0262, lr_0 = 9.9358e-04
Validation binary_cross_entropy = 0.165685
Epoch 9118
Loss = 1.1050e-02, PNorm = 790.3997, GNorm = 0.6554, lr_0 = 9.9358e-04
Validation binary_cross_entropy = 0.200395
Epoch 9119
Loss = 4.0349e-03, PNorm = 790.4481, GNorm = 0.2001, lr_0 = 9.9357e-04
Loss = 6.6614e-02, PNorm = 790.4921, GNorm = 0.3388, lr_0 = 9.9357e-04
Validation binary_cross_entropy = 0.144053
Epoch 9120
Loss = 7.6114e-02, PNorm = 790.5355, GNorm = 11.9853, lr_0 = 9.9357e-04
Validation binary_cross_entropy = 0.135798
Epoch 9121
Loss = 7.1084e-02, PNorm = 790.5839, GNorm = 1.5941, lr_0 = 9.9357e-04
Validation binary_cross_entropy = 0.101653
Epoch 9122
Loss = 1.3521e-01, PNorm = 790.6579, GNorm = 1.2184, lr_0 = 9.9357e-04
Validation binary_cross_entropy = 0.099735
Epoch 9123
Loss = 3.0342e-02, PNorm = 790.7481, GNorm = 2.9209, lr_0 = 9.9357e-04
Validation binary_cross_entropy = 0.131359
Epoch 9124
Loss = 1.2569e-01, PNorm = 790.8186, GNorm = 2.4917, lr_0 = 9.9357e-04
Validation binary_cross_entropy = 0.422455
Epoch 9125
Loss = 4.8401e-01, PNorm = 790.9095, GNorm = 0.7901, lr_0 = 9.9357e-04
Validation binary_cross_entropy = 0.091645
Epoch 9126
Loss = 1.4129e-01, PNorm = 791.0299, GNorm = 1.6770, lr_0 = 9.9357e-04
Validation binary_cross_entropy = 0.235763
Epoch 9127
Loss = 1.0540e-01, PNorm = 791.1162, GNorm = 5.2712, lr_0 = 9.9357e-04
Validation binary_cross_entropy = 0.084708
Epoch 9128
Loss = 1.9137e-02, PNorm = 791.1653, GNorm = 0.5851, lr_0 = 9.9357e-04
Validation binary_cross_entropy = 0.108737
Epoch 9129
Loss = 6.9848e-02, PNorm = 791.2156, GNorm = 1.1352, lr_0 = 9.9357e-04
Loss = 3.8924e-02, PNorm = 791.2481, GNorm = 1.7045, lr_0 = 9.9357e-04
Validation binary_cross_entropy = 0.135526
Epoch 9130
Loss = 4.7001e-02, PNorm = 791.2712, GNorm = 1.1177, lr_0 = 9.9357e-04
Validation binary_cross_entropy = 0.106219
Epoch 9131
Loss = 5.1239e-02, PNorm = 791.3001, GNorm = 0.2155, lr_0 = 9.9357e-04
Validation binary_cross_entropy = 0.116734
Epoch 9132
Loss = 1.5916e-02, PNorm = 791.3335, GNorm = 1.6136, lr_0 = 9.9356e-04
Validation binary_cross_entropy = 0.155131
Epoch 9133
Loss = 1.2182e-02, PNorm = 791.3567, GNorm = 0.0774, lr_0 = 9.9356e-04
Validation binary_cross_entropy = 0.167681
Epoch 9134
Loss = 2.0290e-02, PNorm = 791.3756, GNorm = 0.8203, lr_0 = 9.9356e-04
Validation binary_cross_entropy = 0.152704
Epoch 9135
Loss = 4.2060e-02, PNorm = 791.3935, GNorm = 4.4702, lr_0 = 9.9356e-04
Validation binary_cross_entropy = 0.121030
Epoch 9136
Loss = 2.9876e-02, PNorm = 791.4258, GNorm = 2.1922, lr_0 = 9.9356e-04
Validation binary_cross_entropy = 0.114374
Epoch 9137
Loss = 1.6129e-02, PNorm = 791.4560, GNorm = 0.3380, lr_0 = 9.9356e-04
Validation binary_cross_entropy = 0.120687
Epoch 9138
Loss = 1.8316e-03, PNorm = 791.4826, GNorm = 0.1631, lr_0 = 9.9356e-04
Validation binary_cross_entropy = 0.125948
Epoch 9139
Loss = 1.3607e-02, PNorm = 791.5009, GNorm = 0.8364, lr_0 = 9.9356e-04
Loss = 5.8473e-02, PNorm = 791.5204, GNorm = 0.1706, lr_0 = 9.9356e-04
Validation binary_cross_entropy = 0.119159
Epoch 9140
Loss = 5.6745e-02, PNorm = 791.5435, GNorm = 0.2093, lr_0 = 9.9356e-04
Validation binary_cross_entropy = 0.104512
Epoch 9141
Loss = 3.6166e-02, PNorm = 791.5762, GNorm = 0.2833, lr_0 = 9.9356e-04
Validation binary_cross_entropy = 0.105720
Epoch 9142
Loss = 1.8644e-02, PNorm = 791.6033, GNorm = 0.2593, lr_0 = 9.9356e-04
Validation binary_cross_entropy = 0.219669
Epoch 9143
Loss = 1.0902e-01, PNorm = 791.6245, GNorm = 1.8300, lr_0 = 9.9356e-04
Validation binary_cross_entropy = 0.117289
Epoch 9144
Loss = 1.6679e-02, PNorm = 791.6682, GNorm = 1.9207, lr_0 = 9.9356e-04
Validation binary_cross_entropy = 0.121584
Epoch 9145
Loss = 1.9099e-01, PNorm = 791.7040, GNorm = 1.1806, lr_0 = 9.9356e-04
Validation binary_cross_entropy = 0.096297
Epoch 9146
Loss = 2.9754e-02, PNorm = 791.7465, GNorm = 0.7214, lr_0 = 9.9355e-04
Validation binary_cross_entropy = 0.090680
Epoch 9147
Loss = 2.8346e-02, PNorm = 791.7824, GNorm = 0.7012, lr_0 = 9.9355e-04
Validation binary_cross_entropy = 0.096329
Epoch 9148
Loss = 9.3309e-03, PNorm = 791.8173, GNorm = 1.4821, lr_0 = 9.9355e-04
Validation binary_cross_entropy = 0.107910
Epoch 9149
Loss = 4.2363e-02, PNorm = 791.8508, GNorm = 1.6117, lr_0 = 9.9355e-04
Loss = 2.4026e-02, PNorm = 791.8778, GNorm = 0.5197, lr_0 = 9.9355e-04
Validation binary_cross_entropy = 0.106765
Epoch 9150
Loss = 2.3014e-02, PNorm = 791.9040, GNorm = 0.5263, lr_0 = 9.9355e-04
Validation binary_cross_entropy = 0.130600
Epoch 9151
Loss = 5.0949e-02, PNorm = 791.9230, GNorm = 2.2897, lr_0 = 9.9355e-04
Validation binary_cross_entropy = 0.135121
Epoch 9152
Loss = 2.1039e-01, PNorm = 791.9349, GNorm = 2.5952, lr_0 = 9.9355e-04
Validation binary_cross_entropy = 0.099491
Epoch 9153
Loss = 9.4712e-03, PNorm = 791.9636, GNorm = 0.8758, lr_0 = 9.9355e-04
Validation binary_cross_entropy = 0.101029
Epoch 9154
Loss = 2.0683e-02, PNorm = 791.9874, GNorm = 0.2850, lr_0 = 9.9355e-04
Validation binary_cross_entropy = 0.093403
Epoch 9155
Loss = 3.7540e-02, PNorm = 792.0084, GNorm = 0.3311, lr_0 = 9.9355e-04
Validation binary_cross_entropy = 0.088959
Epoch 9156
Loss = 6.3919e-02, PNorm = 792.0264, GNorm = 0.6459, lr_0 = 9.9355e-04
Validation binary_cross_entropy = 0.101821
Epoch 9157
Loss = 3.3646e+00, PNorm = 792.0527, GNorm = 1265.3281, lr_0 = 9.9355e-04
Validation binary_cross_entropy = 0.121381
Epoch 9158
Loss = 1.8919e-02, PNorm = 792.3043, GNorm = 1.8460, lr_0 = 9.9355e-04
Validation binary_cross_entropy = 0.161823
Epoch 9159
Loss = 7.4343e-02, PNorm = 792.5098, GNorm = 3.6016, lr_0 = 9.9354e-04
Loss = 2.0010e-01, PNorm = 792.6427, GNorm = 6.7245, lr_0 = 9.9354e-04
Validation binary_cross_entropy = 0.145634
Epoch 9160
Loss = 1.1401e-01, PNorm = 792.7257, GNorm = 1.1156, lr_0 = 9.9354e-04
Validation binary_cross_entropy = 0.120612
Epoch 9161
Loss = 8.2316e-02, PNorm = 792.7797, GNorm = 1.6735, lr_0 = 9.9354e-04
Validation binary_cross_entropy = 0.124124
Epoch 9162
Loss = 4.9579e-02, PNorm = 792.8281, GNorm = 0.9437, lr_0 = 9.9354e-04
Validation binary_cross_entropy = 0.133750
Epoch 9163
Loss = 3.3102e-02, PNorm = 792.8759, GNorm = 2.5725, lr_0 = 9.9354e-04
Validation binary_cross_entropy = 0.153165
Epoch 9164
Loss = 1.0836e-01, PNorm = 792.9036, GNorm = 1.5992, lr_0 = 9.9354e-04
Validation binary_cross_entropy = 0.110412
Epoch 9165
Loss = 1.0975e-01, PNorm = 792.9386, GNorm = 8.0667, lr_0 = 9.9354e-04
Validation binary_cross_entropy = 0.126840
Epoch 9166
Loss = 6.4054e-02, PNorm = 792.9812, GNorm = 4.6618, lr_0 = 9.9354e-04
Validation binary_cross_entropy = 0.125872
Epoch 9167
Loss = 1.8202e-02, PNorm = 793.0092, GNorm = 0.5029, lr_0 = 9.9354e-04
Validation binary_cross_entropy = 0.127569
Epoch 9168
Loss = 3.6113e-02, PNorm = 793.0406, GNorm = 0.3509, lr_0 = 9.9354e-04
Validation binary_cross_entropy = 0.149692
Epoch 9169
Loss = 7.2779e-02, PNorm = 793.0738, GNorm = 2.3926, lr_0 = 9.9354e-04
Loss = 1.4723e-01, PNorm = 793.1059, GNorm = 0.8845, lr_0 = 9.9354e-04
Validation binary_cross_entropy = 0.112125
Epoch 9170
Loss = 5.4591e-02, PNorm = 793.1515, GNorm = 0.4237, lr_0 = 9.9354e-04
Validation binary_cross_entropy = 0.125181
Epoch 9171
Loss = 4.1996e-02, PNorm = 793.1862, GNorm = 5.0911, lr_0 = 9.9354e-04
Validation binary_cross_entropy = 0.136178
Epoch 9172
Loss = 1.1760e-01, PNorm = 793.2117, GNorm = 6.7474, lr_0 = 9.9353e-04
Validation binary_cross_entropy = 0.111622
Epoch 9173
Loss = 2.7576e-02, PNorm = 793.2426, GNorm = 0.2822, lr_0 = 9.9353e-04
Validation binary_cross_entropy = 0.109517
Epoch 9174
Loss = 5.1447e-02, PNorm = 793.2765, GNorm = 4.1905, lr_0 = 9.9353e-04
Validation binary_cross_entropy = 0.131375
Epoch 9175
Loss = 1.1936e-01, PNorm = 793.3070, GNorm = 2.1362, lr_0 = 9.9353e-04
Validation binary_cross_entropy = 0.144336
Epoch 9176
Loss = 5.0287e-02, PNorm = 793.3276, GNorm = 0.8959, lr_0 = 9.9353e-04
Validation binary_cross_entropy = 0.110745
Epoch 9177
Loss = 7.6794e-02, PNorm = 793.3613, GNorm = 10.5537, lr_0 = 9.9353e-04
Validation binary_cross_entropy = 0.103847
Epoch 9178
Loss = 5.0332e-03, PNorm = 793.4061, GNorm = 0.1552, lr_0 = 9.9353e-04
Validation binary_cross_entropy = 0.112611
Epoch 9179
Loss = 1.1832e-02, PNorm = 793.4454, GNorm = 0.4575, lr_0 = 9.9353e-04
Loss = 1.5751e-02, PNorm = 793.4697, GNorm = 0.3501, lr_0 = 9.9353e-04
Validation binary_cross_entropy = 0.126913
Epoch 9180
Loss = 2.4024e-02, PNorm = 793.4824, GNorm = 2.5095, lr_0 = 9.9353e-04
Validation binary_cross_entropy = 0.121615
Epoch 9181
Loss = 6.2009e-02, PNorm = 793.4935, GNorm = 0.4611, lr_0 = 9.9353e-04
Validation binary_cross_entropy = 0.115899
Epoch 9182
Loss = 3.1164e-02, PNorm = 793.5145, GNorm = 8.6240, lr_0 = 9.9353e-04
Validation binary_cross_entropy = 0.108256
Epoch 9183
Loss = 1.1222e-01, PNorm = 793.5441, GNorm = 12.7028, lr_0 = 9.9353e-04
Validation binary_cross_entropy = 0.099549
Epoch 9184
Loss = 6.5656e-02, PNorm = 793.5967, GNorm = 1.1374, lr_0 = 9.9353e-04
Validation binary_cross_entropy = 0.108938
Epoch 9185
Loss = 1.2291e-01, PNorm = 793.6459, GNorm = 0.2382, lr_0 = 9.9352e-04
Validation binary_cross_entropy = 0.183878
Epoch 9186
Loss = 3.7222e-02, PNorm = 793.6788, GNorm = 0.2011, lr_0 = 9.9352e-04
Validation binary_cross_entropy = 0.150808
Epoch 9187
Loss = 1.3617e-01, PNorm = 793.7039, GNorm = 0.5911, lr_0 = 9.9352e-04
Validation binary_cross_entropy = 0.127151
Epoch 9188
Loss = 1.4957e-01, PNorm = 793.7439, GNorm = 6.3402, lr_0 = 9.9352e-04
Validation binary_cross_entropy = 0.101583
Epoch 9189
Loss = 3.4931e-02, PNorm = 793.7818, GNorm = 1.6837, lr_0 = 9.9352e-04
Loss = 4.4576e-02, PNorm = 793.8211, GNorm = 0.6160, lr_0 = 9.9352e-04
Validation binary_cross_entropy = 0.104335
Epoch 9190
Loss = 3.8528e-02, PNorm = 793.8472, GNorm = 0.8663, lr_0 = 9.9352e-04
Validation binary_cross_entropy = 0.096999
Epoch 9191
Loss = 3.4911e-02, PNorm = 793.8702, GNorm = 2.7448, lr_0 = 9.9352e-04
Validation binary_cross_entropy = 0.099504
Epoch 9192
Loss = 4.2680e-02, PNorm = 793.9011, GNorm = 0.5319, lr_0 = 9.9352e-04
Validation binary_cross_entropy = 0.114060
Epoch 9193
Loss = 1.3631e-02, PNorm = 793.9236, GNorm = 1.5980, lr_0 = 9.9352e-04
Validation binary_cross_entropy = 0.117718
Epoch 9194
Loss = 2.5399e-02, PNorm = 793.9400, GNorm = 4.5054, lr_0 = 9.9352e-04
Validation binary_cross_entropy = 0.112400
Epoch 9195
Loss = 2.0375e-02, PNorm = 793.9594, GNorm = 3.0028, lr_0 = 9.9352e-04
Validation binary_cross_entropy = 0.113999
Epoch 9196
Loss = 2.2391e-02, PNorm = 793.9920, GNorm = 1.3039, lr_0 = 9.9352e-04
Validation binary_cross_entropy = 0.113909
Epoch 9197
Loss = 2.0047e-02, PNorm = 794.0263, GNorm = 2.2446, lr_0 = 9.9352e-04
Validation binary_cross_entropy = 0.112416
Epoch 9198
Loss = 8.4214e-02, PNorm = 794.0587, GNorm = 10.4094, lr_0 = 9.9352e-04
Validation binary_cross_entropy = 0.110927
Epoch 9199
Loss = 2.6048e-01, PNorm = 794.0842, GNorm = 13.0038, lr_0 = 9.9351e-04
Loss = 3.6170e-02, PNorm = 794.1286, GNorm = 0.1714, lr_0 = 9.9351e-04
Validation binary_cross_entropy = 0.107890
Epoch 9200
Loss = 4.2349e-02, PNorm = 794.1838, GNorm = 0.2143, lr_0 = 9.9351e-04
Validation binary_cross_entropy = 0.110513
Epoch 9201
Loss = 2.6368e-02, PNorm = 794.2255, GNorm = 1.1566, lr_0 = 9.9351e-04
Validation binary_cross_entropy = 0.112304
Epoch 9202
Loss = 2.1709e-02, PNorm = 794.2498, GNorm = 0.1651, lr_0 = 9.9351e-04
Validation binary_cross_entropy = 0.138299
Epoch 9203
Loss = 7.3269e-02, PNorm = 794.2699, GNorm = 6.7901, lr_0 = 9.9351e-04
Validation binary_cross_entropy = 0.114571
Epoch 9204
Loss = 2.2343e-02, PNorm = 794.2959, GNorm = 0.1281, lr_0 = 9.9351e-04
Validation binary_cross_entropy = 0.115217
Epoch 9205
Loss = 6.9667e-02, PNorm = 794.3213, GNorm = 18.3777, lr_0 = 9.9351e-04
Validation binary_cross_entropy = 0.121202
Epoch 9206
Loss = 9.2943e-02, PNorm = 794.3563, GNorm = 0.1673, lr_0 = 9.9351e-04
Validation binary_cross_entropy = 0.117999
Epoch 9207
Loss = 1.0495e-02, PNorm = 794.4042, GNorm = 0.3578, lr_0 = 9.9351e-04
Validation binary_cross_entropy = 0.127295
Epoch 9208
Loss = 7.9939e-02, PNorm = 794.4509, GNorm = 0.3512, lr_0 = 9.9351e-04
Validation binary_cross_entropy = 0.126553
Epoch 9209
Loss = 2.0131e-03, PNorm = 794.4969, GNorm = 0.1559, lr_0 = 9.9351e-04
Loss = 6.4134e-02, PNorm = 794.5379, GNorm = 19.2640, lr_0 = 9.9351e-04
Validation binary_cross_entropy = 0.135895
Epoch 9210
Loss = 6.4955e-02, PNorm = 794.5781, GNorm = 8.1204, lr_0 = 9.9351e-04
Validation binary_cross_entropy = 0.136444
Epoch 9211
Loss = 3.4840e-02, PNorm = 794.6104, GNorm = 0.4367, lr_0 = 9.9351e-04
Validation binary_cross_entropy = 0.114495
Epoch 9212
Loss = 6.5019e-02, PNorm = 794.6567, GNorm = 0.6630, lr_0 = 9.9350e-04
Validation binary_cross_entropy = 0.123434
Epoch 9213
Loss = 3.8711e-02, PNorm = 794.7067, GNorm = 2.4627, lr_0 = 9.9350e-04
Validation binary_cross_entropy = 0.127579
Epoch 9214
Loss = 6.5053e-02, PNorm = 794.7380, GNorm = 0.3452, lr_0 = 9.9350e-04
Validation binary_cross_entropy = 0.116223
Epoch 9215
Loss = 1.3897e-01, PNorm = 794.7737, GNorm = 2.0568, lr_0 = 9.9350e-04
Validation binary_cross_entropy = 0.119497
Epoch 9216
Loss = 8.2212e-02, PNorm = 794.8195, GNorm = 4.1904, lr_0 = 9.9350e-04
Validation binary_cross_entropy = 0.122394
Epoch 9217
Loss = 1.2308e-02, PNorm = 794.8540, GNorm = 0.2030, lr_0 = 9.9350e-04
Validation binary_cross_entropy = 0.123828
Epoch 9218
Loss = 5.0927e-02, PNorm = 794.8785, GNorm = 2.6039, lr_0 = 9.9350e-04
Validation binary_cross_entropy = 0.128007
Epoch 9219
Loss = 1.2849e-01, PNorm = 794.8989, GNorm = 1.6582, lr_0 = 9.9350e-04
Loss = 4.2464e-02, PNorm = 794.9194, GNorm = 1.5442, lr_0 = 9.9350e-04
Validation binary_cross_entropy = 0.142589
Epoch 9220
Loss = 2.1571e-02, PNorm = 794.9304, GNorm = 0.0655, lr_0 = 9.9350e-04
Validation binary_cross_entropy = 0.139453
Epoch 9221
Loss = 4.8750e-02, PNorm = 794.9358, GNorm = 0.2285, lr_0 = 9.9350e-04
Validation binary_cross_entropy = 0.109872
Epoch 9222
Loss = 2.8128e-02, PNorm = 794.9623, GNorm = 0.9005, lr_0 = 9.9350e-04
Validation binary_cross_entropy = 0.126766
Epoch 9223
Loss = 2.3925e-02, PNorm = 794.9906, GNorm = 0.1399, lr_0 = 9.9350e-04
Validation binary_cross_entropy = 0.110673
Epoch 9224
Loss = 3.1000e-02, PNorm = 795.0173, GNorm = 0.9028, lr_0 = 9.9350e-04
Validation binary_cross_entropy = 0.113824
Epoch 9225
Loss = 2.9593e-02, PNorm = 795.0525, GNorm = 4.2584, lr_0 = 9.9349e-04
Validation binary_cross_entropy = 0.121897
Epoch 9226
Loss = 4.7562e-02, PNorm = 795.0821, GNorm = 3.1191, lr_0 = 9.9349e-04
Validation binary_cross_entropy = 0.121017
Epoch 9227
Loss = 3.0082e-03, PNorm = 795.1054, GNorm = 0.1613, lr_0 = 9.9349e-04
Validation binary_cross_entropy = 0.127592
Epoch 9228
Loss = 4.6523e-03, PNorm = 795.1272, GNorm = 0.2697, lr_0 = 9.9349e-04
Validation binary_cross_entropy = 0.134393
Epoch 9229
Loss = 2.0119e-02, PNorm = 795.1430, GNorm = 1.7583, lr_0 = 9.9349e-04
Loss = 9.1612e-03, PNorm = 795.1560, GNorm = 0.4636, lr_0 = 9.9349e-04
Validation binary_cross_entropy = 0.133797
Epoch 9230
Loss = 3.6598e-02, PNorm = 795.1698, GNorm = 0.4255, lr_0 = 9.9349e-04
Validation binary_cross_entropy = 0.127069
Epoch 9231
Loss = 1.3937e-02, PNorm = 795.1851, GNorm = 0.5035, lr_0 = 9.9349e-04
Validation binary_cross_entropy = 0.123119
Epoch 9232
Loss = 6.3248e-02, PNorm = 795.2120, GNorm = 0.9459, lr_0 = 9.9349e-04
Validation binary_cross_entropy = 0.122112
Epoch 9233
Loss = 1.5722e-02, PNorm = 795.2473, GNorm = 0.1232, lr_0 = 9.9349e-04
Validation binary_cross_entropy = 0.128693
Epoch 9234
Loss = 3.3904e-02, PNorm = 795.2766, GNorm = 0.3969, lr_0 = 9.9349e-04
Validation binary_cross_entropy = 0.153165
Epoch 9235
Loss = 1.0218e-01, PNorm = 795.3093, GNorm = 0.2140, lr_0 = 9.9349e-04
Validation binary_cross_entropy = 0.164748
Epoch 9236
Loss = 3.7588e-02, PNorm = 795.3503, GNorm = 2.3391, lr_0 = 9.9349e-04
Validation binary_cross_entropy = 0.160103
Epoch 9237
Loss = 1.2169e-02, PNorm = 795.3793, GNorm = 0.9501, lr_0 = 9.9349e-04
Validation binary_cross_entropy = 0.149200
Epoch 9238
Loss = 2.3729e-02, PNorm = 795.4028, GNorm = 1.4119, lr_0 = 9.9349e-04
Validation binary_cross_entropy = 0.122432
Epoch 9239
Loss = 7.1337e-03, PNorm = 795.4246, GNorm = 0.5778, lr_0 = 9.9348e-04
Loss = 2.1386e-02, PNorm = 795.4629, GNorm = 0.2735, lr_0 = 9.9348e-04
Validation binary_cross_entropy = 0.130557
Epoch 9240
Loss = 2.0220e-01, PNorm = 795.4960, GNorm = 0.4458, lr_0 = 9.9348e-04
Validation binary_cross_entropy = 0.108052
Epoch 9241
Loss = 1.0131e-01, PNorm = 795.5585, GNorm = 7.3281, lr_0 = 9.9348e-04
Validation binary_cross_entropy = 0.153819
Epoch 9242
Loss = 2.0803e-01, PNorm = 795.6234, GNorm = 3.1184, lr_0 = 9.9348e-04
Validation binary_cross_entropy = 0.089095
Epoch 9243
Loss = 4.4599e-02, PNorm = 795.7036, GNorm = 0.8878, lr_0 = 9.9348e-04
Validation binary_cross_entropy = 0.104973
Epoch 9244
Loss = 7.5999e-02, PNorm = 795.7651, GNorm = 3.9017, lr_0 = 9.9348e-04
Validation binary_cross_entropy = 0.114427
Epoch 9245
Loss = 6.7654e-02, PNorm = 795.8027, GNorm = 5.1759, lr_0 = 9.9348e-04
Validation binary_cross_entropy = 0.106262
Epoch 9246
Loss = 3.1530e-02, PNorm = 795.8375, GNorm = 9.4148, lr_0 = 9.9348e-04
Validation binary_cross_entropy = 0.099960
Epoch 9247
Loss = 1.0722e-01, PNorm = 795.8725, GNorm = 3.1671, lr_0 = 9.9348e-04
Validation binary_cross_entropy = 0.102789
Epoch 9248
Loss = 4.6928e-02, PNorm = 795.9131, GNorm = 1.3242, lr_0 = 9.9348e-04
Validation binary_cross_entropy = 0.104271
Epoch 9249
Loss = 1.1436e-02, PNorm = 795.9453, GNorm = 0.4540, lr_0 = 9.9348e-04
Loss = 2.0439e-02, PNorm = 795.9711, GNorm = 0.1303, lr_0 = 9.9348e-04
Validation binary_cross_entropy = 0.103281
Epoch 9250
Loss = 1.4978e-02, PNorm = 795.9933, GNorm = 0.0783, lr_0 = 9.9348e-04
Validation binary_cross_entropy = 0.109027
Epoch 9251
Loss = 4.0392e-02, PNorm = 796.0104, GNorm = 0.4420, lr_0 = 9.9348e-04
Validation binary_cross_entropy = 0.103802
Epoch 9252
Loss = 2.8095e-02, PNorm = 796.0314, GNorm = 3.2204, lr_0 = 9.9347e-04
Validation binary_cross_entropy = 0.100779
Epoch 9253
Loss = 5.1501e-02, PNorm = 796.0479, GNorm = 2.5952, lr_0 = 9.9347e-04
Validation binary_cross_entropy = 0.097434
Epoch 9254
Loss = 6.9865e-02, PNorm = 796.0842, GNorm = 0.2997, lr_0 = 9.9347e-04
Validation binary_cross_entropy = 0.107894
Epoch 9255
Loss = 2.0664e-02, PNorm = 796.1272, GNorm = 0.7999, lr_0 = 9.9347e-04
Validation binary_cross_entropy = 0.119230
Epoch 9256
Loss = 2.7642e-02, PNorm = 796.1671, GNorm = 3.2016, lr_0 = 9.9347e-04
Validation binary_cross_entropy = 0.125730
Epoch 9257
Loss = 3.9164e-02, PNorm = 796.1902, GNorm = 1.0061, lr_0 = 9.9347e-04
Validation binary_cross_entropy = 0.116858
Epoch 9258
Loss = 1.2710e-01, PNorm = 796.2049, GNorm = 0.3923, lr_0 = 9.9347e-04
Validation binary_cross_entropy = 0.126276
Epoch 9259
Loss = 1.1844e-02, PNorm = 796.2205, GNorm = 0.5917, lr_0 = 9.9347e-04
Loss = 3.2921e-02, PNorm = 796.2381, GNorm = 5.5440, lr_0 = 9.9347e-04
Validation binary_cross_entropy = 0.117337
Epoch 9260
Loss = 2.4218e-02, PNorm = 796.2696, GNorm = 2.4541, lr_0 = 9.9347e-04
Validation binary_cross_entropy = 0.115066
Epoch 9261
Loss = 5.6250e-02, PNorm = 796.3035, GNorm = 2.0311, lr_0 = 9.9347e-04
Validation binary_cross_entropy = 0.124084
Epoch 9262
Loss = 2.4870e-02, PNorm = 796.3331, GNorm = 0.2742, lr_0 = 9.9347e-04
Validation binary_cross_entropy = 0.121294
Epoch 9263
Loss = 5.2086e-02, PNorm = 796.3515, GNorm = 2.0027, lr_0 = 9.9347e-04
Validation binary_cross_entropy = 0.115748
Epoch 9264
Loss = 1.0533e-01, PNorm = 796.3696, GNorm = 0.4976, lr_0 = 9.9347e-04
Validation binary_cross_entropy = 0.103097
Epoch 9265
Loss = 2.2449e-02, PNorm = 796.4045, GNorm = 1.2412, lr_0 = 9.9346e-04
Validation binary_cross_entropy = 0.105657
Epoch 9266
Loss = 1.7090e-02, PNorm = 796.4407, GNorm = 0.7258, lr_0 = 9.9346e-04
Validation binary_cross_entropy = 0.110294
Epoch 9267
Loss = 6.4314e-02, PNorm = 796.4727, GNorm = 0.3027, lr_0 = 9.9346e-04
Validation binary_cross_entropy = 0.115830
Epoch 9268
Loss = 2.5874e-02, PNorm = 796.5118, GNorm = 1.3646, lr_0 = 9.9346e-04
Validation binary_cross_entropy = 0.112995
Epoch 9269
Loss = 5.9503e-02, PNorm = 796.5451, GNorm = 4.2482, lr_0 = 9.9346e-04
Loss = 5.0765e-02, PNorm = 796.5704, GNorm = 2.8147, lr_0 = 9.9346e-04
Validation binary_cross_entropy = 0.111117
Epoch 9270
Loss = 1.2795e-02, PNorm = 796.5943, GNorm = 1.4227, lr_0 = 9.9346e-04
Validation binary_cross_entropy = 0.107107
Epoch 9271
Loss = 2.0984e-02, PNorm = 796.6119, GNorm = 0.8358, lr_0 = 9.9346e-04
Validation binary_cross_entropy = 0.108362
Epoch 9272
Loss = 3.7444e-02, PNorm = 796.6285, GNorm = 6.5870, lr_0 = 9.9346e-04
Validation binary_cross_entropy = 0.099795
Epoch 9273
Loss = 1.0808e-02, PNorm = 796.6487, GNorm = 0.2145, lr_0 = 9.9346e-04
Validation binary_cross_entropy = 0.100486
Epoch 9274
Loss = 5.8019e-02, PNorm = 796.6717, GNorm = 2.2069, lr_0 = 9.9346e-04
Validation binary_cross_entropy = 0.108109
Epoch 9275
Loss = 1.5168e-02, PNorm = 796.7226, GNorm = 0.2868, lr_0 = 9.9346e-04
Validation binary_cross_entropy = 0.101497
Epoch 9276
Loss = 4.1079e-02, PNorm = 796.7621, GNorm = 1.7236, lr_0 = 9.9346e-04
Validation binary_cross_entropy = 0.101960
Epoch 9277
Loss = 1.1656e-02, PNorm = 796.7872, GNorm = 1.3228, lr_0 = 9.9346e-04
Validation binary_cross_entropy = 0.103558
Epoch 9278
Loss = 1.4226e-03, PNorm = 796.8123, GNorm = 0.1258, lr_0 = 9.9346e-04
Validation binary_cross_entropy = 0.113104
Epoch 9279
Loss = 5.5198e-03, PNorm = 796.8380, GNorm = 0.9062, lr_0 = 9.9345e-04
Loss = 1.6783e-02, PNorm = 796.8665, GNorm = 0.3326, lr_0 = 9.9345e-04
Validation binary_cross_entropy = 0.116494
Epoch 9280
Loss = 2.0383e-01, PNorm = 796.9111, GNorm = 15.7123, lr_0 = 9.9345e-04
Validation binary_cross_entropy = 0.078293
Epoch 9281
Loss = 4.4844e-02, PNorm = 796.9841, GNorm = 2.4109, lr_0 = 9.9345e-04
Validation binary_cross_entropy = 0.095824
Epoch 9282
Loss = 3.6250e-02, PNorm = 797.0310, GNorm = 3.1259, lr_0 = 9.9345e-04
Validation binary_cross_entropy = 0.091873
Epoch 9283
Loss = 2.7946e-02, PNorm = 797.0555, GNorm = 1.7281, lr_0 = 9.9345e-04
Validation binary_cross_entropy = 0.083220
Epoch 9284
Loss = 2.9333e-02, PNorm = 797.0796, GNorm = 0.2782, lr_0 = 9.9345e-04
Validation binary_cross_entropy = 0.091283
Epoch 9285
Loss = 4.2501e-02, PNorm = 797.1141, GNorm = 0.9203, lr_0 = 9.9345e-04
Validation binary_cross_entropy = 0.097477
Epoch 9286
Loss = 2.3605e-02, PNorm = 797.1403, GNorm = 3.1810, lr_0 = 9.9345e-04
Validation binary_cross_entropy = 0.127051
Epoch 9287
Loss = 3.3433e-02, PNorm = 797.1579, GNorm = 6.8149, lr_0 = 9.9345e-04
Validation binary_cross_entropy = 0.087191
Epoch 9288
Loss = 1.4399e-01, PNorm = 797.1728, GNorm = 10.7863, lr_0 = 9.9345e-04
Validation binary_cross_entropy = 0.087646
Epoch 9289
Loss = 1.0940e-02, PNorm = 797.2044, GNorm = 0.5588, lr_0 = 9.9345e-04
Loss = 2.9691e-02, PNorm = 797.2426, GNorm = 2.6593, lr_0 = 9.9345e-04
Validation binary_cross_entropy = 0.090347
Epoch 9290
Loss = 6.0124e-02, PNorm = 797.2728, GNorm = 3.0832, lr_0 = 9.9345e-04
Validation binary_cross_entropy = 0.089692
Epoch 9291
Loss = 1.1753e-02, PNorm = 797.3022, GNorm = 0.3375, lr_0 = 9.9344e-04
Validation binary_cross_entropy = 0.096613
Epoch 9292
Loss = 2.7319e-02, PNorm = 797.3177, GNorm = 1.3347, lr_0 = 9.9344e-04
Validation binary_cross_entropy = 0.083607
Epoch 9293
Loss = 3.0716e-02, PNorm = 797.3481, GNorm = 0.4529, lr_0 = 9.9344e-04
Validation binary_cross_entropy = 0.085207
Epoch 9294
Loss = 8.4097e-02, PNorm = 797.3780, GNorm = 0.2303, lr_0 = 9.9344e-04
Validation binary_cross_entropy = 0.088763
Epoch 9295
Loss = 3.6529e-02, PNorm = 797.4009, GNorm = 3.4871, lr_0 = 9.9344e-04
Validation binary_cross_entropy = 0.086339
Epoch 9296
Loss = 3.4329e-02, PNorm = 797.4191, GNorm = 1.2109, lr_0 = 9.9344e-04
Validation binary_cross_entropy = 0.079723
Epoch 9297
Loss = 4.1580e-02, PNorm = 797.4403, GNorm = 1.2624, lr_0 = 9.9344e-04
Validation binary_cross_entropy = 0.079940
Epoch 9298
Loss = 8.7431e-02, PNorm = 797.4672, GNorm = 19.3952, lr_0 = 9.9344e-04
Validation binary_cross_entropy = 0.097995
Epoch 9299
Loss = 6.5519e-02, PNorm = 797.5091, GNorm = 3.6755, lr_0 = 9.9344e-04
Loss = 2.6693e-02, PNorm = 797.5333, GNorm = 1.6382, lr_0 = 9.9344e-04
Validation binary_cross_entropy = 0.086619
Epoch 9300
Loss = 3.1134e-02, PNorm = 797.5520, GNorm = 1.0834, lr_0 = 9.9344e-04
Validation binary_cross_entropy = 0.085460
Epoch 9301
Loss = 5.4271e-02, PNorm = 797.5778, GNorm = 1.7577, lr_0 = 9.9344e-04
Validation binary_cross_entropy = 0.094282
Epoch 9302
Loss = 1.2305e-02, PNorm = 797.6047, GNorm = 0.0885, lr_0 = 9.9344e-04
Validation binary_cross_entropy = 0.098002
Epoch 9303
Loss = 1.2287e-02, PNorm = 797.6169, GNorm = 0.7721, lr_0 = 9.9344e-04
Validation binary_cross_entropy = 0.088500
Epoch 9304
Loss = 1.2175e-02, PNorm = 797.6262, GNorm = 0.7301, lr_0 = 9.9344e-04
Validation binary_cross_entropy = 0.082172
Epoch 9305
Loss = 8.1693e-02, PNorm = 797.6373, GNorm = 12.0553, lr_0 = 9.9343e-04
Validation binary_cross_entropy = 0.084248
Epoch 9306
Loss = 1.4884e-02, PNorm = 797.6651, GNorm = 0.1411, lr_0 = 9.9343e-04
Validation binary_cross_entropy = 0.091179
Epoch 9307
Loss = 4.9894e-02, PNorm = 797.6891, GNorm = 1.0890, lr_0 = 9.9343e-04
Validation binary_cross_entropy = 0.097561
Epoch 9308
Loss = 2.4772e-02, PNorm = 797.7117, GNorm = 1.3399, lr_0 = 9.9343e-04
Validation binary_cross_entropy = 0.084477
Epoch 9309
Loss = 1.1239e-02, PNorm = 797.7225, GNorm = 1.0269, lr_0 = 9.9343e-04
Loss = 1.8487e-02, PNorm = 797.7407, GNorm = 0.7756, lr_0 = 9.9343e-04
Validation binary_cross_entropy = 0.084467
Epoch 9310
Loss = 3.6696e-02, PNorm = 797.7622, GNorm = 1.3574, lr_0 = 9.9343e-04
Validation binary_cross_entropy = 0.079945
Epoch 9311
Loss = 5.2401e-02, PNorm = 797.7837, GNorm = 8.5976, lr_0 = 9.9343e-04
Validation binary_cross_entropy = 0.078132
Epoch 9312
Loss = 1.8452e-02, PNorm = 797.8100, GNorm = 0.7808, lr_0 = 9.9343e-04
Validation binary_cross_entropy = 0.076383
Epoch 9313
Loss = 1.7123e-02, PNorm = 797.8409, GNorm = 0.3856, lr_0 = 9.9343e-04
Validation binary_cross_entropy = 0.078393
Epoch 9314
Loss = 2.0596e-02, PNorm = 797.8674, GNorm = 0.4809, lr_0 = 9.9343e-04
Validation binary_cross_entropy = 0.110821
Epoch 9315
Loss = 8.7721e-02, PNorm = 797.8900, GNorm = 0.4043, lr_0 = 9.9343e-04
Validation binary_cross_entropy = 0.111534
Epoch 9316
Loss = 8.3697e-03, PNorm = 797.9071, GNorm = 0.1154, lr_0 = 9.9343e-04
Validation binary_cross_entropy = 0.115798
Epoch 9317
Loss = 1.6442e-02, PNorm = 797.9172, GNorm = 1.2141, lr_0 = 9.9343e-04
Validation binary_cross_entropy = 0.182173
Epoch 9318
Loss = 2.5617e-03, PNorm = 797.9355, GNorm = 0.1123, lr_0 = 9.9343e-04
Validation binary_cross_entropy = 0.105514
Epoch 9319
Loss = 2.8673e-01, PNorm = 797.9522, GNorm = 15.5697, lr_0 = 9.9342e-04
Loss = 7.6360e-02, PNorm = 797.9906, GNorm = 0.3900, lr_0 = 9.9342e-04
Validation binary_cross_entropy = 0.082766
Epoch 9320
Loss = 2.4385e-02, PNorm = 798.0310, GNorm = 0.3135, lr_0 = 9.9342e-04
Validation binary_cross_entropy = 0.083325
Epoch 9321
Loss = 3.4189e-02, PNorm = 798.0678, GNorm = 7.9760, lr_0 = 9.9342e-04
Validation binary_cross_entropy = 0.087148
Epoch 9322
Loss = 5.6780e-02, PNorm = 798.0963, GNorm = 1.5320, lr_0 = 9.9342e-04
Validation binary_cross_entropy = 0.081387
Epoch 9323
Loss = 4.3704e-02, PNorm = 798.1255, GNorm = 0.7902, lr_0 = 9.9342e-04
Validation binary_cross_entropy = 0.085504
Epoch 9324
Loss = 2.2210e-02, PNorm = 798.1621, GNorm = 0.2815, lr_0 = 9.9342e-04
Validation binary_cross_entropy = 0.093750
Epoch 9325
Loss = 1.2632e-01, PNorm = 798.1974, GNorm = 1.2963, lr_0 = 9.9342e-04
Validation binary_cross_entropy = 0.100601
Epoch 9326
Loss = 3.2992e-02, PNorm = 798.2352, GNorm = 0.8581, lr_0 = 9.9342e-04
Validation binary_cross_entropy = 0.107436
Epoch 9327
Loss = 2.6492e-02, PNorm = 798.2644, GNorm = 2.3665, lr_0 = 9.9342e-04
Validation binary_cross_entropy = 0.115536
Epoch 9328
Loss = 3.7440e-02, PNorm = 798.3075, GNorm = 1.7287, lr_0 = 9.9342e-04
Validation binary_cross_entropy = 0.098572
Epoch 9329
Loss = 2.5740e-02, PNorm = 798.3628, GNorm = 1.4644, lr_0 = 9.9342e-04
Loss = 4.6279e-02, PNorm = 798.4531, GNorm = 2.3266, lr_0 = 9.9342e-04
Validation binary_cross_entropy = 0.078523
Epoch 9330
Loss = 9.6367e-02, PNorm = 798.5391, GNorm = 1.3788, lr_0 = 9.9342e-04
Validation binary_cross_entropy = 0.125362
Epoch 9331
Loss = 4.5053e-02, PNorm = 798.6013, GNorm = 0.9602, lr_0 = 9.9341e-04
Validation binary_cross_entropy = 0.107175
Epoch 9332
Loss = 6.8558e-02, PNorm = 798.6546, GNorm = 6.1935, lr_0 = 9.9341e-04
Validation binary_cross_entropy = 0.111167
Epoch 9333
Loss = 4.5860e-02, PNorm = 798.7000, GNorm = 1.9653, lr_0 = 9.9341e-04
Validation binary_cross_entropy = 0.102419
Epoch 9334
Loss = 2.6286e-02, PNorm = 798.7378, GNorm = 4.4998, lr_0 = 9.9341e-04
Validation binary_cross_entropy = 0.102793
Epoch 9335
Loss = 1.7222e-02, PNorm = 798.7672, GNorm = 0.2118, lr_0 = 9.9341e-04
Validation binary_cross_entropy = 0.093562
Epoch 9336
Loss = 4.1673e-02, PNorm = 798.7994, GNorm = 5.5251, lr_0 = 9.9341e-04
Validation binary_cross_entropy = 0.109375
Epoch 9337
Loss = 3.8939e-02, PNorm = 798.8633, GNorm = 2.1015, lr_0 = 9.9341e-04
Validation binary_cross_entropy = 0.091216
Epoch 9338
Loss = 2.1769e-02, PNorm = 798.8990, GNorm = 0.6682, lr_0 = 9.9341e-04
Validation binary_cross_entropy = 0.091180
Epoch 9339
Loss = 1.9600e-02, PNorm = 798.9374, GNorm = 1.7434, lr_0 = 9.9341e-04
Loss = 3.3125e-02, PNorm = 798.9702, GNorm = 1.0153, lr_0 = 9.9341e-04
Validation binary_cross_entropy = 0.093617
Epoch 9340
Loss = 3.8673e-02, PNorm = 799.0071, GNorm = 1.4786, lr_0 = 9.9341e-04
Validation binary_cross_entropy = 0.099814
Epoch 9341
Loss = 4.3598e-02, PNorm = 799.0400, GNorm = 2.0224, lr_0 = 9.9341e-04
Validation binary_cross_entropy = 0.108043
Epoch 9342
Loss = 3.2929e-02, PNorm = 799.0768, GNorm = 2.3958, lr_0 = 9.9341e-04
Validation binary_cross_entropy = 0.120487
Epoch 9343
Loss = 4.2914e-02, PNorm = 799.1139, GNorm = 1.8228, lr_0 = 9.9341e-04
Validation binary_cross_entropy = 0.122399
Epoch 9344
Loss = 1.7404e-02, PNorm = 799.1436, GNorm = 0.2305, lr_0 = 9.9341e-04
Validation binary_cross_entropy = 0.120879
Epoch 9345
Loss = 2.2040e-02, PNorm = 799.1644, GNorm = 1.9204, lr_0 = 9.9340e-04
Validation binary_cross_entropy = 0.120730
Epoch 9346
Loss = 1.5688e-01, PNorm = 799.1853, GNorm = 3.7986, lr_0 = 9.9340e-04
Validation binary_cross_entropy = 0.103833
Epoch 9347
Loss = 3.3615e-02, PNorm = 799.2142, GNorm = 0.8588, lr_0 = 9.9340e-04
Validation binary_cross_entropy = 0.133763
Epoch 9348
Loss = 2.7531e-03, PNorm = 799.2643, GNorm = 0.0882, lr_0 = 9.9340e-04
Validation binary_cross_entropy = 0.171953
Epoch 9349
Loss = 8.2281e-03, PNorm = 799.3071, GNorm = 0.6544, lr_0 = 9.9340e-04
Loss = 3.7337e-02, PNorm = 799.3503, GNorm = 0.4843, lr_0 = 9.9340e-04
Validation binary_cross_entropy = 0.146257
Epoch 9350
Loss = 5.2034e-02, PNorm = 799.3896, GNorm = 1.3437, lr_0 = 9.9340e-04
Validation binary_cross_entropy = 0.118149
Epoch 9351
Loss = 3.1652e-02, PNorm = 799.4250, GNorm = 4.4312, lr_0 = 9.9340e-04
Validation binary_cross_entropy = 0.258399
Epoch 9352
Loss = 1.9177e-02, PNorm = 799.4578, GNorm = 4.5917, lr_0 = 9.9340e-04
Validation binary_cross_entropy = 0.257210
Epoch 9353
Loss = 1.2428e-01, PNorm = 799.4828, GNorm = 2.8798, lr_0 = 9.9340e-04
Validation binary_cross_entropy = 0.096458
Epoch 9354
Loss = 8.1114e-02, PNorm = 799.5466, GNorm = 6.8496, lr_0 = 9.9340e-04
Validation binary_cross_entropy = 0.112540
Epoch 9355
Loss = 5.7586e-02, PNorm = 799.6031, GNorm = 2.2363, lr_0 = 9.9340e-04
Validation binary_cross_entropy = 0.117797
Epoch 9356
Loss = 2.4849e-02, PNorm = 799.6477, GNorm = 0.2432, lr_0 = 9.9340e-04
Validation binary_cross_entropy = 0.109427
Epoch 9357
Loss = 1.9713e-02, PNorm = 799.6866, GNorm = 0.2725, lr_0 = 9.9340e-04
Validation binary_cross_entropy = 0.122008
Epoch 9358
Loss = 6.9359e-02, PNorm = 799.7260, GNorm = 3.6312, lr_0 = 9.9339e-04
Validation binary_cross_entropy = 0.112205
Epoch 9359
Loss = 5.8326e-02, PNorm = 799.7572, GNorm = 1.5336, lr_0 = 9.9339e-04
Loss = 4.1918e-02, PNorm = 799.7831, GNorm = 0.1773, lr_0 = 9.9339e-04
Validation binary_cross_entropy = 0.104042
Epoch 9360
Loss = 2.8448e-02, PNorm = 799.8127, GNorm = 2.9568, lr_0 = 9.9339e-04
Validation binary_cross_entropy = 0.101700
Epoch 9361
Loss = 1.8802e-02, PNorm = 799.8541, GNorm = 0.4219, lr_0 = 9.9339e-04
Validation binary_cross_entropy = 0.108949
Epoch 9362
Loss = 2.2396e-02, PNorm = 799.8906, GNorm = 0.7721, lr_0 = 9.9339e-04
Validation binary_cross_entropy = 0.127767
Epoch 9363
Loss = 1.0909e-02, PNorm = 799.9240, GNorm = 1.9953, lr_0 = 9.9339e-04
Validation binary_cross_entropy = 0.220821
Epoch 9364
Loss = 1.0881e-01, PNorm = 799.9380, GNorm = 0.0161, lr_0 = 9.9339e-04
Validation binary_cross_entropy = 0.139498
Epoch 9365
Loss = 7.7610e-02, PNorm = 799.9648, GNorm = 7.3359, lr_0 = 9.9339e-04
Validation binary_cross_entropy = 0.122225
Epoch 9366
Loss = 4.6908e-02, PNorm = 800.0206, GNorm = 1.3502, lr_0 = 9.9339e-04
Validation binary_cross_entropy = 0.126068
Epoch 9367
Loss = 7.9546e-02, PNorm = 800.0551, GNorm = 1.3338, lr_0 = 9.9339e-04
Validation binary_cross_entropy = 0.096933
Epoch 9368
Loss = 2.8558e-02, PNorm = 800.0896, GNorm = 0.2456, lr_0 = 9.9339e-04
Validation binary_cross_entropy = 0.091262
Epoch 9369
Loss = 1.3364e-02, PNorm = 800.1266, GNorm = 0.3189, lr_0 = 9.9339e-04
Loss = 1.8220e-02, PNorm = 800.1663, GNorm = 0.3004, lr_0 = 9.9339e-04
Validation binary_cross_entropy = 0.098399
Epoch 9370
Loss = 1.6019e-02, PNorm = 800.1974, GNorm = 0.2881, lr_0 = 9.9339e-04
Validation binary_cross_entropy = 0.107443
Epoch 9371
Loss = 2.9153e-02, PNorm = 800.2162, GNorm = 0.5177, lr_0 = 9.9338e-04
Validation binary_cross_entropy = 0.107686
Epoch 9372
Loss = 1.0997e-02, PNorm = 800.2398, GNorm = 0.5779, lr_0 = 9.9338e-04
Validation binary_cross_entropy = 0.120929
Epoch 9373
Loss = 5.7796e-02, PNorm = 800.2702, GNorm = 0.8627, lr_0 = 9.9338e-04
Validation binary_cross_entropy = 0.121855
Epoch 9374
Loss = 3.0747e-02, PNorm = 800.3000, GNorm = 1.4225, lr_0 = 9.9338e-04
Validation binary_cross_entropy = 0.141379
Epoch 9375
Loss = 1.6296e-02, PNorm = 800.3270, GNorm = 1.1523, lr_0 = 9.9338e-04
Validation binary_cross_entropy = 0.193684
Epoch 9376
Loss = 3.9278e-03, PNorm = 800.3506, GNorm = 1.3209, lr_0 = 9.9338e-04
Validation binary_cross_entropy = 0.151585
Epoch 9377
Loss = 1.8153e-01, PNorm = 800.3724, GNorm = 0.1168, lr_0 = 9.9338e-04
Validation binary_cross_entropy = 0.101489
Epoch 9378
Loss = 4.6034e-03, PNorm = 800.4014, GNorm = 0.1473, lr_0 = 9.9338e-04
Validation binary_cross_entropy = 0.096146
Epoch 9379
Loss = 6.7407e-03, PNorm = 800.4331, GNorm = 0.5670, lr_0 = 9.9338e-04
Loss = 4.0327e-02, PNorm = 800.4685, GNorm = 1.0617, lr_0 = 9.9338e-04
Validation binary_cross_entropy = 0.107949
Epoch 9380
Loss = 5.0538e-02, PNorm = 800.4939, GNorm = 1.1734, lr_0 = 9.9338e-04
Validation binary_cross_entropy = 0.104065
Epoch 9381
Loss = 3.5930e-02, PNorm = 800.5152, GNorm = 2.0659, lr_0 = 9.9338e-04
Validation binary_cross_entropy = 0.096712
Epoch 9382
Loss = 2.7984e-02, PNorm = 800.5441, GNorm = 1.1400, lr_0 = 9.9338e-04
Validation binary_cross_entropy = 0.098118
Epoch 9383
Loss = 3.5435e-02, PNorm = 800.5762, GNorm = 0.8723, lr_0 = 9.9338e-04
Validation binary_cross_entropy = 0.091744
Epoch 9384
Loss = 8.7769e-03, PNorm = 800.6174, GNorm = 1.5585, lr_0 = 9.9338e-04
Validation binary_cross_entropy = 0.091918
Epoch 9385
Loss = 3.5878e-02, PNorm = 800.6577, GNorm = 1.1100, lr_0 = 9.9337e-04
Validation binary_cross_entropy = 0.117649
Epoch 9386
Loss = 5.2023e-02, PNorm = 800.6992, GNorm = 2.0300, lr_0 = 9.9337e-04
Validation binary_cross_entropy = 0.135003
Epoch 9387
Loss = 2.9016e-02, PNorm = 800.7247, GNorm = 1.1682, lr_0 = 9.9337e-04
Validation binary_cross_entropy = 0.119438
Epoch 9388
Loss = 9.2747e-02, PNorm = 800.7416, GNorm = 7.3737, lr_0 = 9.9337e-04
Validation binary_cross_entropy = 0.117661
Epoch 9389
Loss = 2.2430e-01, PNorm = 800.7840, GNorm = 14.2176, lr_0 = 9.9337e-04
Loss = 4.3407e-02, PNorm = 800.8565, GNorm = 0.5407, lr_0 = 9.9337e-04
Validation binary_cross_entropy = 0.133121
Epoch 9390
Loss = 2.6887e-02, PNorm = 800.8952, GNorm = 0.1125, lr_0 = 9.9337e-04
Validation binary_cross_entropy = 0.116354
Epoch 9391
Loss = 5.8417e-02, PNorm = 800.9240, GNorm = 6.8524, lr_0 = 9.9337e-04
Validation binary_cross_entropy = 0.124200
Epoch 9392
Loss = 2.6743e-02, PNorm = 800.9525, GNorm = 0.1935, lr_0 = 9.9337e-04
Validation binary_cross_entropy = 0.106225
Epoch 9393
Loss = 5.6041e-02, PNorm = 800.9766, GNorm = 0.9615, lr_0 = 9.9337e-04
Validation binary_cross_entropy = 0.097465
Epoch 9394
Loss = 2.8888e-02, PNorm = 801.0137, GNorm = 0.7967, lr_0 = 9.9337e-04
Validation binary_cross_entropy = 0.103648
Epoch 9395
Loss = 7.3165e-03, PNorm = 801.0522, GNorm = 0.4374, lr_0 = 9.9337e-04
Validation binary_cross_entropy = 0.114084
Epoch 9396
Loss = 3.6170e-02, PNorm = 801.0799, GNorm = 0.9900, lr_0 = 9.9337e-04
Validation binary_cross_entropy = 0.106149
Epoch 9397
Loss = 1.3146e-02, PNorm = 801.0982, GNorm = 0.0976, lr_0 = 9.9337e-04
Validation binary_cross_entropy = 0.112622
Epoch 9398
Loss = 1.3013e-02, PNorm = 801.1174, GNorm = 3.2751, lr_0 = 9.9336e-04
Validation binary_cross_entropy = 0.118945
Epoch 9399
Loss = 6.0761e-04, PNorm = 801.1455, GNorm = 0.0805, lr_0 = 9.9336e-04
Loss = 1.5375e-02, PNorm = 801.1726, GNorm = 0.1028, lr_0 = 9.9336e-04
Validation binary_cross_entropy = 0.126993
Epoch 9400
Loss = 6.5007e-02, PNorm = 801.2120, GNorm = 4.0391, lr_0 = 9.9336e-04
Validation binary_cross_entropy = 0.191157
Epoch 9401
Loss = 1.8889e-02, PNorm = 801.2369, GNorm = 1.3163, lr_0 = 9.9336e-04
Validation binary_cross_entropy = 0.158858
Epoch 9402
Loss = 1.2152e-02, PNorm = 801.2549, GNorm = 1.1391, lr_0 = 9.9336e-04
Validation binary_cross_entropy = 0.142650
Epoch 9403
Loss = 6.9938e-02, PNorm = 801.2729, GNorm = 18.3382, lr_0 = 9.9336e-04
Validation binary_cross_entropy = 0.121569
Epoch 9404
Loss = 1.2099e-02, PNorm = 801.3084, GNorm = 1.3272, lr_0 = 9.9336e-04
Validation binary_cross_entropy = 0.139603
Epoch 9405
Loss = 4.6008e-02, PNorm = 801.3354, GNorm = 1.4388, lr_0 = 9.9336e-04
Validation binary_cross_entropy = 0.139966
Epoch 9406
Loss = 7.9964e-03, PNorm = 801.3634, GNorm = 0.0346, lr_0 = 9.9336e-04
Validation binary_cross_entropy = 0.138855
Epoch 9407
Loss = 6.7642e-03, PNorm = 801.3820, GNorm = 0.0935, lr_0 = 9.9336e-04
Validation binary_cross_entropy = 0.121858
Epoch 9408
Loss = 8.6389e-03, PNorm = 801.3971, GNorm = 0.0790, lr_0 = 9.9336e-04
Validation binary_cross_entropy = 0.104130
Epoch 9409
Loss = 4.4991e-02, PNorm = 801.4153, GNorm = 3.7159, lr_0 = 9.9336e-04
Loss = 3.0438e-02, PNorm = 801.4457, GNorm = 2.1802, lr_0 = 9.9336e-04
Validation binary_cross_entropy = 0.103172
Epoch 9410
Loss = 1.0067e-02, PNorm = 801.4808, GNorm = 1.4653, lr_0 = 9.9336e-04
Validation binary_cross_entropy = 0.116657
Epoch 9411
Loss = 2.8946e-02, PNorm = 801.4973, GNorm = 1.3658, lr_0 = 9.9335e-04
Validation binary_cross_entropy = 0.113588
Epoch 9412
Loss = 6.9070e-03, PNorm = 801.5086, GNorm = 0.0887, lr_0 = 9.9335e-04
Validation binary_cross_entropy = 0.119235
Epoch 9413
Loss = 7.5333e-02, PNorm = 801.5287, GNorm = 1.2853, lr_0 = 9.9335e-04
Validation binary_cross_entropy = 0.125670
Epoch 9414
Loss = 1.7202e-02, PNorm = 801.5514, GNorm = 0.6107, lr_0 = 9.9335e-04
Validation binary_cross_entropy = 0.120336
Epoch 9415
Loss = 5.3346e-03, PNorm = 801.5671, GNorm = 0.5392, lr_0 = 9.9335e-04
Validation binary_cross_entropy = 0.126786
Epoch 9416
Loss = 3.4710e-03, PNorm = 801.5874, GNorm = 0.1755, lr_0 = 9.9335e-04
Validation binary_cross_entropy = 0.106650
Epoch 9417
Loss = 1.5898e-02, PNorm = 801.6275, GNorm = 0.1747, lr_0 = 9.9335e-04
Validation binary_cross_entropy = 0.121982
Epoch 9418
Loss = 3.0393e-03, PNorm = 801.6811, GNorm = 0.2297, lr_0 = 9.9335e-04
Validation binary_cross_entropy = 0.167569
Epoch 9419
Loss = 2.8196e-02, PNorm = 801.7230, GNorm = 1.1238, lr_0 = 9.9335e-04
Loss = 1.8753e-02, PNorm = 801.7425, GNorm = 0.5585, lr_0 = 9.9335e-04
Validation binary_cross_entropy = 0.154951
Epoch 9420
Loss = 7.2454e-03, PNorm = 801.7587, GNorm = 0.0576, lr_0 = 9.9335e-04
Validation binary_cross_entropy = 0.151766
Epoch 9421
Loss = 1.0275e-02, PNorm = 801.7758, GNorm = 0.1665, lr_0 = 9.9335e-04
Validation binary_cross_entropy = 0.172825
Epoch 9422
Loss = 4.7197e-02, PNorm = 801.7949, GNorm = 1.3127, lr_0 = 9.9335e-04
Validation binary_cross_entropy = 0.170473
Epoch 9423
Loss = 7.8250e-03, PNorm = 801.8147, GNorm = 0.2549, lr_0 = 9.9335e-04
Validation binary_cross_entropy = 0.132496
Epoch 9424
Loss = 2.0767e-02, PNorm = 801.8350, GNorm = 0.0575, lr_0 = 9.9335e-04
Validation binary_cross_entropy = 0.133910
Epoch 9425
Loss = 4.1952e-02, PNorm = 801.8618, GNorm = 0.1749, lr_0 = 9.9334e-04
Validation binary_cross_entropy = 0.128511
Epoch 9426
Loss = 1.2337e-02, PNorm = 801.8853, GNorm = 0.9975, lr_0 = 9.9334e-04
Validation binary_cross_entropy = 0.142909
Epoch 9427
Loss = 2.4998e-02, PNorm = 801.9087, GNorm = 0.2846, lr_0 = 9.9334e-04
Validation binary_cross_entropy = 0.152410
Epoch 9428
Loss = 2.4320e-02, PNorm = 801.9257, GNorm = 2.1440, lr_0 = 9.9334e-04
Validation binary_cross_entropy = 0.154157
Epoch 9429
Loss = 2.0696e-02, PNorm = 801.9364, GNorm = 2.4093, lr_0 = 9.9334e-04
Loss = 9.7589e-03, PNorm = 801.9572, GNorm = 1.2624, lr_0 = 9.9334e-04
Validation binary_cross_entropy = 0.163582
Epoch 9430
Loss = 9.9769e-03, PNorm = 801.9735, GNorm = 0.1311, lr_0 = 9.9334e-04
Validation binary_cross_entropy = 0.165867
Epoch 9431
Loss = 1.1337e-02, PNorm = 801.9898, GNorm = 0.5676, lr_0 = 9.9334e-04
Validation binary_cross_entropy = 0.165351
Epoch 9432
Loss = 1.0529e-02, PNorm = 802.0210, GNorm = 0.0123, lr_0 = 9.9334e-04
Validation binary_cross_entropy = 0.181373
Epoch 9433
Loss = 2.2297e-02, PNorm = 802.0463, GNorm = 0.2362, lr_0 = 9.9334e-04
Validation binary_cross_entropy = 0.169065
Epoch 9434
Loss = 6.6412e-02, PNorm = 802.0623, GNorm = 3.2725, lr_0 = 9.9334e-04
Validation binary_cross_entropy = 0.110115
Epoch 9435
Loss = 1.4923e-02, PNorm = 802.0950, GNorm = 0.8356, lr_0 = 9.9334e-04
Validation binary_cross_entropy = 0.106148
Epoch 9436
Loss = 1.9660e-02, PNorm = 802.1444, GNorm = 0.2016, lr_0 = 9.9334e-04
Validation binary_cross_entropy = 0.114374
Epoch 9437
Loss = 1.2681e-02, PNorm = 802.1795, GNorm = 4.3103, lr_0 = 9.9334e-04
Validation binary_cross_entropy = 0.110633
Epoch 9438
Loss = 4.6264e-01, PNorm = 802.2111, GNorm = 0.1813, lr_0 = 9.9333e-04
Validation binary_cross_entropy = 0.087150
Epoch 9439
Loss = 9.5427e-02, PNorm = 802.2779, GNorm = 6.8840, lr_0 = 9.9333e-04
Loss = 2.8985e-02, PNorm = 802.3492, GNorm = 1.8213, lr_0 = 9.9333e-04
Validation binary_cross_entropy = 0.099660
Epoch 9440
Loss = 5.5023e-02, PNorm = 802.3883, GNorm = 0.3749, lr_0 = 9.9333e-04
Validation binary_cross_entropy = 0.081281
Epoch 9441
Loss = 1.5355e-01, PNorm = 802.4343, GNorm = 2.8587, lr_0 = 9.9333e-04
Validation binary_cross_entropy = 0.066115
Epoch 9442
Loss = 7.9875e-02, PNorm = 802.5073, GNorm = 20.4075, lr_0 = 9.9333e-04
Validation binary_cross_entropy = 0.109908
Epoch 9443
Loss = 1.2225e-01, PNorm = 802.5618, GNorm = 8.4261, lr_0 = 9.9333e-04
Validation binary_cross_entropy = 0.136445
Epoch 9444
Loss = 5.0245e-02, PNorm = 802.6189, GNorm = 2.3926, lr_0 = 9.9333e-04
Validation binary_cross_entropy = 0.092718
Epoch 9445
Loss = 7.8360e-02, PNorm = 802.6730, GNorm = 3.2751, lr_0 = 9.9333e-04
Validation binary_cross_entropy = 0.112627
Epoch 9446
Loss = 2.9079e-02, PNorm = 802.7114, GNorm = 0.7584, lr_0 = 9.9333e-04
Validation binary_cross_entropy = 0.106257
Epoch 9447
Loss = 6.9361e-02, PNorm = 802.7616, GNorm = 9.2484, lr_0 = 9.9333e-04
Validation binary_cross_entropy = 0.081076
Epoch 9448
Loss = 5.0297e-02, PNorm = 802.8206, GNorm = 8.4465, lr_0 = 9.9333e-04
Validation binary_cross_entropy = 0.092457
Epoch 9449
Loss = 8.8278e-03, PNorm = 802.8891, GNorm = 0.7678, lr_0 = 9.9333e-04
Loss = 3.4911e-02, PNorm = 802.9342, GNorm = 0.3865, lr_0 = 9.9333e-04
Validation binary_cross_entropy = 0.100578
Epoch 9450
Loss = 5.5299e-02, PNorm = 802.9786, GNorm = 6.3133, lr_0 = 9.9333e-04
Validation binary_cross_entropy = 0.099922
Epoch 9451
Loss = 3.9286e-02, PNorm = 803.0395, GNorm = 1.1713, lr_0 = 9.9332e-04
Validation binary_cross_entropy = 0.098519
Epoch 9452
Loss = 4.6603e-02, PNorm = 803.1007, GNorm = 0.8390, lr_0 = 9.9332e-04
Validation binary_cross_entropy = 0.101309
Epoch 9453
Loss = 3.1806e-02, PNorm = 803.1437, GNorm = 2.7487, lr_0 = 9.9332e-04
Validation binary_cross_entropy = 0.108650
Epoch 9454
Loss = 1.4343e-02, PNorm = 803.1872, GNorm = 0.8657, lr_0 = 9.9332e-04
Validation binary_cross_entropy = 0.117919
Epoch 9455
Loss = 1.0589e-01, PNorm = 803.2286, GNorm = 11.1542, lr_0 = 9.9332e-04
Validation binary_cross_entropy = 0.107100
Epoch 9456
Loss = 6.2151e-02, PNorm = 803.2851, GNorm = 3.1332, lr_0 = 9.9332e-04
Validation binary_cross_entropy = 0.114165
Epoch 9457
Loss = 2.8780e-02, PNorm = 803.3319, GNorm = 0.3730, lr_0 = 9.9332e-04
Validation binary_cross_entropy = 0.115248
Epoch 9458
Loss = 4.5661e-01, PNorm = 803.3717, GNorm = 4.2002, lr_0 = 9.9332e-04
Validation binary_cross_entropy = 0.118173
Epoch 9459
Loss = 6.4340e-02, PNorm = 803.4129, GNorm = 1.8112, lr_0 = 9.9332e-04
Loss = 3.0502e-02, PNorm = 803.4571, GNorm = 2.2791, lr_0 = 9.9332e-04
Validation binary_cross_entropy = 0.136141
Epoch 9460
Loss = 3.3146e-02, PNorm = 803.4913, GNorm = 0.2889, lr_0 = 9.9332e-04
Validation binary_cross_entropy = 0.124427
Epoch 9461
Loss = 7.4024e-02, PNorm = 803.5263, GNorm = 0.9494, lr_0 = 9.9332e-04
Validation binary_cross_entropy = 0.103765
Epoch 9462
Loss = 5.0305e-02, PNorm = 803.5774, GNorm = 0.1629, lr_0 = 9.9332e-04
Validation binary_cross_entropy = 0.113852
Epoch 9463
Loss = 4.2629e-02, PNorm = 803.6282, GNorm = 0.7922, lr_0 = 9.9332e-04
Validation binary_cross_entropy = 0.114826
Epoch 9464
Loss = 8.2213e-02, PNorm = 803.6762, GNorm = 0.1795, lr_0 = 9.9331e-04
Validation binary_cross_entropy = 0.111226
Epoch 9465
Loss = 7.8769e-02, PNorm = 803.7151, GNorm = 14.9181, lr_0 = 9.9331e-04
Validation binary_cross_entropy = 0.115972
Epoch 9466
Loss = 5.7419e-03, PNorm = 803.7481, GNorm = 0.0948, lr_0 = 9.9331e-04
Validation binary_cross_entropy = 0.131961
Epoch 9467
Loss = 2.3366e-02, PNorm = 803.7723, GNorm = 5.6132, lr_0 = 9.9331e-04
Validation binary_cross_entropy = 0.135291
Epoch 9468
Loss = 6.7563e-02, PNorm = 803.7999, GNorm = 3.0974, lr_0 = 9.9331e-04
Validation binary_cross_entropy = 0.106935
Epoch 9469
Loss = 5.1993e-02, PNorm = 803.8266, GNorm = 2.2876, lr_0 = 9.9331e-04
Loss = 6.9510e-02, PNorm = 803.8653, GNorm = 0.4403, lr_0 = 9.9331e-04
Validation binary_cross_entropy = 0.105646
Epoch 9470
Loss = 1.7957e-02, PNorm = 803.8955, GNorm = 0.4447, lr_0 = 9.9331e-04
Validation binary_cross_entropy = 0.103676
Epoch 9471
Loss = 3.5390e-02, PNorm = 803.9155, GNorm = 0.6288, lr_0 = 9.9331e-04
Validation binary_cross_entropy = 0.101334
Epoch 9472
Loss = 7.6133e-02, PNorm = 803.9517, GNorm = 3.4101, lr_0 = 9.9331e-04
Validation binary_cross_entropy = 0.110591
Epoch 9473
Loss = 1.7489e-02, PNorm = 803.9816, GNorm = 1.0691, lr_0 = 9.9331e-04
Validation binary_cross_entropy = 0.084764
Epoch 9474
Loss = 9.3114e-02, PNorm = 804.0186, GNorm = 1.4986, lr_0 = 9.9331e-04
Validation binary_cross_entropy = 0.096340
Epoch 9475
Loss = 6.5158e-03, PNorm = 804.0677, GNorm = 0.3545, lr_0 = 9.9331e-04
Validation binary_cross_entropy = 0.117429
Epoch 9476
Loss = 5.5049e-02, PNorm = 804.1082, GNorm = 3.3232, lr_0 = 9.9331e-04
Validation binary_cross_entropy = 0.078066
Epoch 9477
Loss = 4.8817e-01, PNorm = 804.1989, GNorm = 32.7706, lr_0 = 9.9331e-04
Validation binary_cross_entropy = 0.351821
Epoch 9478
Loss = 1.7183e-01, PNorm = 804.3421, GNorm = 3.6053, lr_0 = 9.9330e-04
Validation binary_cross_entropy = 0.102779
Epoch 9479
Loss = 1.6307e-02, PNorm = 804.4127, GNorm = 1.5975, lr_0 = 9.9330e-04
Loss = 1.0298e-01, PNorm = 804.4768, GNorm = 0.4725, lr_0 = 9.9330e-04
Validation binary_cross_entropy = 0.131320
Epoch 9480
Loss = 8.3827e-02, PNorm = 804.5294, GNorm = 0.8202, lr_0 = 9.9330e-04
Validation binary_cross_entropy = 0.128623
Epoch 9481
Loss = 5.1507e-02, PNorm = 804.5679, GNorm = 6.3103, lr_0 = 9.9330e-04
Validation binary_cross_entropy = 0.111977
Epoch 9482
Loss = 1.4881e-01, PNorm = 804.6207, GNorm = 0.9384, lr_0 = 9.9330e-04
Validation binary_cross_entropy = 0.117901
Epoch 9483
Loss = 6.5549e-02, PNorm = 804.6785, GNorm = 0.9570, lr_0 = 9.9330e-04
Validation binary_cross_entropy = 0.130033
Epoch 9484
Loss = 1.1397e-01, PNorm = 804.7312, GNorm = 0.4230, lr_0 = 9.9330e-04
Validation binary_cross_entropy = 0.113901
Epoch 9485
Loss = 4.2936e-02, PNorm = 804.7913, GNorm = 0.5168, lr_0 = 9.9330e-04
Validation binary_cross_entropy = 0.120528
Epoch 9486
Loss = 7.3081e-02, PNorm = 804.8481, GNorm = 9.5873, lr_0 = 9.9330e-04
Validation binary_cross_entropy = 0.129476
Epoch 9487
Loss = 6.8197e-02, PNorm = 804.9006, GNorm = 0.6270, lr_0 = 9.9330e-04
Validation binary_cross_entropy = 0.129756
Epoch 9488
Loss = 8.4863e-03, PNorm = 804.9512, GNorm = 0.6825, lr_0 = 9.9330e-04
Validation binary_cross_entropy = 0.109648
Epoch 9489
Loss = 1.7900e-01, PNorm = 804.9838, GNorm = 6.5923, lr_0 = 9.9330e-04
Loss = 4.2300e-02, PNorm = 805.0351, GNorm = 2.2132, lr_0 = 9.9330e-04
Validation binary_cross_entropy = 0.110926
Epoch 9490
Loss = 3.9927e-02, PNorm = 805.0835, GNorm = 0.6014, lr_0 = 9.9330e-04
Validation binary_cross_entropy = 0.104581
Epoch 9491
Loss = 3.3517e-02, PNorm = 805.1219, GNorm = 0.2398, lr_0 = 9.9329e-04
Validation binary_cross_entropy = 0.108266
Epoch 9492
Loss = 7.4189e-02, PNorm = 805.1618, GNorm = 3.3318, lr_0 = 9.9329e-04
Validation binary_cross_entropy = 0.142419
Epoch 9493
Loss = 6.1386e-02, PNorm = 805.1934, GNorm = 1.6350, lr_0 = 9.9329e-04
Validation binary_cross_entropy = 0.129058
Epoch 9494
Loss = 5.2043e-02, PNorm = 805.2097, GNorm = 0.3262, lr_0 = 9.9329e-04
Validation binary_cross_entropy = 0.098761
Epoch 9495
Loss = 9.2402e-02, PNorm = 805.2424, GNorm = 0.2835, lr_0 = 9.9329e-04
Validation binary_cross_entropy = 0.103230
Epoch 9496
Loss = 1.9874e-02, PNorm = 805.2890, GNorm = 0.9418, lr_0 = 9.9329e-04
Validation binary_cross_entropy = 0.109070
Epoch 9497
Loss = 4.9185e-02, PNorm = 805.3223, GNorm = 9.1295, lr_0 = 9.9329e-04
Validation binary_cross_entropy = 0.108794
Epoch 9498
Loss = 2.4093e-01, PNorm = 805.3517, GNorm = 8.4220, lr_0 = 9.9329e-04
Validation binary_cross_entropy = 0.087952
Epoch 9499
Loss = 3.3800e-02, PNorm = 805.3839, GNorm = 1.4480, lr_0 = 9.9329e-04
Loss = 3.8081e-02, PNorm = 805.4229, GNorm = 2.2657, lr_0 = 9.9329e-04
Validation binary_cross_entropy = 0.090139
Epoch 9500
Loss = 5.1490e-02, PNorm = 805.4707, GNorm = 1.7254, lr_0 = 9.9329e-04
Validation binary_cross_entropy = 0.116198
Epoch 9501
Loss = 2.4392e-02, PNorm = 805.5044, GNorm = 0.3078, lr_0 = 9.9329e-04
Validation binary_cross_entropy = 0.124644
Epoch 9502
Loss = 1.1937e-02, PNorm = 805.5308, GNorm = 0.8510, lr_0 = 9.9329e-04
Validation binary_cross_entropy = 0.121701
Epoch 9503
Loss = 7.2441e-02, PNorm = 805.5456, GNorm = 0.1894, lr_0 = 9.9329e-04
Validation binary_cross_entropy = 0.095915
Epoch 9504
Loss = 5.5074e-02, PNorm = 805.5822, GNorm = 1.1677, lr_0 = 9.9328e-04
Validation binary_cross_entropy = 0.125148
Epoch 9505
Loss = 2.1609e-02, PNorm = 805.6269, GNorm = 0.6187, lr_0 = 9.9328e-04
Validation binary_cross_entropy = 0.225969
Epoch 9506
Loss = 1.5645e-01, PNorm = 805.6640, GNorm = 0.4901, lr_0 = 9.9328e-04
Validation binary_cross_entropy = 0.117110
Epoch 9507
Loss = 3.0994e-03, PNorm = 805.7246, GNorm = 0.1143, lr_0 = 9.9328e-04
Validation binary_cross_entropy = 0.145653
Epoch 9508
Loss = 2.5528e-02, PNorm = 805.7928, GNorm = 0.6062, lr_0 = 9.9328e-04
Validation binary_cross_entropy = 0.169218
Epoch 9509
Loss = 1.4765e-01, PNorm = 805.8316, GNorm = 11.1325, lr_0 = 9.9328e-04
Loss = 1.8786e-02, PNorm = 805.8763, GNorm = 0.3180, lr_0 = 9.9328e-04
Validation binary_cross_entropy = 0.163433
Epoch 9510
Loss = 1.3592e-01, PNorm = 805.9189, GNorm = 4.1374, lr_0 = 9.9328e-04
Validation binary_cross_entropy = 0.236383
Epoch 9511
Loss = 3.0170e-02, PNorm = 805.9463, GNorm = 1.8639, lr_0 = 9.9328e-04
Validation binary_cross_entropy = 0.182934
Epoch 9512
Loss = 1.0952e-01, PNorm = 805.9731, GNorm = 2.3001, lr_0 = 9.9328e-04
Validation binary_cross_entropy = 0.135617
Epoch 9513
Loss = 5.2723e-02, PNorm = 806.0096, GNorm = 2.3958, lr_0 = 9.9328e-04
Validation binary_cross_entropy = 0.130608
Epoch 9514
Loss = 2.5584e-02, PNorm = 806.0499, GNorm = 0.4086, lr_0 = 9.9328e-04
Validation binary_cross_entropy = 0.149403
Epoch 9515
Loss = 6.8739e-02, PNorm = 806.0924, GNorm = 0.7345, lr_0 = 9.9328e-04
Validation binary_cross_entropy = 0.149745
Epoch 9516
Loss = 3.0072e-01, PNorm = 806.1363, GNorm = 1.6244, lr_0 = 9.9328e-04
Validation binary_cross_entropy = 0.131909
Epoch 9517
Loss = 9.4419e-03, PNorm = 806.1728, GNorm = 0.1998, lr_0 = 9.9328e-04
Validation binary_cross_entropy = 0.139418
Epoch 9518
Loss = 2.4613e-02, PNorm = 806.2152, GNorm = 0.9956, lr_0 = 9.9327e-04
Validation binary_cross_entropy = 0.142294
Epoch 9519
Loss = 7.0980e-03, PNorm = 806.2508, GNorm = 0.5249, lr_0 = 9.9327e-04
Loss = 6.5092e-02, PNorm = 806.2869, GNorm = 14.2383, lr_0 = 9.9327e-04
Validation binary_cross_entropy = 0.108349
Epoch 9520
Loss = 6.8201e-02, PNorm = 806.3316, GNorm = 1.7107, lr_0 = 9.9327e-04
Validation binary_cross_entropy = 0.117738
Epoch 9521
Loss = 1.2216e-01, PNorm = 806.3722, GNorm = 1.5384, lr_0 = 9.9327e-04
Validation binary_cross_entropy = 0.084855
Epoch 9522
Loss = 6.7951e-02, PNorm = 806.4165, GNorm = 1.9249, lr_0 = 9.9327e-04
Validation binary_cross_entropy = 0.083018
Epoch 9523
Loss = 2.2109e-02, PNorm = 806.4811, GNorm = 0.2756, lr_0 = 9.9327e-04
Validation binary_cross_entropy = 0.105239
Epoch 9524
Loss = 5.9112e-02, PNorm = 806.5246, GNorm = 0.2148, lr_0 = 9.9327e-04
Validation binary_cross_entropy = 0.099466
Epoch 9525
Loss = 3.4808e-02, PNorm = 806.5601, GNorm = 0.9360, lr_0 = 9.9327e-04
Validation binary_cross_entropy = 0.099795
Epoch 9526
Loss = 4.8280e-02, PNorm = 806.6025, GNorm = 7.5899, lr_0 = 9.9327e-04
Validation binary_cross_entropy = 0.137990
Epoch 9527
Loss = 1.1480e-01, PNorm = 806.6422, GNorm = 5.0967, lr_0 = 9.9327e-04
Validation binary_cross_entropy = 0.117517
Epoch 9528
Loss = 4.9628e-02, PNorm = 806.6711, GNorm = 4.2338, lr_0 = 9.9327e-04
Validation binary_cross_entropy = 0.116226
Epoch 9529
Loss = 1.0663e-02, PNorm = 806.6981, GNorm = 1.1068, lr_0 = 9.9327e-04
Loss = 5.8638e-02, PNorm = 806.7141, GNorm = 0.7776, lr_0 = 9.9327e-04
Validation binary_cross_entropy = 0.107146
Epoch 9530
Loss = 2.6814e-02, PNorm = 806.7423, GNorm = 2.1460, lr_0 = 9.9327e-04
Validation binary_cross_entropy = 0.109545
Epoch 9531
Loss = 4.6634e-01, PNorm = 806.7718, GNorm = 1.4131, lr_0 = 9.9326e-04
Validation binary_cross_entropy = 0.130247
Epoch 9532
Loss = 7.7618e-02, PNorm = 806.8138, GNorm = 6.0059, lr_0 = 9.9326e-04
Validation binary_cross_entropy = 0.109869
Epoch 9533
Loss = 5.2455e-02, PNorm = 806.8491, GNorm = 4.4507, lr_0 = 9.9326e-04
Validation binary_cross_entropy = 0.088841
Epoch 9534
Loss = 1.0547e-01, PNorm = 806.8834, GNorm = 1.0185, lr_0 = 9.9326e-04
Validation binary_cross_entropy = 0.096816
Epoch 9535
Loss = 2.9864e-02, PNorm = 806.9115, GNorm = 2.0986, lr_0 = 9.9326e-04
Validation binary_cross_entropy = 0.080766
Epoch 9536
Loss = 3.9982e-02, PNorm = 806.9493, GNorm = 1.2408, lr_0 = 9.9326e-04
Validation binary_cross_entropy = 0.086599
Epoch 9537
Loss = 2.7144e-02, PNorm = 806.9973, GNorm = 1.2994, lr_0 = 9.9326e-04
Validation binary_cross_entropy = 0.108321
Epoch 9538
Loss = 4.0732e-02, PNorm = 807.0364, GNorm = 2.7167, lr_0 = 9.9326e-04
Validation binary_cross_entropy = 0.100618
Epoch 9539
Loss = 8.4505e-03, PNorm = 807.0660, GNorm = 0.2558, lr_0 = 9.9326e-04
Loss = 2.3515e-02, PNorm = 807.1013, GNorm = 0.4882, lr_0 = 9.9326e-04
Validation binary_cross_entropy = 0.093727
Epoch 9540
Loss = 2.9757e-02, PNorm = 807.1318, GNorm = 2.3868, lr_0 = 9.9326e-04
Validation binary_cross_entropy = 0.099308
Epoch 9541
Loss = 2.4080e-02, PNorm = 807.1597, GNorm = 0.6874, lr_0 = 9.9326e-04
Validation binary_cross_entropy = 0.108027
Epoch 9542
Loss = 2.3456e-02, PNorm = 807.1883, GNorm = 0.0560, lr_0 = 9.9326e-04
Validation binary_cross_entropy = 0.117460
Epoch 9543
Loss = 4.1156e-02, PNorm = 807.2092, GNorm = 1.1347, lr_0 = 9.9326e-04
Validation binary_cross_entropy = 0.114861
Epoch 9544
Loss = 1.1508e-01, PNorm = 807.2286, GNorm = 5.9085, lr_0 = 9.9325e-04
Validation binary_cross_entropy = 0.091932
Epoch 9545
Loss = 3.6630e-02, PNorm = 807.2780, GNorm = 0.1539, lr_0 = 9.9325e-04
Validation binary_cross_entropy = 0.084423
Epoch 9546
Loss = 3.0156e-02, PNorm = 807.3193, GNorm = 4.8119, lr_0 = 9.9325e-04
Validation binary_cross_entropy = 0.132580
Epoch 9547
Loss = 1.2230e-01, PNorm = 807.3535, GNorm = 6.8382, lr_0 = 9.9325e-04
Validation binary_cross_entropy = 0.085522
Epoch 9548
Loss = 1.7296e-02, PNorm = 807.3817, GNorm = 0.5626, lr_0 = 9.9325e-04
Validation binary_cross_entropy = 0.089316
Epoch 9549
Loss = 4.9571e-03, PNorm = 807.4181, GNorm = 0.6430, lr_0 = 9.9325e-04
Loss = 2.5615e-02, PNorm = 807.4500, GNorm = 1.8222, lr_0 = 9.9325e-04
Validation binary_cross_entropy = 0.100036
Epoch 9550
Loss = 4.1741e-02, PNorm = 807.4698, GNorm = 3.2680, lr_0 = 9.9325e-04
Validation binary_cross_entropy = 0.088639
Epoch 9551
Loss = 6.8584e-02, PNorm = 807.4999, GNorm = 0.8826, lr_0 = 9.9325e-04
Validation binary_cross_entropy = 0.093090
Epoch 9552
Loss = 2.2023e-02, PNorm = 807.5268, GNorm = 0.8624, lr_0 = 9.9325e-04
Validation binary_cross_entropy = 0.086946
Epoch 9553
Loss = 1.4040e-02, PNorm = 807.5439, GNorm = 0.7063, lr_0 = 9.9325e-04
Validation binary_cross_entropy = 0.082942
Epoch 9554
Loss = 2.8094e-02, PNorm = 807.5618, GNorm = 2.5948, lr_0 = 9.9325e-04
Validation binary_cross_entropy = 0.086318
Epoch 9555
Loss = 1.6750e-02, PNorm = 807.5902, GNorm = 0.0864, lr_0 = 9.9325e-04
Validation binary_cross_entropy = 0.090063
Epoch 9556
Loss = 3.1743e-03, PNorm = 807.6173, GNorm = 0.0598, lr_0 = 9.9325e-04
Validation binary_cross_entropy = 0.099167
Epoch 9557
Loss = 2.6569e-03, PNorm = 807.6393, GNorm = 0.1217, lr_0 = 9.9325e-04
Validation binary_cross_entropy = 0.100986
Epoch 9558
Loss = 9.3867e-03, PNorm = 807.6580, GNorm = 1.5253, lr_0 = 9.9324e-04
Validation binary_cross_entropy = 0.098421
Epoch 9559
Loss = 1.7259e-03, PNorm = 807.6766, GNorm = 0.1430, lr_0 = 9.9324e-04
Loss = 2.4385e-02, PNorm = 807.6941, GNorm = 0.9237, lr_0 = 9.9324e-04
Validation binary_cross_entropy = 0.090295
Epoch 9560
Loss = 1.2732e-02, PNorm = 807.7171, GNorm = 0.1551, lr_0 = 9.9324e-04
Validation binary_cross_entropy = 0.093370
Epoch 9561
Loss = 4.2433e-02, PNorm = 807.7491, GNorm = 0.1675, lr_0 = 9.9324e-04
Validation binary_cross_entropy = 0.116783
Epoch 9562
Loss = 9.6618e-02, PNorm = 807.7712, GNorm = 0.0572, lr_0 = 9.9324e-04
Validation binary_cross_entropy = 0.089050
Epoch 9563
Loss = 6.4254e-03, PNorm = 807.7912, GNorm = 1.0456, lr_0 = 9.9324e-04
Validation binary_cross_entropy = 0.088360
Epoch 9564
Loss = 1.3361e-02, PNorm = 807.8181, GNorm = 0.9919, lr_0 = 9.9324e-04
Validation binary_cross_entropy = 0.093049
Epoch 9565
Loss = 1.8707e-02, PNorm = 807.8434, GNorm = 5.8058, lr_0 = 9.9324e-04
Validation binary_cross_entropy = 0.100223
Epoch 9566
Loss = 2.2123e-02, PNorm = 807.8763, GNorm = 0.0936, lr_0 = 9.9324e-04
Validation binary_cross_entropy = 0.161094
Epoch 9567
Loss = 2.8650e-02, PNorm = 807.9014, GNorm = 6.0706, lr_0 = 9.9324e-04
Validation binary_cross_entropy = 0.184789
Epoch 9568
Loss = 4.4955e-03, PNorm = 807.9249, GNorm = 0.1524, lr_0 = 9.9324e-04
Validation binary_cross_entropy = 0.141682
Epoch 9569
Loss = 9.3357e-03, PNorm = 807.9477, GNorm = 1.1544, lr_0 = 9.9324e-04
Loss = 1.0419e-02, PNorm = 807.9763, GNorm = 1.3014, lr_0 = 9.9324e-04
Validation binary_cross_entropy = 0.133819
Epoch 9570
Loss = 3.5559e-02, PNorm = 808.0099, GNorm = 7.3741, lr_0 = 9.9323e-04
Validation binary_cross_entropy = 0.086756
Epoch 9571
Loss = 1.6578e-02, PNorm = 808.0462, GNorm = 0.7702, lr_0 = 9.9323e-04
Validation binary_cross_entropy = 0.080246
Epoch 9572
Loss = 1.4374e-02, PNorm = 808.0841, GNorm = 0.1566, lr_0 = 9.9323e-04
Validation binary_cross_entropy = 0.083541
Epoch 9573
Loss = 4.6867e-02, PNorm = 808.1166, GNorm = 0.9524, lr_0 = 9.9323e-04
Validation binary_cross_entropy = 0.087228
Epoch 9574
Loss = 4.6667e-02, PNorm = 808.1462, GNorm = 2.3720, lr_0 = 9.9323e-04
Validation binary_cross_entropy = 0.084951
Epoch 9575
Loss = 3.4443e-02, PNorm = 808.1778, GNorm = 0.4013, lr_0 = 9.9323e-04
Validation binary_cross_entropy = 0.091100
Epoch 9576
Loss = 4.2282e-03, PNorm = 808.2041, GNorm = 1.0353, lr_0 = 9.9323e-04
Validation binary_cross_entropy = 0.081288
Epoch 9577
Loss = 7.1401e-02, PNorm = 808.2212, GNorm = 0.3224, lr_0 = 9.9323e-04
Validation binary_cross_entropy = 0.070278
Epoch 9578
Loss = 4.7557e-02, PNorm = 808.2512, GNorm = 0.4404, lr_0 = 9.9323e-04
Validation binary_cross_entropy = 0.069982
Epoch 9579
Loss = 5.2584e-02, PNorm = 808.3089, GNorm = 0.6327, lr_0 = 9.9323e-04
Loss = 3.3821e-02, PNorm = 808.3538, GNorm = 0.3703, lr_0 = 9.9323e-04
Validation binary_cross_entropy = 0.076430
Epoch 9580
Loss = 3.0068e-02, PNorm = 808.3913, GNorm = 0.1988, lr_0 = 9.9323e-04
Validation binary_cross_entropy = 0.077735
Epoch 9581
Loss = 1.2730e-02, PNorm = 808.4208, GNorm = 0.6389, lr_0 = 9.9323e-04
Validation binary_cross_entropy = 0.084004
Epoch 9582
Loss = 2.8302e-02, PNorm = 808.4492, GNorm = 0.2589, lr_0 = 9.9323e-04
Validation binary_cross_entropy = 0.088171
Epoch 9583
Loss = 2.3477e-02, PNorm = 808.4791, GNorm = 2.3911, lr_0 = 9.9323e-04
Validation binary_cross_entropy = 0.076697
Epoch 9584
Loss = 5.0533e-02, PNorm = 808.5118, GNorm = 5.8227, lr_0 = 9.9322e-04
Validation binary_cross_entropy = 0.079166
Epoch 9585
Loss = 1.5476e-02, PNorm = 808.5467, GNorm = 0.3577, lr_0 = 9.9322e-04
Validation binary_cross_entropy = 0.099203
Epoch 9586
Loss = 4.6991e-02, PNorm = 808.5808, GNorm = 0.2665, lr_0 = 9.9322e-04
Validation binary_cross_entropy = 0.090106
Epoch 9587
Loss = 2.1023e-01, PNorm = 808.6115, GNorm = 4.8702, lr_0 = 9.9322e-04
Validation binary_cross_entropy = 0.079011
Epoch 9588
Loss = 2.3514e-02, PNorm = 808.6462, GNorm = 0.2852, lr_0 = 9.9322e-04
Validation binary_cross_entropy = 0.079829
Epoch 9589
Loss = 4.0556e-02, PNorm = 808.6860, GNorm = 1.9787, lr_0 = 9.9322e-04
Loss = 2.8934e-02, PNorm = 808.7252, GNorm = 1.1428, lr_0 = 9.9322e-04
Validation binary_cross_entropy = 0.090757
Epoch 9590
Loss = 1.7542e-02, PNorm = 808.7516, GNorm = 1.4990, lr_0 = 9.9322e-04
Validation binary_cross_entropy = 0.103071
Epoch 9591
Loss = 1.0272e-01, PNorm = 808.7797, GNorm = 7.4136, lr_0 = 9.9322e-04
Validation binary_cross_entropy = 0.086541
Epoch 9592
Loss = 2.6588e-02, PNorm = 808.8201, GNorm = 6.7199, lr_0 = 9.9322e-04
Validation binary_cross_entropy = 0.090288
Epoch 9593
Loss = 6.5257e-02, PNorm = 808.8642, GNorm = 0.3636, lr_0 = 9.9322e-04
Validation binary_cross_entropy = 0.111205
Epoch 9594
Loss = 2.4304e-02, PNorm = 808.8934, GNorm = 0.1113, lr_0 = 9.9322e-04
Validation binary_cross_entropy = 0.104705
Epoch 9595
Loss = 3.1299e-02, PNorm = 808.9141, GNorm = 5.9803, lr_0 = 9.9322e-04
Validation binary_cross_entropy = 0.088068
Epoch 9596
Loss = 7.4175e-02, PNorm = 808.9392, GNorm = 3.1476, lr_0 = 9.9322e-04
Validation binary_cross_entropy = 0.082865
Epoch 9597
Loss = 1.9520e-02, PNorm = 808.9779, GNorm = 0.4448, lr_0 = 9.9322e-04
Validation binary_cross_entropy = 0.084722
Epoch 9598
Loss = 2.3447e-02, PNorm = 809.0126, GNorm = 1.7853, lr_0 = 9.9321e-04
Validation binary_cross_entropy = 0.087293
Epoch 9599
Loss = 8.4993e-02, PNorm = 809.0437, GNorm = 8.2065, lr_0 = 9.9321e-04
Loss = 5.0932e-02, PNorm = 809.0913, GNorm = 0.5166, lr_0 = 9.9321e-04
Validation binary_cross_entropy = 0.101027
Epoch 9600
Loss = 4.6269e-02, PNorm = 809.1203, GNorm = 0.6343, lr_0 = 9.9321e-04
Validation binary_cross_entropy = 0.101072
Epoch 9601
Loss = 2.0831e-02, PNorm = 809.1412, GNorm = 2.7622, lr_0 = 9.9321e-04
Validation binary_cross_entropy = 0.123717
Epoch 9602
Loss = 1.2097e-02, PNorm = 809.1557, GNorm = 0.2641, lr_0 = 9.9321e-04
Validation binary_cross_entropy = 0.099936
Epoch 9603
Loss = 3.7686e-02, PNorm = 809.1658, GNorm = 0.8069, lr_0 = 9.9321e-04
Validation binary_cross_entropy = 0.077286
Epoch 9604
Loss = 1.3756e-01, PNorm = 809.1949, GNorm = 19.2761, lr_0 = 9.9321e-04
Validation binary_cross_entropy = 0.079088
Epoch 9605
Loss = 1.2635e-02, PNorm = 809.2556, GNorm = 0.5929, lr_0 = 9.9321e-04
Validation binary_cross_entropy = 0.087861
Epoch 9606
Loss = 1.1663e-02, PNorm = 809.2932, GNorm = 3.7947, lr_0 = 9.9321e-04
Validation binary_cross_entropy = 0.091588
Epoch 9607
Loss = 8.9812e-03, PNorm = 809.3170, GNorm = 0.1731, lr_0 = 9.9321e-04
Validation binary_cross_entropy = 0.110480
Epoch 9608
Loss = 2.3728e-02, PNorm = 809.3684, GNorm = 1.3752, lr_0 = 9.9321e-04
Validation binary_cross_entropy = 0.139008
Epoch 9609
Loss = 1.4948e-02, PNorm = 809.4256, GNorm = 0.8366, lr_0 = 9.9321e-04
Loss = 3.2076e-02, PNorm = 809.4621, GNorm = 0.2457, lr_0 = 9.9321e-04
Validation binary_cross_entropy = 0.135369
Epoch 9610
Loss = 4.1481e-02, PNorm = 809.4931, GNorm = 1.5953, lr_0 = 9.9320e-04
Validation binary_cross_entropy = 0.130322
Epoch 9611
Loss = 9.0686e-02, PNorm = 809.5311, GNorm = 0.9982, lr_0 = 9.9320e-04
Validation binary_cross_entropy = 0.124815
Epoch 9612
Loss = 2.7492e-02, PNorm = 809.5899, GNorm = 0.4814, lr_0 = 9.9320e-04
Validation binary_cross_entropy = 0.144768
Epoch 9613
Loss = 4.6098e-02, PNorm = 809.6420, GNorm = 7.2646, lr_0 = 9.9320e-04
Validation binary_cross_entropy = 0.205200
Epoch 9614
Loss = 2.1748e-02, PNorm = 809.6772, GNorm = 0.2192, lr_0 = 9.9320e-04
Validation binary_cross_entropy = 0.170640
Epoch 9615
Loss = 2.5071e-02, PNorm = 809.7037, GNorm = 0.2014, lr_0 = 9.9320e-04
Validation binary_cross_entropy = 0.154925
Epoch 9616
Loss = 5.6097e-03, PNorm = 809.7239, GNorm = 0.4149, lr_0 = 9.9320e-04
Validation binary_cross_entropy = 0.201931
Epoch 9617
Loss = 6.8934e-03, PNorm = 809.7519, GNorm = 0.0097, lr_0 = 9.9320e-04
Validation binary_cross_entropy = 0.202965
Epoch 9618
Loss = 2.2398e-01, PNorm = 809.7813, GNorm = 0.1390, lr_0 = 9.9320e-04
Validation binary_cross_entropy = 0.145357
Epoch 9619
Loss = 5.5233e-02, PNorm = 809.8110, GNorm = 0.9007, lr_0 = 9.9320e-04
Loss = 3.8779e-02, PNorm = 809.8571, GNorm = 1.1682, lr_0 = 9.9320e-04
Validation binary_cross_entropy = 0.144280
Epoch 9620
Loss = 1.1943e-02, PNorm = 809.8968, GNorm = 0.2003, lr_0 = 9.9320e-04
Validation binary_cross_entropy = 0.144381
Epoch 9621
Loss = 4.3898e-02, PNorm = 809.9314, GNorm = 2.8385, lr_0 = 9.9320e-04
Validation binary_cross_entropy = 0.135127
Epoch 9622
Loss = 6.8261e-02, PNorm = 809.9822, GNorm = 0.6127, lr_0 = 9.9320e-04
Validation binary_cross_entropy = 0.139163
Epoch 9623
Loss = 9.8998e-03, PNorm = 810.0292, GNorm = 0.5772, lr_0 = 9.9320e-04
Validation binary_cross_entropy = 0.221502
Epoch 9624
Loss = 3.3766e-02, PNorm = 810.0618, GNorm = 0.6194, lr_0 = 9.9319e-04
Validation binary_cross_entropy = 0.159941
Epoch 9625
Loss = 8.5720e-02, PNorm = 810.0942, GNorm = 0.3684, lr_0 = 9.9319e-04
Validation binary_cross_entropy = 0.120698
Epoch 9626
Loss = 1.7279e-02, PNorm = 810.1332, GNorm = 0.4853, lr_0 = 9.9319e-04
Validation binary_cross_entropy = 0.113522
Epoch 9627
Loss = 3.1863e-02, PNorm = 810.1661, GNorm = 5.4798, lr_0 = 9.9319e-04
Validation binary_cross_entropy = 0.119458
Epoch 9628
Loss = 6.0052e-02, PNorm = 810.2076, GNorm = 3.3973, lr_0 = 9.9319e-04
Validation binary_cross_entropy = 0.122303
Epoch 9629
Loss = 1.3117e-03, PNorm = 810.2422, GNorm = 0.2204, lr_0 = 9.9319e-04
Loss = 6.1761e-02, PNorm = 810.2635, GNorm = 12.4390, lr_0 = 9.9319e-04
Validation binary_cross_entropy = 0.105862
Epoch 9630
Loss = 3.6279e-02, PNorm = 810.2898, GNorm = 0.4781, lr_0 = 9.9319e-04
Validation binary_cross_entropy = 0.094381
Epoch 9631
Loss = 3.7376e-02, PNorm = 810.3300, GNorm = 2.8001, lr_0 = 9.9319e-04
Validation binary_cross_entropy = 0.107464
Epoch 9632
Loss = 8.9894e-02, PNorm = 810.3666, GNorm = 2.3370, lr_0 = 9.9319e-04
Validation binary_cross_entropy = 0.116130
Epoch 9633
Loss = 2.8177e-02, PNorm = 810.3973, GNorm = 0.5942, lr_0 = 9.9319e-04
Validation binary_cross_entropy = 0.123161
Epoch 9634
Loss = 5.2793e-02, PNorm = 810.4318, GNorm = 0.0609, lr_0 = 9.9319e-04
Validation binary_cross_entropy = 0.112416
Epoch 9635
Loss = 4.3581e-02, PNorm = 810.4642, GNorm = 0.1613, lr_0 = 9.9319e-04
Validation binary_cross_entropy = 0.114279
Epoch 9636
Loss = 1.8880e-02, PNorm = 810.4936, GNorm = 0.1301, lr_0 = 9.9319e-04
Validation binary_cross_entropy = 0.135919
Epoch 9637
Loss = 1.2037e-01, PNorm = 810.5203, GNorm = 0.2036, lr_0 = 9.9319e-04
Validation binary_cross_entropy = 0.114893
Epoch 9638
Loss = 1.7617e-01, PNorm = 810.5406, GNorm = 4.7659, lr_0 = 9.9318e-04
Validation binary_cross_entropy = 0.109737
Epoch 9639
Loss = 7.4772e-03, PNorm = 810.5760, GNorm = 0.1676, lr_0 = 9.9318e-04
Loss = 2.4841e-02, PNorm = 810.6217, GNorm = 0.2403, lr_0 = 9.9318e-04
Validation binary_cross_entropy = 0.119522
Epoch 9640
Loss = 1.7532e-02, PNorm = 810.6571, GNorm = 0.1681, lr_0 = 9.9318e-04
Validation binary_cross_entropy = 0.119816
Epoch 9641
Loss = 2.0339e-01, PNorm = 810.6989, GNorm = 0.9419, lr_0 = 9.9318e-04
Validation binary_cross_entropy = 0.081834
Epoch 9642
Loss = 3.7360e-02, PNorm = 810.7862, GNorm = 4.8313, lr_0 = 9.9318e-04
Validation binary_cross_entropy = 0.110916
Epoch 9643
Loss = 5.3002e-02, PNorm = 810.8440, GNorm = 1.6155, lr_0 = 9.9318e-04
Validation binary_cross_entropy = 0.132561
Epoch 9644
Loss = 6.1763e-02, PNorm = 810.8870, GNorm = 4.4306, lr_0 = 9.9318e-04
Validation binary_cross_entropy = 0.097566
Epoch 9645
Loss = 2.6182e-02, PNorm = 810.9122, GNorm = 0.3554, lr_0 = 9.9318e-04
Validation binary_cross_entropy = 0.093757
Epoch 9646
Loss = 5.7370e-02, PNorm = 810.9328, GNorm = 0.5966, lr_0 = 9.9318e-04
Validation binary_cross_entropy = 0.083590
Epoch 9647
Loss = 1.1951e-02, PNorm = 810.9561, GNorm = 0.4668, lr_0 = 9.9318e-04
Validation binary_cross_entropy = 0.086957
Epoch 9648
Loss = 3.3433e-02, PNorm = 810.9838, GNorm = 2.4423, lr_0 = 9.9318e-04
Validation binary_cross_entropy = 0.120872
Epoch 9649
Loss = 2.2250e-03, PNorm = 811.0182, GNorm = 0.1457, lr_0 = 9.9318e-04
Loss = 7.0718e-02, PNorm = 811.0380, GNorm = 2.8558, lr_0 = 9.9318e-04
Validation binary_cross_entropy = 0.113894
Epoch 9650
Loss = 3.3772e-02, PNorm = 811.0636, GNorm = 5.1724, lr_0 = 9.9317e-04
Validation binary_cross_entropy = 0.091466
Epoch 9651
Loss = 6.8939e-02, PNorm = 811.0939, GNorm = 1.2961, lr_0 = 9.9317e-04
Validation binary_cross_entropy = 0.098364
Epoch 9652
Loss = 6.4668e-02, PNorm = 811.1197, GNorm = 0.7303, lr_0 = 9.9317e-04
Validation binary_cross_entropy = 0.081921
Epoch 9653
Loss = 5.1752e-02, PNorm = 811.1536, GNorm = 2.0762, lr_0 = 9.9317e-04
Validation binary_cross_entropy = 0.088651
Epoch 9654
Loss = 1.3435e-02, PNorm = 811.1851, GNorm = 0.4267, lr_0 = 9.9317e-04
Validation binary_cross_entropy = 0.095462
Epoch 9655
Loss = 3.9128e-02, PNorm = 811.2079, GNorm = 0.5249, lr_0 = 9.9317e-04
Validation binary_cross_entropy = 0.090286
Epoch 9656
Loss = 5.1867e-02, PNorm = 811.2329, GNorm = 2.4257, lr_0 = 9.9317e-04
Validation binary_cross_entropy = 0.105568
Epoch 9657
Loss = 2.6581e-02, PNorm = 811.2544, GNorm = 4.6930, lr_0 = 9.9317e-04
Validation binary_cross_entropy = 0.114019
Epoch 9658
Loss = 7.8217e-03, PNorm = 811.2898, GNorm = 0.8937, lr_0 = 9.9317e-04
Validation binary_cross_entropy = 0.113837
Epoch 9659
Loss = 7.1917e-02, PNorm = 811.3120, GNorm = 1.3753, lr_0 = 9.9317e-04
Loss = 5.9572e-02, PNorm = 811.3334, GNorm = 0.2271, lr_0 = 9.9317e-04
Validation binary_cross_entropy = 0.100855
Epoch 9660
Loss = 3.0622e-02, PNorm = 811.3648, GNorm = 3.1315, lr_0 = 9.9317e-04
Validation binary_cross_entropy = 0.106495
Epoch 9661
Loss = 1.3069e-02, PNorm = 811.4027, GNorm = 0.0700, lr_0 = 9.9317e-04
Validation binary_cross_entropy = 0.123607
Epoch 9662
Loss = 1.6077e-02, PNorm = 811.4277, GNorm = 0.1582, lr_0 = 9.9317e-04
Validation binary_cross_entropy = 0.129098
Epoch 9663
Loss = 4.8371e-03, PNorm = 811.4395, GNorm = 0.0818, lr_0 = 9.9317e-04
Validation binary_cross_entropy = 0.123270
Epoch 9664
Loss = 7.9337e-03, PNorm = 811.4505, GNorm = 0.6936, lr_0 = 9.9316e-04
Validation binary_cross_entropy = 0.121267
Epoch 9665
Loss = 3.6792e-02, PNorm = 811.4638, GNorm = 4.1611, lr_0 = 9.9316e-04
Validation binary_cross_entropy = 0.114499
Epoch 9666
Loss = 1.0749e-02, PNorm = 811.4794, GNorm = 1.9965, lr_0 = 9.9316e-04
Validation binary_cross_entropy = 0.113224
Epoch 9667
Loss = 4.1140e-03, PNorm = 811.4970, GNorm = 0.2660, lr_0 = 9.9316e-04
Validation binary_cross_entropy = 0.110569
Epoch 9668
Loss = 2.6511e-02, PNorm = 811.5134, GNorm = 0.9729, lr_0 = 9.9316e-04
Validation binary_cross_entropy = 0.104471
Epoch 9669
Loss = 6.7368e-03, PNorm = 811.5310, GNorm = 0.2031, lr_0 = 9.9316e-04
Loss = 1.4893e-02, PNorm = 811.5530, GNorm = 0.0445, lr_0 = 9.9316e-04
Validation binary_cross_entropy = 0.189131
Epoch 9670
Loss = 2.5413e-02, PNorm = 811.5683, GNorm = 3.4943, lr_0 = 9.9316e-04
Validation binary_cross_entropy = 0.195688
Epoch 9671
Loss = 5.9785e-02, PNorm = 811.5833, GNorm = 3.3049, lr_0 = 9.9316e-04
Validation binary_cross_entropy = 0.187950
Epoch 9672
Loss = 4.6669e-02, PNorm = 811.6079, GNorm = 0.6169, lr_0 = 9.9316e-04
Validation binary_cross_entropy = 0.153826
Epoch 9673
Loss = 5.8814e-02, PNorm = 811.6309, GNorm = 0.0887, lr_0 = 9.9316e-04
Validation binary_cross_entropy = 0.150200
Epoch 9674
Loss = 1.2677e-02, PNorm = 811.6620, GNorm = 0.4362, lr_0 = 9.9316e-04
Validation binary_cross_entropy = 0.152339
Epoch 9675
Loss = 3.0292e-02, PNorm = 811.6821, GNorm = 0.3823, lr_0 = 9.9316e-04
Validation binary_cross_entropy = 0.112926
Epoch 9676
Loss = 5.1928e-02, PNorm = 811.7225, GNorm = 5.3509, lr_0 = 9.9316e-04
Validation binary_cross_entropy = 0.161931
Epoch 9677
Loss = 2.6034e-02, PNorm = 811.7771, GNorm = 1.7452, lr_0 = 9.9315e-04
Validation binary_cross_entropy = 0.108222
Epoch 9678
Loss = 9.4118e-03, PNorm = 811.8280, GNorm = 0.7525, lr_0 = 9.9315e-04
Validation binary_cross_entropy = 0.100612
Epoch 9679
Loss = 7.0598e-03, PNorm = 811.8797, GNorm = 0.3383, lr_0 = 9.9315e-04
Loss = 5.1452e-02, PNorm = 811.9300, GNorm = 0.1667, lr_0 = 9.9315e-04
Validation binary_cross_entropy = 0.106702
Epoch 9680
Loss = 2.4865e-02, PNorm = 811.9733, GNorm = 1.3792, lr_0 = 9.9315e-04
Validation binary_cross_entropy = 0.121296
Epoch 9681
Loss = 2.5563e-02, PNorm = 812.0036, GNorm = 1.8096, lr_0 = 9.9315e-04
Validation binary_cross_entropy = 0.125481
Epoch 9682
Loss = 2.5718e-02, PNorm = 812.0314, GNorm = 1.2171, lr_0 = 9.9315e-04
Validation binary_cross_entropy = 0.118063
Epoch 9683
Loss = 2.3924e-02, PNorm = 812.0586, GNorm = 1.1055, lr_0 = 9.9315e-04
Validation binary_cross_entropy = 0.109873
Epoch 9684
Loss = 5.4603e-02, PNorm = 812.0889, GNorm = 2.3250, lr_0 = 9.9315e-04
Validation binary_cross_entropy = 0.105144
Epoch 9685
Loss = 2.9297e-02, PNorm = 812.1271, GNorm = 4.7349, lr_0 = 9.9315e-04
Validation binary_cross_entropy = 0.114677
Epoch 9686
Loss = 5.7944e-02, PNorm = 812.1693, GNorm = 28.5991, lr_0 = 9.9315e-04
Validation binary_cross_entropy = 0.087598
Epoch 9687
Loss = 4.2337e-02, PNorm = 812.2160, GNorm = 0.9236, lr_0 = 9.9315e-04
Validation binary_cross_entropy = 0.146689
Epoch 9688
Loss = 4.7300e-02, PNorm = 812.2796, GNorm = 8.0722, lr_0 = 9.9315e-04
Validation binary_cross_entropy = 0.103712
Epoch 9689
Loss = 5.6109e-02, PNorm = 812.3353, GNorm = 1.4257, lr_0 = 9.9315e-04
Loss = 1.6384e-01, PNorm = 812.4219, GNorm = 4.7438, lr_0 = 9.9315e-04
Validation binary_cross_entropy = 0.223323
Epoch 9690
Loss = 5.0758e-02, PNorm = 812.4869, GNorm = 3.1802, lr_0 = 9.9314e-04
Validation binary_cross_entropy = 0.205131
Epoch 9691
Loss = 9.9934e-02, PNorm = 812.5320, GNorm = 0.4126, lr_0 = 9.9314e-04
Validation binary_cross_entropy = 0.165276
Epoch 9692
Loss = 1.2413e-01, PNorm = 812.5838, GNorm = 0.2085, lr_0 = 9.9314e-04
Validation binary_cross_entropy = 0.124583
Epoch 9693
Loss = 5.9482e-02, PNorm = 812.6476, GNorm = 1.2381, lr_0 = 9.9314e-04
Validation binary_cross_entropy = 0.130338
Epoch 9694
Loss = 6.1349e-02, PNorm = 812.7049, GNorm = 0.7797, lr_0 = 9.9314e-04
Validation binary_cross_entropy = 0.116020
Epoch 9695
Loss = 5.7915e-02, PNorm = 812.7630, GNorm = 1.4234, lr_0 = 9.9314e-04
Validation binary_cross_entropy = 0.102096
Epoch 9696
Loss = 4.1639e-02, PNorm = 812.8394, GNorm = 5.8640, lr_0 = 9.9314e-04
Validation binary_cross_entropy = 0.148681
Epoch 9697
Loss = 1.6326e-02, PNorm = 812.9011, GNorm = 1.3217, lr_0 = 9.9314e-04
Validation binary_cross_entropy = 0.138820
Epoch 9698
Loss = 2.4476e-02, PNorm = 812.9406, GNorm = 0.3472, lr_0 = 9.9314e-04
Validation binary_cross_entropy = 0.119495
Epoch 9699
Loss = 4.4527e-02, PNorm = 812.9746, GNorm = 2.2777, lr_0 = 9.9314e-04
Loss = 3.6807e-02, PNorm = 813.0262, GNorm = 0.3583, lr_0 = 9.9314e-04
Validation binary_cross_entropy = 0.151926
Epoch 9700
Loss = 2.5513e-02, PNorm = 813.0688, GNorm = 1.4109, lr_0 = 9.9314e-04
Validation binary_cross_entropy = 0.166174
Epoch 9701
Loss = 3.5488e-02, PNorm = 813.0947, GNorm = 0.3438, lr_0 = 9.9314e-04
Validation binary_cross_entropy = 0.163271
Epoch 9702
Loss = 7.5151e-02, PNorm = 813.1151, GNorm = 3.4649, lr_0 = 9.9314e-04
Validation binary_cross_entropy = 0.156455
Epoch 9703
Loss = 1.7376e-01, PNorm = 813.1455, GNorm = 0.5351, lr_0 = 9.9314e-04
Validation binary_cross_entropy = 0.140997
Epoch 9704
Loss = 1.2218e-01, PNorm = 813.2144, GNorm = 6.5348, lr_0 = 9.9313e-04
Validation binary_cross_entropy = 0.257530
Epoch 9705
Loss = 8.3122e-02, PNorm = 813.2740, GNorm = 2.0407, lr_0 = 9.9313e-04
Validation binary_cross_entropy = 0.170297
Epoch 9706
Loss = 3.7023e-02, PNorm = 813.3223, GNorm = 1.6098, lr_0 = 9.9313e-04
Validation binary_cross_entropy = 0.149906
Epoch 9707
Loss = 6.4137e-02, PNorm = 813.3637, GNorm = 2.9183, lr_0 = 9.9313e-04
Validation binary_cross_entropy = 0.139091
Epoch 9708
Loss = 1.1668e-02, PNorm = 813.4056, GNorm = 1.2196, lr_0 = 9.9313e-04
Validation binary_cross_entropy = 0.164110
Epoch 9709
Loss = 4.2295e-03, PNorm = 813.4375, GNorm = 0.1033, lr_0 = 9.9313e-04
Loss = 3.1470e-02, PNorm = 813.4683, GNorm = 0.7543, lr_0 = 9.9313e-04
Validation binary_cross_entropy = 0.215770
Epoch 9710
Loss = 7.1468e-02, PNorm = 813.4999, GNorm = 0.5502, lr_0 = 9.9313e-04
Validation binary_cross_entropy = 0.216092
Epoch 9711
Loss = 2.0844e-02, PNorm = 813.5246, GNorm = 1.1884, lr_0 = 9.9313e-04
Validation binary_cross_entropy = 0.199100
Epoch 9712
Loss = 6.3153e-03, PNorm = 813.5505, GNorm = 0.0673, lr_0 = 9.9313e-04
Validation binary_cross_entropy = 0.239216
Epoch 9713
Loss = 8.5256e-02, PNorm = 813.5745, GNorm = 0.6019, lr_0 = 9.9313e-04
Validation binary_cross_entropy = 0.173561
Epoch 9714
Loss = 7.4532e-03, PNorm = 813.6027, GNorm = 0.3155, lr_0 = 9.9313e-04
Validation binary_cross_entropy = 0.181284
Epoch 9715
Loss = 1.6476e-02, PNorm = 813.6296, GNorm = 0.7856, lr_0 = 9.9313e-04
Validation binary_cross_entropy = 0.186833
Epoch 9716
Loss = 1.2848e-01, PNorm = 813.6497, GNorm = 5.6553, lr_0 = 9.9313e-04
Validation binary_cross_entropy = 0.144667
Epoch 9717
Loss = 1.1901e-02, PNorm = 813.6661, GNorm = 1.4381, lr_0 = 9.9312e-04
Validation binary_cross_entropy = 0.122630
Epoch 9718
Loss = 1.7677e-01, PNorm = 813.6992, GNorm = 7.5764, lr_0 = 9.9312e-04
Validation binary_cross_entropy = 0.130584
Epoch 9719
Loss = 5.1122e-02, PNorm = 813.7738, GNorm = 1.0819, lr_0 = 9.9312e-04
Loss = 2.6776e-02, PNorm = 813.8245, GNorm = 0.0855, lr_0 = 9.9312e-04
Validation binary_cross_entropy = 0.133507
Epoch 9720
Loss = 1.0553e-01, PNorm = 813.8695, GNorm = 2.6128, lr_0 = 9.9312e-04
Validation binary_cross_entropy = 0.114011
Epoch 9721
Loss = 5.9502e-02, PNorm = 813.9340, GNorm = 0.5588, lr_0 = 9.9312e-04
Validation binary_cross_entropy = 0.105933
Epoch 9722
Loss = 1.1203e-01, PNorm = 813.9931, GNorm = 1.9405, lr_0 = 9.9312e-04
Validation binary_cross_entropy = 0.109346
Epoch 9723
Loss = 6.7988e-02, PNorm = 814.0683, GNorm = 8.6449, lr_0 = 9.9312e-04
Validation binary_cross_entropy = 0.150695
Epoch 9724
Loss = 5.6549e-02, PNorm = 814.1319, GNorm = 5.3929, lr_0 = 9.9312e-04
Validation binary_cross_entropy = 0.144037
Epoch 9725
Loss = 3.0441e-02, PNorm = 814.1668, GNorm = 0.8580, lr_0 = 9.9312e-04
Validation binary_cross_entropy = 0.134013
Epoch 9726
Loss = 6.7574e-02, PNorm = 814.2010, GNorm = 5.3700, lr_0 = 9.9312e-04
Validation binary_cross_entropy = 0.144928
Epoch 9727
Loss = 2.0152e-02, PNorm = 814.2559, GNorm = 3.6596, lr_0 = 9.9312e-04
Validation binary_cross_entropy = 0.172236
Epoch 9728
Loss = 3.1870e-03, PNorm = 814.3033, GNorm = 0.2968, lr_0 = 9.9312e-04
Validation binary_cross_entropy = 0.160866
Epoch 9729
Loss = 5.4493e-03, PNorm = 814.3383, GNorm = 0.6670, lr_0 = 9.9312e-04
Loss = 2.5993e-02, PNorm = 814.3764, GNorm = 3.2451, lr_0 = 9.9312e-04
Validation binary_cross_entropy = 0.132436
Epoch 9730
Loss = 2.7142e-02, PNorm = 814.4254, GNorm = 0.3134, lr_0 = 9.9311e-04
Validation binary_cross_entropy = 0.163272
Epoch 9731
Loss = 3.8350e-02, PNorm = 814.4640, GNorm = 2.5508, lr_0 = 9.9311e-04
Validation binary_cross_entropy = 0.167781
Epoch 9732
Loss = 1.2722e-02, PNorm = 814.4956, GNorm = 0.6113, lr_0 = 9.9311e-04
Validation binary_cross_entropy = 0.177761
Epoch 9733
Loss = 3.7271e-02, PNorm = 814.5300, GNorm = 0.7843, lr_0 = 9.9311e-04
Validation binary_cross_entropy = 0.166226
Epoch 9734
Loss = 9.0295e-02, PNorm = 814.5752, GNorm = 0.3027, lr_0 = 9.9311e-04
Validation binary_cross_entropy = 0.141181
Epoch 9735
Loss = 2.5717e-02, PNorm = 814.6294, GNorm = 1.1606, lr_0 = 9.9311e-04
Validation binary_cross_entropy = 0.131153
Epoch 9736
Loss = 1.4874e-01, PNorm = 814.6879, GNorm = 0.4976, lr_0 = 9.9311e-04
Validation binary_cross_entropy = 0.100271
Epoch 9737
Loss = 1.5648e-01, PNorm = 814.7489, GNorm = 1.0451, lr_0 = 9.9311e-04
Validation binary_cross_entropy = 0.089782
Epoch 9738
Loss = 8.5455e-02, PNorm = 814.8075, GNorm = 0.9948, lr_0 = 9.9311e-04
Validation binary_cross_entropy = 0.114691
Epoch 9739
Loss = 1.0877e-02, PNorm = 814.8613, GNorm = 0.4343, lr_0 = 9.9311e-04
Loss = 6.3335e-02, PNorm = 814.9003, GNorm = 0.9977, lr_0 = 9.9311e-04
Validation binary_cross_entropy = 0.116985
Epoch 9740
Loss = 3.4485e-02, PNorm = 814.9318, GNorm = 2.2954, lr_0 = 9.9311e-04
Validation binary_cross_entropy = 0.113743
Epoch 9741
Loss = 3.9491e-02, PNorm = 814.9649, GNorm = 0.7107, lr_0 = 9.9311e-04
Validation binary_cross_entropy = 0.103860
Epoch 9742
Loss = 2.7405e-02, PNorm = 815.0011, GNorm = 1.5750, lr_0 = 9.9311e-04
Validation binary_cross_entropy = 0.108210
Epoch 9743
Loss = 1.2326e-02, PNorm = 815.0320, GNorm = 0.3508, lr_0 = 9.9311e-04
Validation binary_cross_entropy = 0.106167
Epoch 9744
Loss = 4.7270e-02, PNorm = 815.0577, GNorm = 1.3008, lr_0 = 9.9310e-04
Validation binary_cross_entropy = 0.101102
Epoch 9745
Loss = 2.1808e-02, PNorm = 815.0902, GNorm = 0.1851, lr_0 = 9.9310e-04
Validation binary_cross_entropy = 0.101213
Epoch 9746
Loss = 9.6241e-02, PNorm = 815.1320, GNorm = 0.3633, lr_0 = 9.9310e-04
Validation binary_cross_entropy = 0.108127
Epoch 9747
Loss = 6.0250e-02, PNorm = 815.1742, GNorm = 0.4089, lr_0 = 9.9310e-04
Validation binary_cross_entropy = 0.081960
Epoch 9748
Loss = 5.5327e-02, PNorm = 815.2112, GNorm = 1.0981, lr_0 = 9.9310e-04
Validation binary_cross_entropy = 0.079832
Epoch 9749
Loss = 8.8551e-02, PNorm = 815.2617, GNorm = 7.0264, lr_0 = 9.9310e-04
Loss = 6.3967e-02, PNorm = 815.3106, GNorm = 3.1335, lr_0 = 9.9310e-04
Validation binary_cross_entropy = 0.091361
Epoch 9750
Loss = 3.4799e-02, PNorm = 815.3532, GNorm = 0.3357, lr_0 = 9.9310e-04
Validation binary_cross_entropy = 0.094765
Epoch 9751
Loss = 3.6193e-02, PNorm = 815.3820, GNorm = 6.2393, lr_0 = 9.9310e-04
Validation binary_cross_entropy = 0.091141
Epoch 9752
Loss = 3.0654e-02, PNorm = 815.4262, GNorm = 0.0911, lr_0 = 9.9310e-04
Validation binary_cross_entropy = 0.100056
Epoch 9753
Loss = 1.8380e-02, PNorm = 815.4704, GNorm = 0.0898, lr_0 = 9.9310e-04
Validation binary_cross_entropy = 0.099122
Epoch 9754
Loss = 5.1153e-02, PNorm = 815.5001, GNorm = 0.4735, lr_0 = 9.9310e-04
Validation binary_cross_entropy = 0.081337
Epoch 9755
Loss = 3.8501e-02, PNorm = 815.5602, GNorm = 8.4791, lr_0 = 9.9310e-04
Validation binary_cross_entropy = 0.083789
Epoch 9756
Loss = 1.9134e-02, PNorm = 815.6267, GNorm = 1.1639, lr_0 = 9.9310e-04
Validation binary_cross_entropy = 0.092657
Epoch 9757
Loss = 3.0806e-02, PNorm = 815.6730, GNorm = 0.9366, lr_0 = 9.9309e-04
Validation binary_cross_entropy = 0.092978
Epoch 9758
Loss = 6.7224e-03, PNorm = 815.7107, GNorm = 0.2126, lr_0 = 9.9309e-04
Validation binary_cross_entropy = 0.100404
Epoch 9759
Loss = 5.0369e-01, PNorm = 815.7557, GNorm = 31.8742, lr_0 = 9.9309e-04
Loss = 3.1622e-02, PNorm = 815.8038, GNorm = 0.6525, lr_0 = 9.9309e-04
Validation binary_cross_entropy = 0.124208
Epoch 9760
Loss = 2.5089e-02, PNorm = 815.8616, GNorm = 0.5405, lr_0 = 9.9309e-04
Validation binary_cross_entropy = 0.150532
Epoch 9761
Loss = 3.1373e-02, PNorm = 815.9075, GNorm = 1.8629, lr_0 = 9.9309e-04
Validation binary_cross_entropy = 0.191522
Epoch 9762
Loss = 2.0185e-02, PNorm = 815.9504, GNorm = 0.9641, lr_0 = 9.9309e-04
Validation binary_cross_entropy = 0.189334
Epoch 9763
Loss = 2.3742e-02, PNorm = 815.9730, GNorm = 0.7301, lr_0 = 9.9309e-04
Validation binary_cross_entropy = 0.152274
Epoch 9764
Loss = 5.1705e-02, PNorm = 815.9944, GNorm = 0.1348, lr_0 = 9.9309e-04
Validation binary_cross_entropy = 0.147402
Epoch 9765
Loss = 6.8438e-02, PNorm = 816.0300, GNorm = 11.2735, lr_0 = 9.9309e-04
Validation binary_cross_entropy = 0.157742
Epoch 9766
Loss = 7.3351e-03, PNorm = 816.0651, GNorm = 1.3917, lr_0 = 9.9309e-04
Validation binary_cross_entropy = 0.142370
Epoch 9767
Loss = 7.0932e-02, PNorm = 816.0938, GNorm = 4.7419, lr_0 = 9.9309e-04
Validation binary_cross_entropy = 0.148689
Epoch 9768
Loss = 1.0646e-02, PNorm = 816.1276, GNorm = 0.1981, lr_0 = 9.9309e-04
Validation binary_cross_entropy = 0.152113
Epoch 9769
Loss = 4.0600e-04, PNorm = 816.1618, GNorm = 0.0368, lr_0 = 9.9309e-04
Loss = 1.9004e-02, PNorm = 816.1910, GNorm = 0.2419, lr_0 = 9.9309e-04
Validation binary_cross_entropy = 0.160469
Epoch 9770
Loss = 5.6811e-02, PNorm = 816.2189, GNorm = 0.3556, lr_0 = 9.9308e-04
Validation binary_cross_entropy = 0.150386
Epoch 9771
Loss = 4.7678e-02, PNorm = 816.2522, GNorm = 1.9881, lr_0 = 9.9308e-04
Validation binary_cross_entropy = 0.187094
Epoch 9772
Loss = 4.8420e-02, PNorm = 816.2777, GNorm = 1.6953, lr_0 = 9.9308e-04
Validation binary_cross_entropy = 0.139525
Epoch 9773
Loss = 1.6432e-01, PNorm = 816.2972, GNorm = 0.4268, lr_0 = 9.9308e-04
Validation binary_cross_entropy = 0.099723
Epoch 9774
Loss = 8.0077e-02, PNorm = 816.3643, GNorm = 8.3026, lr_0 = 9.9308e-04
Validation binary_cross_entropy = 0.110624
Epoch 9775
Loss = 2.9031e-02, PNorm = 816.4209, GNorm = 0.7478, lr_0 = 9.9308e-04
Validation binary_cross_entropy = 0.111637
Epoch 9776
Loss = 5.4883e-02, PNorm = 816.4610, GNorm = 3.7533, lr_0 = 9.9308e-04
Validation binary_cross_entropy = 0.141696
Epoch 9777
Loss = 1.3615e-01, PNorm = 816.5073, GNorm = 1.6470, lr_0 = 9.9308e-04
Validation binary_cross_entropy = 0.113847
Epoch 9778
Loss = 1.0571e-02, PNorm = 816.5403, GNorm = 0.8921, lr_0 = 9.9308e-04
Validation binary_cross_entropy = 0.108719
Epoch 9779
Loss = 1.9019e-02, PNorm = 816.5771, GNorm = 0.9518, lr_0 = 9.9308e-04
Loss = 2.9404e-02, PNorm = 816.6175, GNorm = 0.2903, lr_0 = 9.9308e-04
Validation binary_cross_entropy = 0.116262
Epoch 9780
Loss = 4.6998e-02, PNorm = 816.6692, GNorm = 0.1508, lr_0 = 9.9308e-04
Validation binary_cross_entropy = 0.150837
Epoch 9781
Loss = 5.8601e-02, PNorm = 816.7095, GNorm = 0.3429, lr_0 = 9.9308e-04
Validation binary_cross_entropy = 0.135430
Epoch 9782
Loss = 1.7950e-02, PNorm = 816.7351, GNorm = 0.1875, lr_0 = 9.9308e-04
Validation binary_cross_entropy = 0.122091
Epoch 9783
Loss = 1.6220e-02, PNorm = 816.7646, GNorm = 0.1988, lr_0 = 9.9307e-04
Validation binary_cross_entropy = 0.126865
Epoch 9784
Loss = 3.6951e-02, PNorm = 816.7906, GNorm = 2.1574, lr_0 = 9.9307e-04
Validation binary_cross_entropy = 0.137048
Epoch 9785
Loss = 5.3120e-02, PNorm = 816.8207, GNorm = 0.1464, lr_0 = 9.9307e-04
Validation binary_cross_entropy = 0.128875
Epoch 9786
Loss = 1.5384e-02, PNorm = 816.8469, GNorm = 1.2255, lr_0 = 9.9307e-04
Validation binary_cross_entropy = 0.121878
Epoch 9787
Loss = 6.8438e-03, PNorm = 816.8732, GNorm = 0.1912, lr_0 = 9.9307e-04
Validation binary_cross_entropy = 0.118759
Epoch 9788
Loss = 1.1152e-02, PNorm = 816.8950, GNorm = 0.0590, lr_0 = 9.9307e-04
Validation binary_cross_entropy = 0.123857
Epoch 9789
Loss = 1.2208e-01, PNorm = 816.9326, GNorm = 8.0705, lr_0 = 9.9307e-04
Loss = 2.7681e-02, PNorm = 816.9801, GNorm = 1.6706, lr_0 = 9.9307e-04
Validation binary_cross_entropy = 0.153567
Epoch 9790
Loss = 6.8828e-02, PNorm = 817.0129, GNorm = 4.6402, lr_0 = 9.9307e-04
Validation binary_cross_entropy = 0.111997
Epoch 9791
Loss = 4.4390e-02, PNorm = 817.0419, GNorm = 0.2512, lr_0 = 9.9307e-04
Validation binary_cross_entropy = 0.096233
Epoch 9792
Loss = 5.4772e-02, PNorm = 817.0859, GNorm = 3.4558, lr_0 = 9.9307e-04
Validation binary_cross_entropy = 0.136512
Epoch 9793
Loss = 6.7741e-02, PNorm = 817.1139, GNorm = 6.5002, lr_0 = 9.9307e-04
Validation binary_cross_entropy = 0.096501
Epoch 9794
Loss = 1.0041e-01, PNorm = 817.1705, GNorm = 0.3697, lr_0 = 9.9307e-04
Validation binary_cross_entropy = 0.230251
Epoch 9795
Loss = 1.0044e-01, PNorm = 817.2304, GNorm = 8.2174, lr_0 = 9.9307e-04
Validation binary_cross_entropy = 0.146769
Epoch 9796
Loss = 9.0819e-02, PNorm = 817.2774, GNorm = 0.1753, lr_0 = 9.9307e-04
Validation binary_cross_entropy = 0.119402
Epoch 9797
Loss = 1.7764e-02, PNorm = 817.3500, GNorm = 0.5960, lr_0 = 9.9306e-04
Validation binary_cross_entropy = 0.124724
Epoch 9798
Loss = 4.1865e-03, PNorm = 817.4139, GNorm = 0.1768, lr_0 = 9.9306e-04
Validation binary_cross_entropy = 0.119328
Epoch 9799
Loss = 1.2028e-02, PNorm = 817.4722, GNorm = 0.4865, lr_0 = 9.9306e-04
Loss = 6.2693e-02, PNorm = 817.5274, GNorm = 2.8968, lr_0 = 9.9306e-04
Validation binary_cross_entropy = 0.128824
Epoch 9800
Loss = 2.2451e-02, PNorm = 817.5724, GNorm = 3.0155, lr_0 = 9.9306e-04
Validation binary_cross_entropy = 0.149704
Epoch 9801
Loss = 1.7438e-02, PNorm = 817.6031, GNorm = 0.2234, lr_0 = 9.9306e-04
Validation binary_cross_entropy = 0.131294
Epoch 9802
Loss = 4.1143e-02, PNorm = 817.6287, GNorm = 0.5703, lr_0 = 9.9306e-04
Validation binary_cross_entropy = 0.129535
Epoch 9803
Loss = 4.5550e-02, PNorm = 817.6618, GNorm = 8.6478, lr_0 = 9.9306e-04
Validation binary_cross_entropy = 0.165568
Epoch 9804
Loss = 3.9599e-02, PNorm = 817.6846, GNorm = 5.3446, lr_0 = 9.9306e-04
Validation binary_cross_entropy = 0.151972
Epoch 9805
Loss = 4.9001e-02, PNorm = 817.7010, GNorm = 0.8397, lr_0 = 9.9306e-04
Validation binary_cross_entropy = 0.130697
Epoch 9806
Loss = 7.3293e-02, PNorm = 817.7206, GNorm = 2.9907, lr_0 = 9.9306e-04
Validation binary_cross_entropy = 0.119128
Epoch 9807
Loss = 2.0198e-02, PNorm = 817.7564, GNorm = 2.7991, lr_0 = 9.9306e-04
Validation binary_cross_entropy = 0.186991
Epoch 9808
Loss = 4.4271e-03, PNorm = 817.8051, GNorm = 1.4419, lr_0 = 9.9306e-04
Validation binary_cross_entropy = 0.205696
Epoch 9809
Loss = 7.3479e-02, PNorm = 817.8366, GNorm = 2.7565, lr_0 = 9.9306e-04
Loss = 1.7289e-02, PNorm = 817.8595, GNorm = 0.0368, lr_0 = 9.9306e-04
Validation binary_cross_entropy = 0.179135
Epoch 9810
Loss = 1.7608e-01, PNorm = 817.8819, GNorm = 9.5204, lr_0 = 9.9305e-04
Validation binary_cross_entropy = 0.136668
Epoch 9811
Loss = 2.9547e-02, PNorm = 817.9030, GNorm = 1.7155, lr_0 = 9.9305e-04
Validation binary_cross_entropy = 0.092439
Epoch 9812
Loss = 4.9400e-02, PNorm = 817.9718, GNorm = 1.8273, lr_0 = 9.9305e-04
Validation binary_cross_entropy = 0.102023
Epoch 9813
Loss = 1.3783e-02, PNorm = 818.0519, GNorm = 0.1888, lr_0 = 9.9305e-04
Validation binary_cross_entropy = 0.115464
Epoch 9814
Loss = 2.2197e-02, PNorm = 818.1005, GNorm = 1.1198, lr_0 = 9.9305e-04
Validation binary_cross_entropy = 0.121391
Epoch 9815
Loss = 7.0656e-03, PNorm = 818.1309, GNorm = 1.5174, lr_0 = 9.9305e-04
Validation binary_cross_entropy = 0.128172
Epoch 9816
Loss = 2.4708e-02, PNorm = 818.1566, GNorm = 0.2068, lr_0 = 9.9305e-04
Validation binary_cross_entropy = 0.149595
Epoch 9817
Loss = 3.7779e-02, PNorm = 818.1972, GNorm = 2.8316, lr_0 = 9.9305e-04
Validation binary_cross_entropy = 0.150079
Epoch 9818
Loss = 9.0090e-02, PNorm = 818.2267, GNorm = 1.2063, lr_0 = 9.9305e-04
Validation binary_cross_entropy = 0.098207
Epoch 9819
Loss = 3.2612e-02, PNorm = 818.2545, GNorm = 1.6133, lr_0 = 9.9305e-04
Loss = 3.1221e-02, PNorm = 818.3135, GNorm = 0.2438, lr_0 = 9.9305e-04
Validation binary_cross_entropy = 0.085968
Epoch 9820
Loss = 3.0801e-02, PNorm = 818.3697, GNorm = 0.2688, lr_0 = 9.9305e-04
Validation binary_cross_entropy = 0.099688
Epoch 9821
Loss = 3.1618e-02, PNorm = 818.4135, GNorm = 0.4726, lr_0 = 9.9305e-04
Validation binary_cross_entropy = 0.103164
Epoch 9822
Loss = 7.2802e-02, PNorm = 818.4520, GNorm = 1.5074, lr_0 = 9.9305e-04
Validation binary_cross_entropy = 0.107332
Epoch 9823
Loss = 2.5339e-02, PNorm = 818.4976, GNorm = 0.2426, lr_0 = 9.9304e-04
Validation binary_cross_entropy = 0.113435
Epoch 9824
Loss = 3.0200e-02, PNorm = 818.5357, GNorm = 0.2125, lr_0 = 9.9304e-04
Validation binary_cross_entropy = 0.098597
Epoch 9825
Loss = 2.1407e-02, PNorm = 818.5658, GNorm = 2.0810, lr_0 = 9.9304e-04
Validation binary_cross_entropy = 0.102112
Epoch 9826
Loss = 9.6228e-03, PNorm = 818.6198, GNorm = 0.5253, lr_0 = 9.9304e-04
Validation binary_cross_entropy = 0.110673
Epoch 9827
Loss = 1.5195e-02, PNorm = 818.6545, GNorm = 0.1570, lr_0 = 9.9304e-04
Validation binary_cross_entropy = 0.109619
Epoch 9828
Loss = 3.6348e-03, PNorm = 818.6824, GNorm = 0.2075, lr_0 = 9.9304e-04
Validation binary_cross_entropy = 0.118884
Epoch 9829
Loss = 1.6783e-02, PNorm = 818.7131, GNorm = 1.3589, lr_0 = 9.9304e-04
Loss = 7.5110e-02, PNorm = 818.7467, GNorm = 0.1019, lr_0 = 9.9304e-04
Validation binary_cross_entropy = 0.111096
Epoch 9830
Loss = 1.6622e-02, PNorm = 818.7992, GNorm = 0.8187, lr_0 = 9.9304e-04
Validation binary_cross_entropy = 0.106499
Epoch 9831
Loss = 4.3491e-02, PNorm = 818.8407, GNorm = 1.8350, lr_0 = 9.9304e-04
Validation binary_cross_entropy = 0.093033
Epoch 9832
Loss = 5.4847e-02, PNorm = 818.8820, GNorm = 5.2276, lr_0 = 9.9304e-04
Validation binary_cross_entropy = 0.082494
Epoch 9833
Loss = 2.4164e-02, PNorm = 818.9150, GNorm = 10.6433, lr_0 = 9.9304e-04
Validation binary_cross_entropy = 0.080281
Epoch 9834
Loss = 1.1487e-02, PNorm = 818.9481, GNorm = 0.4729, lr_0 = 9.9304e-04
Validation binary_cross_entropy = 0.088997
Epoch 9835
Loss = 6.9162e-02, PNorm = 818.9847, GNorm = 0.1201, lr_0 = 9.9304e-04
Validation binary_cross_entropy = 0.112810
Epoch 9836
Loss = 4.4105e-02, PNorm = 819.0182, GNorm = 0.2254, lr_0 = 9.9304e-04
Validation binary_cross_entropy = 0.097393
Epoch 9837
Loss = 1.1739e-01, PNorm = 819.0341, GNorm = 2.0995, lr_0 = 9.9303e-04
Validation binary_cross_entropy = 0.085429
Epoch 9838
Loss = 9.4315e-03, PNorm = 819.0559, GNorm = 0.7364, lr_0 = 9.9303e-04
Validation binary_cross_entropy = 0.092459
Epoch 9839
Loss = 1.8759e-02, PNorm = 819.0922, GNorm = 1.2832, lr_0 = 9.9303e-04
Loss = 1.0418e-02, PNorm = 819.1287, GNorm = 0.8306, lr_0 = 9.9303e-04
Validation binary_cross_entropy = 0.120494
Epoch 9840
Loss = 1.7980e-02, PNorm = 819.1565, GNorm = 3.8973, lr_0 = 9.9303e-04
Validation binary_cross_entropy = 0.129194
Epoch 9841
Loss = 3.5451e-02, PNorm = 819.1745, GNorm = 0.1856, lr_0 = 9.9303e-04
Validation binary_cross_entropy = 0.106296
Epoch 9842
Loss = 2.7502e-02, PNorm = 819.1981, GNorm = 0.8935, lr_0 = 9.9303e-04
Validation binary_cross_entropy = 0.110455
Epoch 9843
Loss = 2.5996e-02, PNorm = 819.2217, GNorm = 0.3635, lr_0 = 9.9303e-04
Validation binary_cross_entropy = 0.110892
Epoch 9844
Loss = 2.0500e-02, PNorm = 819.2429, GNorm = 1.6050, lr_0 = 9.9303e-04
Validation binary_cross_entropy = 0.122696
Epoch 9845
Loss = 1.0210e-02, PNorm = 819.2684, GNorm = 1.5263, lr_0 = 9.9303e-04
Validation binary_cross_entropy = 0.134629
Epoch 9846
Loss = 1.8106e-02, PNorm = 819.2890, GNorm = 0.2308, lr_0 = 9.9303e-04
Validation binary_cross_entropy = 0.158366
Epoch 9847
Loss = 1.2223e-02, PNorm = 819.3087, GNorm = 2.4236, lr_0 = 9.9303e-04
Validation binary_cross_entropy = 0.139775
Epoch 9848
Loss = 8.7055e-02, PNorm = 819.3226, GNorm = 3.9743, lr_0 = 9.9303e-04
Validation binary_cross_entropy = 0.104643
Epoch 9849
Loss = 9.3177e-02, PNorm = 819.3499, GNorm = 5.4961, lr_0 = 9.9303e-04
Loss = 1.1554e-02, PNorm = 819.4018, GNorm = 0.2141, lr_0 = 9.9303e-04
Validation binary_cross_entropy = 0.144815
Epoch 9850
Loss = 4.1933e-02, PNorm = 819.4327, GNorm = 2.6740, lr_0 = 9.9302e-04
Validation binary_cross_entropy = 0.135956
Epoch 9851
Loss = 1.3889e-02, PNorm = 819.4613, GNorm = 0.0582, lr_0 = 9.9302e-04
Validation binary_cross_entropy = 0.129493
Epoch 9852
Loss = 1.4694e-02, PNorm = 819.4839, GNorm = 0.1653, lr_0 = 9.9302e-04
Validation binary_cross_entropy = 0.122811
Epoch 9853
Loss = 2.6463e-02, PNorm = 819.5025, GNorm = 2.1897, lr_0 = 9.9302e-04
Validation binary_cross_entropy = 0.124877
Epoch 9854
Loss = 2.9276e-03, PNorm = 819.5296, GNorm = 0.3519, lr_0 = 9.9302e-04
Validation binary_cross_entropy = 0.133664
Epoch 9855
Loss = 1.2051e-02, PNorm = 819.5510, GNorm = 0.0128, lr_0 = 9.9302e-04
Validation binary_cross_entropy = 0.141506
Epoch 9856
Loss = 8.9626e-02, PNorm = 819.5649, GNorm = 0.1787, lr_0 = 9.9302e-04
Validation binary_cross_entropy = 0.113375
Epoch 9857
Loss = 2.5348e-02, PNorm = 819.5932, GNorm = 0.9912, lr_0 = 9.9302e-04
Validation binary_cross_entropy = 0.104712
Epoch 9858
Loss = 2.9071e-02, PNorm = 819.6181, GNorm = 1.8180, lr_0 = 9.9302e-04
Validation binary_cross_entropy = 0.102376
Epoch 9859
Loss = 1.6899e-02, PNorm = 819.6548, GNorm = 1.4744, lr_0 = 9.9302e-04
Loss = 4.9321e-02, PNorm = 819.6962, GNorm = 3.1497, lr_0 = 9.9302e-04
Validation binary_cross_entropy = 0.113242
Epoch 9860
Loss = 4.5282e-02, PNorm = 819.7245, GNorm = 0.6077, lr_0 = 9.9302e-04
Validation binary_cross_entropy = 0.105963
Epoch 9861
Loss = 2.8057e-02, PNorm = 819.7570, GNorm = 4.4455, lr_0 = 9.9302e-04
Validation binary_cross_entropy = 0.112295
Epoch 9862
Loss = 5.9787e-02, PNorm = 819.7950, GNorm = 0.2414, lr_0 = 9.9302e-04
Validation binary_cross_entropy = 0.122578
Epoch 9863
Loss = 8.0763e-02, PNorm = 819.8253, GNorm = 0.0984, lr_0 = 9.9301e-04
Validation binary_cross_entropy = 0.119582
Epoch 9864
Loss = 5.0378e-03, PNorm = 819.8514, GNorm = 0.2695, lr_0 = 9.9301e-04
Validation binary_cross_entropy = 0.123282
Epoch 9865
Loss = 2.4995e-02, PNorm = 819.8754, GNorm = 0.1185, lr_0 = 9.9301e-04
Validation binary_cross_entropy = 0.133757
Epoch 9866
Loss = 2.4741e-02, PNorm = 819.8964, GNorm = 2.1418, lr_0 = 9.9301e-04
Validation binary_cross_entropy = 0.123552
Epoch 9867
Loss = 1.7193e-02, PNorm = 819.9117, GNorm = 0.0224, lr_0 = 9.9301e-04
Validation binary_cross_entropy = 0.120277
Epoch 9868
Loss = 4.8463e-03, PNorm = 819.9319, GNorm = 0.4732, lr_0 = 9.9301e-04
Validation binary_cross_entropy = 0.148804
Epoch 9869
Loss = 7.0502e-04, PNorm = 819.9644, GNorm = 0.0494, lr_0 = 9.9301e-04
Loss = 4.4826e-02, PNorm = 819.9805, GNorm = 1.0443, lr_0 = 9.9301e-04
Validation binary_cross_entropy = 0.130896
Epoch 9870
Loss = 2.2639e-02, PNorm = 819.9927, GNorm = 1.0313, lr_0 = 9.9301e-04
Validation binary_cross_entropy = 0.106620
Epoch 9871
Loss = 1.7366e-02, PNorm = 820.0222, GNorm = 0.6006, lr_0 = 9.9301e-04
Validation binary_cross_entropy = 0.109349
Epoch 9872
Loss = 1.1803e-02, PNorm = 820.0640, GNorm = 0.2536, lr_0 = 9.9301e-04
Validation binary_cross_entropy = 0.115683
Epoch 9873
Loss = 1.2616e-02, PNorm = 820.1010, GNorm = 1.9622, lr_0 = 9.9301e-04
Validation binary_cross_entropy = 0.126700
Epoch 9874
Loss = 5.1326e-03, PNorm = 820.1259, GNorm = 0.0877, lr_0 = 9.9301e-04
Validation binary_cross_entropy = 0.140299
Epoch 9875
Loss = 3.5000e-03, PNorm = 820.1476, GNorm = 0.1768, lr_0 = 9.9301e-04
Validation binary_cross_entropy = 0.158974
Epoch 9876
Loss = 2.0148e-02, PNorm = 820.1670, GNorm = 0.0625, lr_0 = 9.9301e-04
Validation binary_cross_entropy = 0.127557
Epoch 9877
Loss = 1.1675e-02, PNorm = 820.1832, GNorm = 2.0755, lr_0 = 9.9300e-04
Validation binary_cross_entropy = 0.131842
Epoch 9878
Loss = 1.5973e-02, PNorm = 820.2050, GNorm = 2.2049, lr_0 = 9.9300e-04
Validation binary_cross_entropy = 0.123128
Epoch 9879
Loss = 1.3848e-03, PNorm = 820.2274, GNorm = 0.1066, lr_0 = 9.9300e-04
Loss = 2.4138e-02, PNorm = 820.2574, GNorm = 0.9283, lr_0 = 9.9300e-04
Validation binary_cross_entropy = 0.131469
Epoch 9880
Loss = 7.2814e-03, PNorm = 820.2868, GNorm = 1.5260, lr_0 = 9.9300e-04
Validation binary_cross_entropy = 0.143313
Epoch 9881
Loss = 4.7534e-03, PNorm = 820.3133, GNorm = 0.0962, lr_0 = 9.9300e-04
Validation binary_cross_entropy = 0.160170
Epoch 9882
Loss = 8.5831e-03, PNorm = 820.3309, GNorm = 1.0870, lr_0 = 9.9300e-04
Validation binary_cross_entropy = 0.176273
Epoch 9883
Loss = 2.5471e-02, PNorm = 820.3528, GNorm = 0.0782, lr_0 = 9.9300e-04
Validation binary_cross_entropy = 0.178081
Epoch 9884
Loss = 1.8137e-02, PNorm = 820.3699, GNorm = 2.5340, lr_0 = 9.9300e-04
Validation binary_cross_entropy = 0.165402
Epoch 9885
Loss = 1.0892e-02, PNorm = 820.3833, GNorm = 0.1078, lr_0 = 9.9300e-04
Validation binary_cross_entropy = 0.147483
Epoch 9886
Loss = 1.6454e-02, PNorm = 820.3951, GNorm = 0.1772, lr_0 = 9.9300e-04
Validation binary_cross_entropy = 0.135974
Epoch 9887
Loss = 8.1326e-02, PNorm = 820.4134, GNorm = 7.0959, lr_0 = 9.9300e-04
Validation binary_cross_entropy = 0.158465
Epoch 9888
Loss = 3.0922e-02, PNorm = 820.4390, GNorm = 3.3640, lr_0 = 9.9300e-04
Validation binary_cross_entropy = 0.159247
Epoch 9889
Loss = 1.9479e-03, PNorm = 820.4621, GNorm = 0.1405, lr_0 = 9.9300e-04
Loss = 4.5060e-02, PNorm = 820.4924, GNorm = 11.6981, lr_0 = 9.9299e-04
Validation binary_cross_entropy = 0.163864
Epoch 9890
Loss = 7.4421e-02, PNorm = 820.5288, GNorm = 10.0255, lr_0 = 9.9299e-04
Validation binary_cross_entropy = 0.125456
Epoch 9891
Loss = 8.3554e-02, PNorm = 820.5687, GNorm = 5.7258, lr_0 = 9.9299e-04
Validation binary_cross_entropy = 0.097784
Epoch 9892
Loss = 1.0767e-01, PNorm = 820.6528, GNorm = 1.5869, lr_0 = 9.9299e-04
Validation binary_cross_entropy = 0.113320
Epoch 9893
Loss = 2.1253e-02, PNorm = 820.7097, GNorm = 1.3717, lr_0 = 9.9299e-04
Validation binary_cross_entropy = 0.085622
Epoch 9894
Loss = 6.2614e-02, PNorm = 820.7605, GNorm = 3.9651, lr_0 = 9.9299e-04
Validation binary_cross_entropy = 0.083586
Epoch 9895
Loss = 3.5101e-02, PNorm = 820.7989, GNorm = 0.8745, lr_0 = 9.9299e-04
Validation binary_cross_entropy = 0.089107
Epoch 9896
Loss = 1.0803e-01, PNorm = 820.8392, GNorm = 0.3472, lr_0 = 9.9299e-04
Validation binary_cross_entropy = 0.095278
Epoch 9897
Loss = 3.7925e-02, PNorm = 820.8745, GNorm = 3.9054, lr_0 = 9.9299e-04
Validation binary_cross_entropy = 0.108659
Epoch 9898
Loss = 2.0851e-02, PNorm = 820.9042, GNorm = 1.0014, lr_0 = 9.9299e-04
Validation binary_cross_entropy = 0.118619
Epoch 9899
Loss = 1.8581e-02, PNorm = 820.9262, GNorm = 1.1627, lr_0 = 9.9299e-04
Loss = 3.9793e-02, PNorm = 820.9441, GNorm = 0.2271, lr_0 = 9.9299e-04
Validation binary_cross_entropy = 0.115381
Epoch 9900
Loss = 6.2447e-02, PNorm = 820.9669, GNorm = 1.2438, lr_0 = 9.9299e-04
Validation binary_cross_entropy = 0.113878
Epoch 9901
Loss = 4.3732e-02, PNorm = 821.0138, GNorm = 1.3431, lr_0 = 9.9299e-04
Validation binary_cross_entropy = 0.186338
Epoch 9902
Loss = 5.0722e-02, PNorm = 821.0509, GNorm = 0.7280, lr_0 = 9.9299e-04
Validation binary_cross_entropy = 0.170494
Epoch 9903
Loss = 2.7037e-02, PNorm = 821.0796, GNorm = 1.8616, lr_0 = 9.9298e-04
Validation binary_cross_entropy = 0.149574
Epoch 9904
Loss = 8.2586e-03, PNorm = 821.1018, GNorm = 0.3731, lr_0 = 9.9298e-04
Validation binary_cross_entropy = 0.145491
Epoch 9905
Loss = 3.6880e-02, PNorm = 821.1294, GNorm = 0.1989, lr_0 = 9.9298e-04
Validation binary_cross_entropy = 0.151861
Epoch 9906
Loss = 3.6498e-02, PNorm = 821.1582, GNorm = 0.8146, lr_0 = 9.9298e-04
Validation binary_cross_entropy = 0.121079
Epoch 9907
Loss = 1.1256e-02, PNorm = 821.1882, GNorm = 1.4023, lr_0 = 9.9298e-04
Validation binary_cross_entropy = 0.123432
Epoch 9908
Loss = 1.6118e-01, PNorm = 821.2218, GNorm = 8.0740, lr_0 = 9.9298e-04
Validation binary_cross_entropy = 0.104135
Epoch 9909
Loss = 7.4442e-03, PNorm = 821.2586, GNorm = 0.2686, lr_0 = 9.9298e-04
Loss = 4.9900e-02, PNorm = 821.2987, GNorm = 0.7003, lr_0 = 9.9298e-04
Validation binary_cross_entropy = 0.102470
Epoch 9910
Loss = 3.4182e-02, PNorm = 821.3321, GNorm = 0.8436, lr_0 = 9.9298e-04
Validation binary_cross_entropy = 0.106097
Epoch 9911
Loss = 7.2572e-02, PNorm = 821.3701, GNorm = 0.2645, lr_0 = 9.9298e-04
Validation binary_cross_entropy = 0.096738
Epoch 9912
Loss = 2.0975e-02, PNorm = 821.4147, GNorm = 0.4922, lr_0 = 9.9298e-04
Validation binary_cross_entropy = 0.097912
Epoch 9913
Loss = 1.2109e-02, PNorm = 821.4586, GNorm = 0.4037, lr_0 = 9.9298e-04
Validation binary_cross_entropy = 0.103141
Epoch 9914
Loss = 1.3015e-02, PNorm = 821.4954, GNorm = 0.1031, lr_0 = 9.9298e-04
Validation binary_cross_entropy = 0.112595
Epoch 9915
Loss = 1.8821e-02, PNorm = 821.5245, GNorm = 0.7719, lr_0 = 9.9298e-04
Validation binary_cross_entropy = 0.121477
Epoch 9916
Loss = 1.7862e-02, PNorm = 821.5485, GNorm = 0.2610, lr_0 = 9.9298e-04
Validation binary_cross_entropy = 0.168300
Epoch 9917
Loss = 3.7448e-02, PNorm = 821.5703, GNorm = 0.0990, lr_0 = 9.9297e-04
Validation binary_cross_entropy = 0.129478
Epoch 9918
Loss = 4.0327e-02, PNorm = 821.5862, GNorm = 1.1142, lr_0 = 9.9297e-04
Validation binary_cross_entropy = 0.121445
Epoch 9919
Loss = 5.8346e-02, PNorm = 821.6103, GNorm = 4.7717, lr_0 = 9.9297e-04
Loss = 1.9112e-02, PNorm = 821.6362, GNorm = 3.3775, lr_0 = 9.9297e-04
Validation binary_cross_entropy = 0.133424
Epoch 9920
Loss = 2.1180e-02, PNorm = 821.6562, GNorm = 0.0963, lr_0 = 9.9297e-04
Validation binary_cross_entropy = 0.129384
Epoch 9921
Loss = 1.0415e-02, PNorm = 821.6697, GNorm = 0.8097, lr_0 = 9.9297e-04
Validation binary_cross_entropy = 0.130420
Epoch 9922
Loss = 4.5220e-02, PNorm = 821.6775, GNorm = 4.5073, lr_0 = 9.9297e-04
Validation binary_cross_entropy = 0.109431
Epoch 9923
Loss = 2.3695e-02, PNorm = 821.7056, GNorm = 6.3274, lr_0 = 9.9297e-04
Validation binary_cross_entropy = 0.105845
Epoch 9924
Loss = 9.4314e-03, PNorm = 821.7412, GNorm = 0.1773, lr_0 = 9.9297e-04
Validation binary_cross_entropy = 0.107496
Epoch 9925
Loss = 1.7890e-02, PNorm = 821.7766, GNorm = 0.9532, lr_0 = 9.9297e-04
Validation binary_cross_entropy = 0.120719
Epoch 9926
Loss = 2.6847e-02, PNorm = 821.8112, GNorm = 0.1727, lr_0 = 9.9297e-04
Validation binary_cross_entropy = 0.128213
Epoch 9927
Loss = 5.8154e-02, PNorm = 821.8383, GNorm = 2.0687, lr_0 = 9.9297e-04
Validation binary_cross_entropy = 0.125265
Epoch 9928
Loss = 8.1575e-02, PNorm = 821.8578, GNorm = 3.1467, lr_0 = 9.9297e-04
Validation binary_cross_entropy = 0.116755
Epoch 9929
Loss = 7.4214e-03, PNorm = 821.8762, GNorm = 0.4111, lr_0 = 9.9297e-04
Loss = 2.9889e-02, PNorm = 821.9023, GNorm = 0.1586, lr_0 = 9.9296e-04
Validation binary_cross_entropy = 0.139264
Epoch 9930
Loss = 7.2515e-03, PNorm = 821.9338, GNorm = 1.0452, lr_0 = 9.9296e-04
Validation binary_cross_entropy = 0.140105
Epoch 9931
Loss = 2.7525e-02, PNorm = 821.9754, GNorm = 1.7368, lr_0 = 9.9296e-04
Validation binary_cross_entropy = 0.152540
Epoch 9932
Loss = 3.9980e-02, PNorm = 822.0238, GNorm = 0.1804, lr_0 = 9.9296e-04
Validation binary_cross_entropy = 0.128083
Epoch 9933
Loss = 1.1237e-02, PNorm = 822.0620, GNorm = 1.4364, lr_0 = 9.9296e-04
Validation binary_cross_entropy = 0.117982
Epoch 9934
Loss = 1.1679e-02, PNorm = 822.0978, GNorm = 0.1351, lr_0 = 9.9296e-04
Validation binary_cross_entropy = 0.119161
Epoch 9935
Loss = 4.4335e-02, PNorm = 822.1308, GNorm = 3.1053, lr_0 = 9.9296e-04
Validation binary_cross_entropy = 0.121399
Epoch 9936
Loss = 1.2291e-01, PNorm = 822.1634, GNorm = 6.9663, lr_0 = 9.9296e-04
Validation binary_cross_entropy = 0.126929
Epoch 9937
Loss = 5.7510e-03, PNorm = 822.2134, GNorm = 0.2690, lr_0 = 9.9296e-04
Validation binary_cross_entropy = 0.176632
Epoch 9938
Loss = 1.7899e-02, PNorm = 822.2531, GNorm = 1.8270, lr_0 = 9.9296e-04
Validation binary_cross_entropy = 0.124798
Epoch 9939
Loss = 8.6146e-03, PNorm = 822.2815, GNorm = 0.4886, lr_0 = 9.9296e-04
Loss = 3.8499e-01, PNorm = 822.3443, GNorm = 9.1131, lr_0 = 9.9296e-04
Validation binary_cross_entropy = 0.069035
Epoch 9940
Loss = 1.7862e-01, PNorm = 822.4468, GNorm = 5.6151, lr_0 = 9.9296e-04
Validation binary_cross_entropy = 0.079128
Epoch 9941
Loss = 1.0837e-01, PNorm = 822.5557, GNorm = 5.3002, lr_0 = 9.9296e-04
Validation binary_cross_entropy = 0.106675
Epoch 9942
Loss = 1.7144e-01, PNorm = 822.6433, GNorm = 2.4125, lr_0 = 9.9296e-04
Validation binary_cross_entropy = 0.118508
Epoch 9943
Loss = 6.8812e-02, PNorm = 822.7313, GNorm = 3.7186, lr_0 = 9.9295e-04
Validation binary_cross_entropy = 0.141923
Epoch 9944
Loss = 1.8700e-01, PNorm = 822.8215, GNorm = 3.7963, lr_0 = 9.9295e-04
Validation binary_cross_entropy = 0.109417
Epoch 9945
Loss = 3.3313e-01, PNorm = 822.8938, GNorm = 1.2996, lr_0 = 9.9295e-04
Validation binary_cross_entropy = 0.168365
Epoch 9946
Loss = 7.6022e-01, PNorm = 822.9625, GNorm = 3.7405, lr_0 = 9.9295e-04
Validation binary_cross_entropy = 0.138206
Epoch 9947
Loss = 6.5525e-02, PNorm = 823.0511, GNorm = 4.1319, lr_0 = 9.9295e-04
Validation binary_cross_entropy = 0.118373
Epoch 9948
Loss = 7.6113e-02, PNorm = 823.1210, GNorm = 1.9354, lr_0 = 9.9295e-04
Validation binary_cross_entropy = 0.103385
Epoch 9949
Loss = 6.8937e-02, PNorm = 823.1836, GNorm = 3.9127, lr_0 = 9.9295e-04
Loss = 6.8670e-02, PNorm = 823.2419, GNorm = 2.0040, lr_0 = 9.9295e-04
Validation binary_cross_entropy = 0.107863
Epoch 9950
Loss = 4.7113e-02, PNorm = 823.2978, GNorm = 4.8230, lr_0 = 9.9295e-04
Validation binary_cross_entropy = 0.094088
Epoch 9951
Loss = 3.9965e-02, PNorm = 823.3448, GNorm = 5.6826, lr_0 = 9.9295e-04
Validation binary_cross_entropy = 0.088784
Epoch 9952
Loss = 2.8100e-02, PNorm = 823.3947, GNorm = 0.0370, lr_0 = 9.9295e-04
Validation binary_cross_entropy = 0.115534
Epoch 9953
Loss = 2.1904e-02, PNorm = 823.4316, GNorm = 0.4936, lr_0 = 9.9295e-04
Validation binary_cross_entropy = 0.150728
Epoch 9954
Loss = 6.7107e-02, PNorm = 823.4716, GNorm = 6.2068, lr_0 = 9.9295e-04
Validation binary_cross_entropy = 0.110668
Epoch 9955
Loss = 1.7841e-02, PNorm = 823.5133, GNorm = 1.2833, lr_0 = 9.9295e-04
Validation binary_cross_entropy = 0.098426
Epoch 9956
Loss = 9.6432e-03, PNorm = 823.5575, GNorm = 0.5031, lr_0 = 9.9295e-04
Validation binary_cross_entropy = 0.105352
Epoch 9957
Loss = 1.5734e-01, PNorm = 823.6007, GNorm = 7.8321, lr_0 = 9.9294e-04
Validation binary_cross_entropy = 0.091298
Epoch 9958
Loss = 7.0042e-02, PNorm = 823.6395, GNorm = 2.2874, lr_0 = 9.9294e-04
Validation binary_cross_entropy = 0.078303
Epoch 9959
Loss = 1.4606e-02, PNorm = 823.6802, GNorm = 0.5686, lr_0 = 9.9294e-04
Loss = 6.2733e-02, PNorm = 823.7345, GNorm = 1.9172, lr_0 = 9.9294e-04
Validation binary_cross_entropy = 0.095995
Epoch 9960
Loss = 5.6013e-02, PNorm = 823.7913, GNorm = 0.8406, lr_0 = 9.9294e-04
Validation binary_cross_entropy = 0.100910
Epoch 9961
Loss = 1.1453e-02, PNorm = 823.8353, GNorm = 0.9934, lr_0 = 9.9294e-04
Validation binary_cross_entropy = 0.115974
Epoch 9962
Loss = 5.6429e-02, PNorm = 823.8608, GNorm = 0.9253, lr_0 = 9.9294e-04
Validation binary_cross_entropy = 0.095010
Epoch 9963
Loss = 2.2977e-02, PNorm = 823.9056, GNorm = 1.0558, lr_0 = 9.9294e-04
Validation binary_cross_entropy = 0.118983
Epoch 9964
Loss = 1.5063e-02, PNorm = 823.9478, GNorm = 0.2780, lr_0 = 9.9294e-04
Validation binary_cross_entropy = 0.114437
Epoch 9965
Loss = 2.4615e-02, PNorm = 823.9737, GNorm = 1.1505, lr_0 = 9.9294e-04
Validation binary_cross_entropy = 0.107756
Epoch 9966
Loss = 4.3924e-02, PNorm = 824.0224, GNorm = 0.1385, lr_0 = 9.9294e-04
Validation binary_cross_entropy = 0.072140
Epoch 9967
Loss = 1.3746e-01, PNorm = 824.2359, GNorm = 9.9109, lr_0 = 9.9294e-04
Validation binary_cross_entropy = 0.256054
Epoch 9968
Loss = 2.7089e-01, PNorm = 824.4271, GNorm = 8.5956, lr_0 = 9.9294e-04
Validation binary_cross_entropy = 0.224440
Epoch 9969
Loss = 1.1525e-01, PNorm = 824.5650, GNorm = 5.3045, lr_0 = 9.9294e-04
Loss = 2.1762e-01, PNorm = 824.6468, GNorm = 2.9227, lr_0 = 9.9293e-04
Validation binary_cross_entropy = 0.161426
Epoch 9970
Loss = 1.6223e-01, PNorm = 824.7185, GNorm = 9.2101, lr_0 = 9.9293e-04
Validation binary_cross_entropy = 0.170176
Epoch 9971
Loss = 1.4429e-01, PNorm = 824.7858, GNorm = 1.8741, lr_0 = 9.9293e-04
Validation binary_cross_entropy = 0.225317
Epoch 9972
Loss = 2.1126e-01, PNorm = 824.8471, GNorm = 2.6947, lr_0 = 9.9293e-04
Validation binary_cross_entropy = 0.121957
Epoch 9973
Loss = 2.4621e-01, PNorm = 824.9319, GNorm = 13.6163, lr_0 = 9.9293e-04
Validation binary_cross_entropy = 0.224583
Epoch 9974
Loss = 3.0739e-01, PNorm = 825.0365, GNorm = 14.2748, lr_0 = 9.9293e-04
Validation binary_cross_entropy = 0.172855
Epoch 9975
Loss = 3.4586e-01, PNorm = 825.1273, GNorm = 31.5756, lr_0 = 9.9293e-04
Validation binary_cross_entropy = 0.331715
Epoch 9976
Loss = 2.0225e-01, PNorm = 825.2149, GNorm = 8.3828, lr_0 = 9.9293e-04
Validation binary_cross_entropy = 0.227176
Epoch 9977
Loss = 1.5004e-01, PNorm = 825.2917, GNorm = 3.0087, lr_0 = 9.9293e-04
Validation binary_cross_entropy = 0.383154
Epoch 9978
Loss = 3.0554e-01, PNorm = 825.3620, GNorm = 22.9524, lr_0 = 9.9293e-04
Validation binary_cross_entropy = 0.095101
Epoch 9979
Loss = 4.4055e-02, PNorm = 825.4223, GNorm = 2.4299, lr_0 = 9.9293e-04
Loss = 1.1172e-01, PNorm = 825.4923, GNorm = 1.9249, lr_0 = 9.9293e-04
Validation binary_cross_entropy = 0.218013
Epoch 9980
Loss = 1.4941e-01, PNorm = 825.5352, GNorm = 21.6235, lr_0 = 9.9293e-04
Validation binary_cross_entropy = 0.178344
Epoch 9981
Loss = 1.3797e-01, PNorm = 825.5847, GNorm = 3.0787, lr_0 = 9.9293e-04
Validation binary_cross_entropy = 0.129911
Epoch 9982
Loss = 1.3242e-01, PNorm = 825.6375, GNorm = 6.3532, lr_0 = 9.9293e-04
Validation binary_cross_entropy = 0.112036
Epoch 9983
Loss = 7.0872e-02, PNorm = 825.6883, GNorm = 2.5552, lr_0 = 9.9292e-04
Validation binary_cross_entropy = 0.174245
Epoch 9984
Loss = 7.3757e-02, PNorm = 825.7346, GNorm = 3.3606, lr_0 = 9.9292e-04
Validation binary_cross_entropy = 0.078497
Epoch 9985
Loss = 1.0643e-01, PNorm = 825.7935, GNorm = 8.3590, lr_0 = 9.9292e-04
Validation binary_cross_entropy = 0.173149
Epoch 9986
Loss = 1.5762e-01, PNorm = 825.8574, GNorm = 7.3801, lr_0 = 9.9292e-04
Validation binary_cross_entropy = 0.128614
Epoch 9987
Loss = 7.7510e-02, PNorm = 825.8974, GNorm = 3.0335, lr_0 = 9.9292e-04
Validation binary_cross_entropy = 0.101228
Epoch 9988
Loss = 4.5056e-02, PNorm = 825.9487, GNorm = 3.1571, lr_0 = 9.9292e-04
Validation binary_cross_entropy = 0.169534
Epoch 9989
Loss = 9.9528e-02, PNorm = 825.9985, GNorm = 3.1739, lr_0 = 9.9292e-04
Loss = 7.5221e-02, PNorm = 826.0338, GNorm = 1.3067, lr_0 = 9.9292e-04
Validation binary_cross_entropy = 0.107541
Epoch 9990
Loss = 5.3182e-02, PNorm = 826.0690, GNorm = 0.2347, lr_0 = 9.9292e-04
Validation binary_cross_entropy = 0.098351
Epoch 9991
Loss = 3.1982e-02, PNorm = 826.1088, GNorm = 1.0555, lr_0 = 9.9292e-04
Validation binary_cross_entropy = 0.132179
Epoch 9992
Loss = 4.9123e-02, PNorm = 826.1410, GNorm = 1.5642, lr_0 = 9.9292e-04
Validation binary_cross_entropy = 0.117320
Epoch 9993
Loss = 9.7493e-02, PNorm = 826.1746, GNorm = 17.8016, lr_0 = 9.9292e-04
Validation binary_cross_entropy = 0.183845
Epoch 9994
Loss = 1.6329e-01, PNorm = 826.2262, GNorm = 11.0107, lr_0 = 9.9292e-04
Validation binary_cross_entropy = 0.135885
Epoch 9995
Loss = 4.9999e-02, PNorm = 826.2762, GNorm = 2.3992, lr_0 = 9.9292e-04
Validation binary_cross_entropy = 0.130634
Epoch 9996
Loss = 3.6654e-02, PNorm = 826.3288, GNorm = 2.7709, lr_0 = 9.9291e-04
Validation binary_cross_entropy = 0.093031
Epoch 9997
Loss = 1.9521e-01, PNorm = 826.4169, GNorm = 22.6411, lr_0 = 9.9291e-04
Validation binary_cross_entropy = 0.247806
Epoch 9998
Loss = 1.6487e-01, PNorm = 826.5177, GNorm = 8.6092, lr_0 = 9.9291e-04
Validation binary_cross_entropy = 0.105004
Epoch 9999
Loss = 1.9228e-01, PNorm = 826.5933, GNorm = 7.9617, lr_0 = 9.9291e-04
Loss = 1.9903e-01, PNorm = 826.6656, GNorm = 6.6641, lr_0 = 9.9291e-04
Validation binary_cross_entropy = 0.143201
Model 0 best validation binary_cross_entropy = 0.028044 on epoch 452
Loading pretrained parameter "encoder.encoder.0.cached_zero_vector".
Loading pretrained parameter "encoder.encoder.0.W_i.weight".
Loading pretrained parameter "encoder.encoder.0.W_h.weight".
Loading pretrained parameter "encoder.encoder.0.W_o.weight".
Loading pretrained parameter "encoder.encoder.0.W_o.bias".
Loading pretrained parameter "ffn.1.weight".
Loading pretrained parameter "ffn.1.bias".
Loading pretrained parameter "ffn.4.weight".
Loading pretrained parameter "ffn.4.bias".
Loading pretrained parameter "ffn.7.weight".
Loading pretrained parameter "ffn.7.bias".
Moving model to cuda
Model 0 test binary_cross_entropy = 0.026217
Ensemble test binary_cross_entropy = 0.026217
1-fold cross validation
	Seed 0 ==> test binary_cross_entropy = 0.026217
Overall test binary_cross_entropy = 0.026217 +/- 0.000000
Elapsed time = 3 days, 5:19:16
